Difference between revisions of "Crunchbase 2013 Snapshot"

From edegan.com

Revision as of 15:34, 9 March 2017

Retrieval

The data was retrieved by Shrey and Matthew. (To do: state how and from where.)

Content

The snapshot contained two .tar.gz files, which were extracted into 181/crunchbase using the command

  tar -zxvf file.tar.gz
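
The same extraction can be scripted, which also records what each archive contained. The following is a minimal sketch using Python's standard tarfile module; the function name extract_archive and its return value are illustrative assumptions, not part of the original workflow.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir):
    """Extract a gzip-compressed tar archive.

    Returns a dict mapping each regular file in the archive to its size in bytes.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    sizes = {}
    with tarfile.open(archive_path, mode="r:gz") as archive:
        # Record name and size of every regular file before unpacking.
        for member in archive.getmembers():
            if member.isfile():
                sizes[member.name] = member.size
        archive.extractall(dest)
    return sizes
```

For example, extract_archive("odm.csv.tar.gz", "181/crunchbase") would unpack the second archive into the directory named above, assuming both paths are relative to the current working directory.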

The files and their contents are listed below. The number after each file name is presumably its size in bytes.

crunchbase_2013_snapshot_mysql.tar.gz

  • license.txt 526
  • cb_objects.sql 338955612
  • cb_offices.sql 14850092
  • cb_people.sql 13253952
  • cb_ipos.sql 178397
  • cb_milestones.sql 10498840
  • cb_funds.sql 385010
  • cb_relationships.sql 48655529
  • cb_degrees.sql 13829471
  • cb_investments.sql 6185134
  • cb_acquisitions.sql 2309393
  • cb_funding_rounds.sql 14681705
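
To confirm that an extraction is complete, the sizes on disk can be compared with the numbers listed above. This is a rough sketch: it assumes the numbers are byte counts, and the names EXPECTED_SIZES and find_size_mismatches are hypothetical.

```python
from pathlib import Path

# Copied from the listing above; treating these as byte counts is an assumption.
EXPECTED_SIZES = {
    "license.txt": 526,
    "cb_objects.sql": 338955612,
    "cb_offices.sql": 14850092,
    "cb_people.sql": 13253952,
    "cb_ipos.sql": 178397,
    "cb_milestones.sql": 10498840,
    "cb_funds.sql": 385010,
    "cb_relationships.sql": 48655529,
    "cb_degrees.sql": 13829471,
    "cb_investments.sql": 6185134,
    "cb_acquisitions.sql": 2309393,
    "cb_funding_rounds.sql": 14681705,
}


def find_size_mismatches(directory, expected=EXPECTED_SIZES):
    """Return {file name: (expected size, actual size)} for every discrepancy.

    The actual size is None when the file is missing from the directory.
    """
    mismatches = {}
    for name, expected_size in expected.items():
        path = Path(directory) / name
        actual = path.stat().st_size if path.is_file() else None
        if actual != expected_size:
            mismatches[name] = (expected_size, actual)
    return mismatches
```

An empty result from find_size_mismatches("181/crunchbase") would indicate that every listed file is present with the expected size, provided the archive unpacks its files directly into that directory.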

odm.csv.tar.gz

  • organizations.csv 212013301
    • Fields:
      • crunchbase_uuid
      • type
      • primary_role
      • name
      • crunchbase_url
      • homepage_domain
      • homepage_url
      • profile_image_url
      • facebook_url
      • twitter_url
      • linkedin_url
      • stock_symbol
      • location_city
      • location_region
      • location_country_code
      • short_description
    • 459916 records
  • people.csv 188924229
    • Fields:
      • crunchbase_uuid
      • type
      • first_name
      • last_name
      • crunchbase_url
      • profile_image_url
      • facebook_url
      • twitter_url
      • linkedin_url
      • location_city
      • location_region
      • location_country_code
      • title
      • organization
      • organization_crunchbase_url
    • 521634 records
  • crunchbase_license.txt 487
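
The field lists and record counts above can be checked against the CSV files themselves. The sketch below uses Python's standard csv module and makes three assumptions that this page does not confirm: each file begins with a header row, the files are UTF-8 encoded, and the record counts exclude the header. The names ORGANIZATION_FIELDS and inspect_csv are illustrative.

```python
import csv

# Field names as listed above for organizations.csv.
ORGANIZATION_FIELDS = [
    "crunchbase_uuid", "type", "primary_role", "name", "crunchbase_url",
    "homepage_domain", "homepage_url", "profile_image_url", "facebook_url",
    "twitter_url", "linkedin_url", "stock_symbol", "location_city",
    "location_region", "location_country_code", "short_description",
]


def inspect_csv(path, expected_fields):
    """Return (header_matches, record_count) for a CSV file with a header row.

    Records are counted with csv.reader rather than by counting lines, so a
    quoted field that contains a line break still counts as one record.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])
        record_count = sum(1 for _ in reader)
    return header == expected_fields, record_count
```

Under those assumptions, inspect_csv("181/crunchbase/organizations.csv", ORGANIZATION_FIELDS) should return (True, 459916) if the file matches the listing; the path is a guess at where the extracted file ends up.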