Changes

Jump to navigation Jump to search
='''Veeral's Summer Work'''=
==WHAT I'VE DONE==
'''1)''' Used the 2013 Crunchbase Snapshot information to find more accelerators using keyword matching and manual researching/googling. Ended up with ~70 new accelerators which were all added to the current list
 
'''2)''' Cohorts were manually obtained for each new accelerator and saved under (E:\McNair\Projects\Accelerators\Data) in the form [Accelerator Name].cohort.txt
 
'''3)''' All new accelerators and corresponding cohorts were added to Cleaned Cohort Data.xls spreadsheet in a new sheet called "Veeral - Updated"
 
'''4)''' Crawled through the Global Accelerator Network (GAN) site to obtain all of the GAN data. The parser, input, and output is located in (E:\McNair\Projects\Accelerators\GAN_Data)
 
'''5)''' Used the Crunchbase "Organizations" data and Whois parser to put together a comprehensive Textfile with all of our current accelerators and information on them (like URL, Location, Creation Date) located in (E:\McNair\Projects\Accelerators\Veeral\Accelerator_Data)
 
'''6)''' Matched existing SDC Platinum VC funding data (located in E:\McNair\Projects\Accelerators\VC Data) with Updated Cohort Data using the Matcher to obtain the Updated AccCo_VC matched file.
 
'''7)''' Copied the Updated AccCo_VC matched file and the Updated Cohort data textfile into the Z:\Accelerators database location.
 
==NEXT STEPS==
 
'''1)''' Calculate the Percent VC funding rates for newly updated accelerator cohort data.
 
'''2)''' Find a way to obtain more variables for the current list of accelerators.
*POTENTIAL VARIABLES WE WANT:
**Company Type (i.e. Corporate, University, etc)
**Industry (i.e. Health, High-Tech, Food, etc)
**Equity
**Cohort size
**Seed Capital
**Employees
**ANY MORE YOU CAN FIND THAT MAY BE STATISTICALLY SIGNIFICANT
 
'''3)''' WRITE PAPERS
==All New Files and what they Contain==
'''Accelerator Data'''
(Located in E:\McNair\Projects\Accelerators\Veeral)
Accelerator Data (TXT) - list of all Accelerators in Updated Cohort Data and other collected Accelerator characteristics. We have the cohort txt files (Located in Data folder; called "Accelerator Name".cohort) for every Accelerator in this list.
=='''SQL Data for acquiring VC funding rates=='''
*(Located in Z:\Accelerators)*(Instructions for using SQL are located in E:\McNair\Projects\Accelerators\SQL_Data under "accelerator sql V")*(Database is called "Accelerators")
Updated_AccCo_VC (TXT) - newer version of AccCo_VC
Updated_Cohort_Data (TXT) - newer version of Cohort_Data
'''GAN Data'''(Located in E:\McNair\Projects\Accelerators\GAN Data) ==Complete Completing Master List of Accelerators(Process)==
(Note: all files are found and stored under E:\McNair\Projects\Accelerators)
'''RESULTS'''
New AccCO_VC Match file - (E:\McNair\Projects\Accelerators\Veeral\Updated AccCo_VC)
'''1.''' Cleaned Cohort Data COMPLETED MASTER LIST - (E:\McNair\Projects\Accelerators\Cleaned Cohort Data.xls)*Updated 7-12-2017 sheet - Most up to date Accelerator cohort data*Updated sheet - Accelerator cohort data minus the new 70~ Crunchbase 2013-snapshot-obtained accelerators '''2.''' Updated Cleaned Cohort Data 7-12-2017 - (E:\McNair\ProjectsVeeral\Accelerators\Updated Cleaned Cohort Data 7-12-2017.txtAccelerator_Data)
==Global Accelerator Network Parser Spec==
383

edits

Navigation menu