Changes

Jump to navigation Jump to search
804 bytes added ,  13:41, 21 September 2020
no edit summary
{{Project|Has project output=Data|Has sponsor=McNair ProjectsCenter
|Has title=Composite Accelerator Data
|Has owner=Matthew Ringheanu, Shrey Agarwal,
|Has keywords=Accelerator, Data
|Has notes=Continuation of [Accelerator Seed List (Data)]
|Has project status=ActiveSubsume
|Is dependent on=Accelerator Seed List (Data),
}}
 
=Relevant Files=
==Location for All Relevant Files==
==List of All Relevant Files==
*'''Original Search'''
**'''List of Preliminary Accelerators'''***Original Location: [[Accelerator Seed List (Data)]]***Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.***Variables: Names of potential accelerators
**'''accelerator_data_noflag'''***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data***Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.***Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes
*'''Cohort Directory "Big Push"'''
**'''Data'''***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\***Description: This folder contains files for each of the accelerators that we searched through from the "List of Preliminary Accelerators". There are three files per accelerator: 1) The "accelerator name.txt" file which contains each of the variables recorded by all of the McNair Center workers during our big push on the project winter 2016, 2) The .html file for the cohort page if the entry was indeed an accelerator and if the worker could find the cohort page on that accelerator, and 3) a "accelerator name.cohort.txt" file which contains a list of the cohort companies as well as all variables which were easily found alongside the cohort.
**'''List of Python files'''***'''parse_accelerator_data'''***'''parse_cohort_data'''***'''process_locations'''***'''wayback_machine'''***Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data***Description: These files contain the code which Peter used to categorize the data from the "Data Copy" folder in Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\, which is just a copy of our cleaned data file. From this code, Peter returned for us a list of accelerators categorized by their flag and a compiled list of all the cohort companies as well as the variables recorded by McNair workers.**'''Note''': We manually altered the cohort data which came out of Peter's code so that we could homogenize the formatting. This resulted in a unique cohort file which will not be replicated when running the code again. On the other hand, we manually altered the individual txt files for the accelerators to fix format so running Peter's code again should result in a similar file. *'''Cleaned Cohort Data'''**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year
**'''Cleaned Cohort DataFirst_Incomplete_PercentVC_Table'''***Original Location: Bulk(EZ:)\McNair\Projects\Accelerators\Fall 2017***Description: This Excel file contains all data on all cohort companies The VC percentage raise rate for our entire list of current 198 accelerators. All At this point we realized we were missing almost 100 accelerators were updated by Veeral and , so we have used this as decided to expand our final list of cohort companies for all acceleratorsand gather more data.***Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, ProgramNumber of Cohort Companies, Number of VC Backed CohortCompanies, YearRaise rate percentage
*'''Refining the List'''
*'''New Crunchbase Accelerators'''
**Variables: Accelerator name, Whois parser code
*'''Additional Variables'''
*'''Accelerator_Cohort_Companies'''

Navigation menu