<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://www.edegan.com/mediawiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Shrey</id>
	<title>edegan.com - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="http://www.edegan.com/mediawiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Shrey"/>
	<link rel="alternate" type="text/html" href="http://www.edegan.com/wiki/Special:Contributions/Shrey"/>
	<updated>2026-05-21T18:34:54Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.34.2</generator>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=22709</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=22709"/>
		<updated>2018-03-29T20:46:43Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* Current Work */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu, Veeral Shah,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has keywords=Accelerators,Data&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Current Work=&lt;br /&gt;
Google Master Sheet: https://docs.google.com/spreadsheets/d/1ikuxYwp9JIRrjz4qQcbdwTpbHOne-q2PterYTjzofjw/edit?ts=5aa2f1f9#gid=0&lt;br /&gt;
*Cross-reference sheet with data from Peter's old accelerator consolidation file (&amp;quot;accelerator_data_noflag&amp;quot; and &amp;quot;accelerator_data&amp;quot; in &amp;quot;All Relevant Files&amp;quot;) and fill in missing data&lt;br /&gt;
*Variables that are 100% NOT in these 2 files:&lt;br /&gt;
**Cohort Breakout?&lt;br /&gt;
**Subtype&lt;br /&gt;
**Designed for Students?&lt;br /&gt;
**Campuses&lt;br /&gt;
**Stage&lt;br /&gt;
**Software Tech&lt;br /&gt;
**What stage do they look for?&lt;br /&gt;
&lt;br /&gt;
TODO:&lt;br /&gt;
 McNair/Projects/Accelerators/Fall 2017/unfound_founders.txt&lt;br /&gt;
A 0 means we don't have founder data for that accelerator.&lt;br /&gt;
Specs: A tab delimited text file with the following fields:&lt;br /&gt;
 Accelerator   First Name   Last Name   LinkedInURL(if possible)&lt;br /&gt;
Getting the LinkedInURL will ensure accuracy, but will work without it.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Shrey: Find &amp;quot;demo day&amp;quot; keywords, so that we can search AcceleratorName Year Keyword and get back potential demo day pages&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Accelerator Type project==&lt;br /&gt;
&lt;br /&gt;
File to edit is called &amp;quot;Accelerator type list&amp;quot;.  Located in the folder E:\McNair\Projects\Accelerators\Spring 2018\Grouping project of ListOfAccs.  More systematic information and instructions are in&amp;quot;Instructions for Accelerator type project&amp;quot; in E:\McNair\Projects\Accelerators\Spring 2018\Grouping project of ListOfAccs.&lt;br /&gt;
&lt;br /&gt;
NOTE: until we get through all 270 accelerators, we will just categorize each accelerator into the following three categories as quickly as possible with short notes in teh &amp;quot;other info&amp;quot; column for these; once we have this, we will go back through the ones that aren't categorized and add notes to the &amp;quot;other info&amp;quot; column.   &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Type list:&lt;br /&gt;
*Private&lt;br /&gt;
*Corporate&lt;br /&gt;
*Academic&lt;br /&gt;
 Note: if DEAD, noted here.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Other info:&lt;br /&gt;
*nonprofit? (y/n)&lt;br /&gt;
&lt;br /&gt;
*Subtype abbreviations:&lt;br /&gt;
**S: for if a social entrepreneurship initiative&lt;br /&gt;
**I: for if an incubator&lt;br /&gt;
**A: for an angel group&lt;br /&gt;
**F: for foreign&lt;br /&gt;
**C: for in coworking space/hub/etc&lt;br /&gt;
**V: for if part of venture fund&lt;br /&gt;
**G: for if government funded/partnered&lt;br /&gt;
**T: for international&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 Note: subtypes (from individual text files in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data) were only found for 23 of the 270 accelerators.  These accelerators were initially intended to be removed from the master list.  Remaining subtypes are currently being added.&lt;br /&gt;
&lt;br /&gt;
other info: &lt;br /&gt;
&lt;br /&gt;
international offices, founders, industries, org type, program duration, or other interesting, easily accessed variables.  Additional information is especially important for accelerators that have no other subtype abbreviation listed.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Steps to research an accelerator===&lt;br /&gt;
&lt;br /&gt;
1. Copy/paste URL listed in Accelerator type list file into google.  If website is insufficient, try googling:&lt;br /&gt;
 the name of the accelerator&lt;br /&gt;
 the name of the accelerator + &amp;quot;crunchbase&amp;quot;&lt;br /&gt;
 the name of the accelerator + &amp;quot;nonprofit&amp;quot; &lt;br /&gt;
&lt;br /&gt;
the above steps sometimes lead to other helpful databases/news articles&lt;br /&gt;
&lt;br /&gt;
2. Note whether: &lt;br /&gt;
 1) Academic/Corporate/Private &lt;br /&gt;
 2) For Profit/Nonprofit.  Sometimes this isn't directly stated but can be inferred through their description of, say their investment process.  If they don't address this at all it's probably For Profit. &lt;br /&gt;
 3) subtype (S, I, A, F, C, V, G, T).  &lt;br /&gt;
 4) Additional, easily-accessed info.  Number 4 is really important if there's no subtype. &lt;br /&gt;
&lt;br /&gt;
All 270 need to be done by the end of the semester.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Type list file saved as &lt;br /&gt;
 &amp;quot;Accelerator type list&amp;quot; in E:\McNair\Projects\Accelerators\Spring 2017\Grouping project of ListOfAccs.&lt;br /&gt;
The list of ListofAccs, from which we drew Accelerator type list, should have no matches with any of the flagged accelerators in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data.  There are 23 matches though.  So all subtypes must be searched and entered manually.  Whether some were a nonprofit was listed in E:\McNair\Projects\Accelerators\Spring 2017\Grouping project of ListOfAccs, called &amp;quot;whether nonprofit...&amp;quot;.  Accelerators with no info there on whether nonprofit need to have info entered manually.&lt;br /&gt;
&lt;br /&gt;
=Funded By Accelerators=&lt;br /&gt;
&lt;br /&gt;
Reference the like-named portion in [[Crunchbase Data#Funded by Accelerators|Crunchbase Data]]&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in, as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC-backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==MATCHING==&lt;br /&gt;
&lt;br /&gt;
The files we used to match are located in the E drive. We used the matcher to match our portfolio company names from the cohort file located in E:\McNair\Projects\Accelerators. &lt;br /&gt;
*The files used to matching are located E:\McNair\Projects\Accelerators\Matcher&lt;br /&gt;
*Portco is the name of the companies pulled from the cohort file&lt;br /&gt;
*AccCo includes both the cohort company name, along with the name of the accelerator itself&lt;br /&gt;
*In the matcher, the inputs are the PortCo names, as well as the VC data from our pull in SDC&lt;br /&gt;
*The outputs include the AccCo_VC data located in E:\McNair\Projects\Accelerators which give a lot of information on the matches, including:&lt;br /&gt;
:*name of the match itself&lt;br /&gt;
:*number of investments&lt;br /&gt;
:*dates that the company received its investments&lt;br /&gt;
&lt;br /&gt;
==SDC Pull==&lt;br /&gt;
&lt;br /&gt;
We accessed SDC platinum and pulled information on round-based funding that all registered companies received from between the years 1999 to 2017.&lt;br /&gt;
&lt;br /&gt;
The receipt is as follows:&lt;br /&gt;
&lt;br /&gt;
Session Details&lt;br /&gt;
---------------&lt;br /&gt;
Request   Hits    Request Description&lt;br /&gt;
   0        -     DATABASE: Portfolio Companies (VIPC)&lt;br /&gt;
   1     96155    Venture Related Deals: Select All Venture Related Deals&lt;br /&gt;
   2     79572    Round Date: 1/1/1999 to 3/1/2017 (Custom) (Calendar)&lt;br /&gt;
   3              Custom Report: VC Data (Columnar) - Save As:&lt;br /&gt;
                  E:\McNair\Projects\Accelerators\VC Data.txt&lt;br /&gt;
�&lt;br /&gt;
Billing Ref # : 2054025&lt;br /&gt;
Capture File  : riceuniv.2054025&lt;br /&gt;
Session Name  : &lt;br /&gt;
&lt;br /&gt;
The VC data pull includes the following variables: &lt;br /&gt;
&lt;br /&gt;
Company Name                                                           Date Company      Date Company      Company        Company City                           Company Street Address, Line 1               Company Street Address, Line 2            Total Known     Company Industry Sub-Group 3                              Company Industry Major Group     Round          Company Stage Level 3     Round Amt,       Round Amt,&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/ (Does not work anymore)&lt;br /&gt;
&lt;br /&gt;
https://data.crunchbase.com/v3/docs/using-the-api (Has new instructions for application)&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]], Crunchbase, and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4 (Does not work - seems to be replaced by https://angel.co/companies?company_types[]=Incubator )&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
*Crunchbase has its own webpage with instructions for how we retrieve the data&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==SOURCE: Crunchbase==&lt;br /&gt;
*All of the information for the Crunchbase documentation is located in the page [[Crunchbase 2013 Snapshot]] webpage, along with the documentation for how we determined the accelerator information.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;br /&gt;
&lt;br /&gt;
=Kauffman Foundation Incubator Proposal Information=&lt;br /&gt;
&lt;br /&gt;
==Institutions==&lt;br /&gt;
Summary: F6S, Crunchbase, seed-db&lt;br /&gt;
&lt;br /&gt;
Tools: Matcher - used to match lists of potential accelerators with our current list to identify duplicates/new matches (E:\McNair\Projects\Accelerators)&lt;br /&gt;
&lt;br /&gt;
===F6S===&lt;br /&gt;
F6S WebCrawler and F6S Parser - E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
===CrunchBase===&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(All Organizations)'''- E:\McNair\Projects\Accelerators\organizations.xls&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(Potential Accelerators)'''- E:\McNair\Projects\Accelerators\organizations.accdb under &amp;quot;Potential Accelerators query&amp;quot; &lt;br /&gt;
&lt;br /&gt;
*Obtained using keyword matches in the descriptions of the potential accelerators.&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(New Verified Accelerators)''' - E:\McNair\Projects\Accelerators\New CrunchBase Accelerators.xls&lt;br /&gt;
&lt;br /&gt;
We have the Crunchbase 2013 Snapshot which provided lots of new data on accelerators and incubators but we would love to use the Crunchbase API to get a current database snapshot that we could use to cross reference companies and add newly formed accelerator and incubator companies.&lt;br /&gt;
&lt;br /&gt;
===AngelList===&lt;br /&gt;
&lt;br /&gt;
===seed-db===&lt;br /&gt;
&lt;br /&gt;
Obtained through www.seed.db/accelerators&lt;br /&gt;
&lt;br /&gt;
===Global Accelerator Network (GAN)===&lt;br /&gt;
&lt;br /&gt;
GAN Parser- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\scrapeaccel.py&lt;br /&gt;
&lt;br /&gt;
GAN Data- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\GAN Accelerator Data&lt;br /&gt;
*Contains: Company Name, # of Companies Range, % of Companies Funded, Funding Raised by Companies, Employee Range, Exit Funding, Exit Date, Total Company Funding Raised, # of Mentors Range, % Equity, Location, Minimum Seed Capital Investment&lt;br /&gt;
&lt;br /&gt;
==Cohorts==&lt;br /&gt;
&lt;br /&gt;
*Cohorts obtained manually&lt;br /&gt;
*All Cohort txt files are saved under &amp;quot;E:\McNair\Projects\Accelerators\Data  &lt;br /&gt;
*cohort file name = (accelerator name).cohort&lt;br /&gt;
*Most updated Accelerator cohort data: E:\McNair\Projects\Accelerators\Cleaned Cohort Data.xls&lt;br /&gt;
&lt;br /&gt;
Automation for obtaining cohorts??&lt;br /&gt;
&lt;br /&gt;
==Other Information==&lt;br /&gt;
Summary: Whois Parser, Geocode, Tools to determine industry, etc&lt;br /&gt;
&lt;br /&gt;
===Whois Parser===&lt;br /&gt;
&lt;br /&gt;
*Retrieves and parses Whois information. Specifically, takes a file with a column of domain names and populates the corresponding columns with information from the WhoIs API.&lt;br /&gt;
&lt;br /&gt;
*Often used to obtain locations.&lt;br /&gt;
&lt;br /&gt;
===Geocode===&lt;br /&gt;
&lt;br /&gt;
Input: Company Address&lt;br /&gt;
Output: Directional Coordinates&lt;br /&gt;
&lt;br /&gt;
*Used to obtain the locations of different Accelerators and Cohort companies.&lt;br /&gt;
&lt;br /&gt;
===SDC Platinum Pull===&lt;br /&gt;
&lt;br /&gt;
Used to obtain funding information and match companies that have gotten funding with companies that are Accelerator cohorts.&lt;br /&gt;
&lt;br /&gt;
===Desired Information/Variables===&lt;br /&gt;
&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and&lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
&lt;br /&gt;
==Desired Tools/Information==&lt;br /&gt;
&lt;br /&gt;
===Automating the Process of Obtaining Cohorts===&lt;br /&gt;
*Automating this process would save a lot of time and really progress the project.&lt;br /&gt;
&lt;br /&gt;
===Obtaining More Details on Accelerators===&lt;br /&gt;
&lt;br /&gt;
*Having the kind of thorough information on industry, companies, funding, location, exits, mentors, leadership,  that we got for the GAN companies would be fantastic.&lt;br /&gt;
&lt;br /&gt;
===List of Alive/Dead Accelerators===&lt;br /&gt;
&lt;br /&gt;
This is a dream but would be very helpful&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=22660</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=22660"/>
		<updated>2018-02-27T22:40:37Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Spring 2018===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Shrey Agarwal]] [[Work Logs]] [[Shrey Agarwal (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
1/23/18 15:00 - 17:00&lt;br /&gt;
*Became reacclimatized with the project, spoke with Ed about the direction for the rest of the semester&lt;br /&gt;
1/25/18 15:00 - 17:00&lt;br /&gt;
*Began examining the data on pulled webpages relating to demo days&lt;br /&gt;
1/26/18 13:00 - 17:00&lt;br /&gt;
*Began categorizing demo day pages based on: 1) relevance to accelerators, 2) relevance to the particular accelerator (got to 200)&lt;br /&gt;
1/30/18 15:00 - 17:00&lt;br /&gt;
*Continued working through the demo day pages, spoke with Ed about using the data to work a better set (got to 450)&lt;br /&gt;
2/01/18 15:00 - 17:00&lt;br /&gt;
*Finished the match and created pivot tables to count the number of repetitions (companies going through more than one accelerator)&lt;br /&gt;
2/06/18 15:00 - 17:00&lt;br /&gt;
*Discussed with Matthew the best way to collect the VC data from the repetitions. We tried different matches through our SDC data to no avail&lt;br /&gt;
2/08/18 15:00 - 18:00&lt;br /&gt;
*Continued attempting to match with SDC the different columns. Didn't work without separating the data into individual files, a very tedious process.&lt;br /&gt;
2/13/18 15:00 - 17:00&lt;br /&gt;
*Spoke with Ed about incubators project, will begin as soon as we can time the accelerator startup investments. Ed is expecting us to begin sometime in the next two months, using a similar process as we did for incubators. The process should be handled by a new worker.&lt;br /&gt;
2/15/18 15:00 - 17:00&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the CrunchBase database on SQL and brushed up on SQL code.&lt;br /&gt;
2/16/18 13:00 - 17:00&lt;br /&gt;
*Sifted through the database for Crunchbase investment information.&lt;br /&gt;
2/20/18 15:00 - 17:00&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
2/22/18 15:00 - 18:00&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
2/27/18 15:00 - 17:00&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Fall 2017===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
9/19/17 15:00 - 17:00&lt;br /&gt;
*Became reacclimatized with the project, spoke with Ed about the direction for the rest of the semester&lt;br /&gt;
9/20/17 15:00 - 17:00&lt;br /&gt;
*Worked on setting up a new pull for the updated SDC data&lt;br /&gt;
9/21/17 15:00 - 17:00&lt;br /&gt;
*Finished the pull and sorted the data from the updated accelerator list&lt;br /&gt;
9/22/17 15:00 - 17:00&lt;br /&gt;
*Tried to set up the matcher with Matthew; ran into some difficulties on Power Shell, returning a blank file in the output&lt;br /&gt;
9/26/17 15:00 - 17:00&lt;br /&gt;
*Finished the match and created pivot tables to count the number of repetitions (companies going through more than one accelerator)&lt;br /&gt;
9/27/17 15:00 - 17:00&lt;br /&gt;
*Discussed with Matthew the best way to collect the VC data from the repetitions. We tried different matches through our SDC data to no avail&lt;br /&gt;
9/28/17 16:00 - 17:00&lt;br /&gt;
*Continued attempting to match with SDC the different columns. Didn't work without separating the data into individual files, a very tedious process.&lt;br /&gt;
9/29/17 15:00 - 17:00&lt;br /&gt;
*Spoke with Ed about incubators project, will begin as soon as we can time the accelerator startup investments. Ed is expecting us to begin sometime in the next two months, using a similar process as we did for incubators. The process should be handled by a new worker.&lt;br /&gt;
10/02/17 15:00 - 17:00&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the CrunchBase database on SQL and brushed up on SQL code.&lt;br /&gt;
10/03/17 15:00 - 17:00&lt;br /&gt;
*Sifted through the database for Crunchbase investment information.&lt;br /&gt;
10/04/17 15:00 - 17:00&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
10/06/17 15:00 - 17:00&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
10/11/17 15:00 - 17:00&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
10/12/17 15:00 - 17:00&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for identifying the time when a company went through the accelerator. Created a list of VC Companies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
10/16/17 15:00 - 17:00&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
10/17/17 15:00 - 17:00&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
10/18/17 15:00 - 17:00&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
10/19/17 15:00 - 17:00&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
10/20/17 15:00 - 17:00&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
10/23/17 15:00 - 17:00&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
10/24/17 15:00 - 17:00&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
10/25/17 15:00 - 17:00&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
10/26/17 15:00 - 17:00&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
10/30/17 15:00 - 17:00&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
10/31/17 15:00 - 17:00&lt;br /&gt;
*Began compiling data in the column for the dates that a specific company went through an Accelerator.&lt;br /&gt;
11/01/17 15:00 - 17:00&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
11/02/17 15:00 - 17:00&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
11/06/17 15:00 - 17:00&lt;br /&gt;
*Began looking at keywords for identifying the cohort class dates for each company&lt;br /&gt;
11/07/17 15:00 - 17:00&lt;br /&gt;
*Received list from Peter with the accelerator founders matched from the Crunchbase LinkedIn URLs and proceeded to find the links for those founders without a match on Crunchbase. Data found in &amp;quot;Unfound Founders List&amp;quot; in the Fall 2017 folder&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;br /&gt;
3/01/17 14:00 - 16:00&lt;br /&gt;
*Worked with Ben and Matthew to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
3/02/17 14:00 - 16:00&lt;br /&gt;
*Tried to repeat the VC data pull without it crashing from pulling too many entries. Unfortunately, we were unable to finish it&lt;br /&gt;
3/06/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to put final touches on the cohort data to prep it for matching with our VC data&lt;br /&gt;
3/07/17 14:00 - 16:00&lt;br /&gt;
*Finally finished working on the cohort files, will match on the 8th&lt;br /&gt;
3/08/17 14:00 - 16:00&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
3/20/17 14:00 - 16:00&lt;br /&gt;
*Participated in a SQL training session with Ed, learned how to create a database and to pull tab delimited information from text files onto a table&lt;br /&gt;
3/21/17 14:00 - 16:00&lt;br /&gt;
*Met with Ed and arrived at the conclusion of finishing the draft for a report by the end of the semester. Put the initial report information on the accelerator page using the variables that we currently have&lt;br /&gt;
3/22/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to compile tables in our database of the matched VC-portfolio company lists and the overall accelerator cohort information. Found multiple errors in the cohort file which needed to be fixed before finishing the tables and analyzing the data&lt;br /&gt;
3/23/17 14:00 - 16:00&lt;br /&gt;
*Finished cleaning the cohort file once again.&lt;br /&gt;
3/24/17 14:00 - 16:00&lt;br /&gt;
*Continued practicing my SQL and creating the code for compiling the tables&lt;br /&gt;
3/29/17 14:00 - 16:00&lt;br /&gt;
*Worked on the matched data with Matthew. Will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC-backed company names matched to one cohort company name&lt;br /&gt;
3/30/17 14:00 - 16:00&lt;br /&gt;
*Examined the Regex code for the URLs and attempted to filter them out&lt;br /&gt;
4/03/17 14:00 - 16:00&lt;br /&gt;
*Continued learning some SQL from Ed&lt;br /&gt;
4/04/17 14:00 - 16:00&lt;br /&gt;
*Began examining the Crunchbase data; looked through the 2013 snapshot&lt;br /&gt;
*Created a new Crunchbase account with McNair center and examined the basic access, which does not give us much information&lt;br /&gt;
4/05/17 14:00 - 16:00&lt;br /&gt;
*Made the final VC percentage table from our database and previous code with Ed; realized we were missing many accelerators as well as a lot of important cohort data so need to reexamine our previous data.&lt;br /&gt;
4/06/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through Crunchbase to see how we can pull accelerators up until 2013; most likely will use objects to sort the data into accelerators, perhaps keywords from &amp;quot;accelerators&amp;quot;&lt;br /&gt;
4/07/17 14:00 - 16:00&lt;br /&gt;
*Examined SARP and attempted to match their accelerators with the ones from our data, realized that a few of our cohorts were missing as well as a few of the actual accelerators so we need to fix the data in our excel file&lt;br /&gt;
*Began compiling a list of missing accelerators on textpad to later insert into our excel.&lt;br /&gt;
4/10/17 13:00 - 16:00&lt;br /&gt;
*Worked with Ben to find missing accelerators from the Crunchbase data using the keywords. Also, began recording information from some of the big accelerators we were missing&lt;br /&gt;
*Found 228 matches for accelerators, will match from our list to find the similarities&lt;br /&gt;
4/11/17 14:00 - 16:00&lt;br /&gt;
*Finished compiling the accelerator and cohort information for the few we found from SARP, will consult Ed to figure out how to approach the missing accelerators and what to do for the preliminary report&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Talk:Accelerator_Data&amp;diff=21920</id>
		<title>Talk:Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Talk:Accelerator_Data&amp;diff=21920"/>
		<updated>2017-11-15T22:27:03Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;LinkedIn Founders:&lt;br /&gt;
&lt;br /&gt;
Most founders can be found with a combination of 1 of these 3 searches within the first 5 hits:&lt;br /&gt;
*&amp;quot;accelerator name&amp;quot; + founder + LinkedIn&lt;br /&gt;
*&amp;quot;accelerator name&amp;quot; + executive director + LinkedIn&lt;br /&gt;
*&amp;quot;accelerator name&amp;quot; + CEO + LinkedIn&lt;br /&gt;
&lt;br /&gt;
However, some companies do not appear regardless of these three searches. For these, it is sometimes valuable to search the company name on LinkedIn and then if their list of employees is public, find the director from the list and search their description on Google to find the actual LinkedIn url.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Demo Day hits:&lt;br /&gt;
&lt;br /&gt;
It appears that counts are inflated by the presence of startup names such as &amp;quot;tio&amp;quot; appearing in other words on the webpage, such as &amp;quot;innova-tio-n&amp;quot;. To remove these, we suggest:&lt;br /&gt;
*removing all punctuation&lt;br /&gt;
*condensing all of the text in one line&lt;br /&gt;
*running the match for &amp;quot; startup_name &amp;quot; with spaces before and after the name, as well as for the startup name at the beginning or the end of the line&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Talk:Accelerator_Data&amp;diff=21919</id>
		<title>Talk:Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Talk:Accelerator_Data&amp;diff=21919"/>
		<updated>2017-11-15T22:17:55Z</updated>

		<summary type="html">&lt;p&gt;Shrey: Created page with &amp;quot;LinkedIn Founders:  Most founders can be found with a combination of 1 of these 3 searches within the first 5 hits: &amp;quot;accelerator name&amp;quot; + founder + LinkedIn &amp;quot;accelerator name&amp;quot;...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;LinkedIn Founders:&lt;br /&gt;
&lt;br /&gt;
Most founders can be found with a combination of 1 of these 3 searches within the first 5 hits:&lt;br /&gt;
&amp;quot;accelerator name&amp;quot; + founder + LinkedIn&lt;br /&gt;
&amp;quot;accelerator name&amp;quot; + executive director + LinkedIn&lt;br /&gt;
&amp;quot;accelerator name&amp;quot; + CEO + LinkedIn&lt;br /&gt;
&lt;br /&gt;
However, some companies do not appear regardless of these three searches. For these, it is sometimes valuable to search the company name on LinkedIn and then if their list of employees is public, find the director from the list and search their description on Google to find the actual linkedin url.&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21918</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21918"/>
		<updated>2017-11-15T22:12:57Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* List of All Relevant Files */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
&lt;br /&gt;
'''Original Search'''&lt;br /&gt;
&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
&lt;br /&gt;
'''Cohort Directory &amp;quot;Big Push&amp;quot;'''&lt;br /&gt;
&lt;br /&gt;
*'''Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\&lt;br /&gt;
**Description: This folder contains files for each of the accelerators that we searched through from the &amp;quot;List of Preliminary Accelerators&amp;quot;. There are three files per accelerator: 1) The &amp;quot;accelerator name.txt&amp;quot; file which contains each of the variables recorded by all of the McNair Center workers during our big push on the project winter 2016, 2) The .html file for the cohort page if the entry was indeed an accelerator and if the worker could find the cohort page on that accelerator, and 3) a &amp;quot;accelerator name.cohort.txt&amp;quot; file which contains a list of the cohort companies as well as all variables which were easily found alongside the cohort.&lt;br /&gt;
&lt;br /&gt;
*'''List of Python files'''&lt;br /&gt;
**'''parse_accelerator_data'''&lt;br /&gt;
**'''parse_cohort_data'''&lt;br /&gt;
**'''process_locations'''&lt;br /&gt;
**'''wayback_machine'''&lt;br /&gt;
**Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: These files contain the code which Peter used to categorize the data from the &amp;quot;Data Copy&amp;quot; folder in Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\, which is just a copy of our cleaned data file. From this code, Peter returned for us a list of accelerators categorized by their flag and a compiled list of all the cohort companies as well as the variables recorded by McNair workers.&lt;br /&gt;
**'''Note''': We manually altered the cohort data which came out of Peter's code so that we could homogenize the formatting. This resulted in a unique cohort file which will not be replicated when running the code again. On the other hand, we manually altered the individual txt files for the accelerators to fix format so running Peter's code again should result in a similar file.&lt;br /&gt;
&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
&lt;br /&gt;
'''Refining the List'''&lt;br /&gt;
&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
**Potential Crunchbase Variables&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name, Whois parser code&lt;br /&gt;
&lt;br /&gt;
'''Additional Variables'''&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21873</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21873"/>
		<updated>2017-11-14T22:01:54Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* List of All Relevant Files */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
&lt;br /&gt;
'''Original Search'''&lt;br /&gt;
&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
&lt;br /&gt;
'''Cohort Directory &amp;quot;Big Push&amp;quot;'''&lt;br /&gt;
&lt;br /&gt;
*'''Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\&lt;br /&gt;
**Description: This folder contains files for each of the accelerators that we searched through from the &amp;quot;List of Preliminary Accelerators&amp;quot;. There are three files per accelerator: 1) The &amp;quot;accelerator name.txt&amp;quot; file which contains each of the variables recorded by all of the McNair Center workers during our big push on the project winter 2016, 2) The .html file for the cohort page if the entry was indeed an accelerator and if the worker could find the cohort page on that accelerator, and 3) a &amp;quot;accelerator name.cohort.txt&amp;quot; file which contains a list of the cohort companies as well as all variables which were easily found alongside the cohort.&lt;br /&gt;
&lt;br /&gt;
*'''List of Python files'''&lt;br /&gt;
**'''parse_accelerator_data'''&lt;br /&gt;
**'''parse_cohort_data'''&lt;br /&gt;
**'''process_locations'''&lt;br /&gt;
**'''wayback_machine'''&lt;br /&gt;
**Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: These files contain the code which Peter used to categorize the data from the &amp;quot;Data Copy&amp;quot; folder in Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\, which is just a copy of our cleaned data file. From this code, Peter returned for us a list of accelerators categorized by their flag and a compiled list of all the cohort companies as well as the variables recorded by McNair workers.&lt;br /&gt;
&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
&lt;br /&gt;
'''Refining the List'''&lt;br /&gt;
&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
**Potential Crunchbase Variables&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name, Whois parser code&lt;br /&gt;
&lt;br /&gt;
'''Additional Variables'''&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21872</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21872"/>
		<updated>2017-11-14T22:00:40Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* List of All Relevant Files */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
&lt;br /&gt;
'''Original Search'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
&lt;br /&gt;
'''Cohort Directory &amp;quot;Big Push&amp;quot;'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\&lt;br /&gt;
**Description: This folder contains files for each of the accelerators that we searched through from the &amp;quot;List of Preliminary Accelerators&amp;quot;. There are three files per accelerator: 1) The &amp;quot;accelerator name.txt&amp;quot; file which contains each of the variables recorded by all of the McNair Center workers during our big push on the project winter 2016, 2) The .html file for the cohort page if the entry was indeed an accelerator and if the worker could find the cohort page on that accelerator, and 3) a &amp;quot;accelerator name.cohort.txt&amp;quot; file which contains a list of the cohort companies as well as all variables which were easily found alongside the cohort.&lt;br /&gt;
&lt;br /&gt;
*'''List of Python files'''&lt;br /&gt;
**'''parse_accelerator_data'''&lt;br /&gt;
**'''parse_cohort_data'''&lt;br /&gt;
**'''process_locations'''&lt;br /&gt;
**'''wayback_machine'''&lt;br /&gt;
**Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: These files contain the code which Peter used to categorize the data from the &amp;quot;Data Copy&amp;quot; folder in Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\, which is just a copy of our cleaned data file. From this code, Peter returned for us a list of accelerators categorized by their flag and a compiled list of all the cohort companies as well as the variables recorded by McNair workers.&lt;br /&gt;
&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
&lt;br /&gt;
'''Refining the List'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
**Potential Crunchbase Variables&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name, Whois parser code&lt;br /&gt;
&lt;br /&gt;
'''Additional Variables'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21870</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21870"/>
		<updated>2017-11-14T21:54:16Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
&lt;br /&gt;
*'''Original Search'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
**'''List of Preliminary Accelerators'''&lt;br /&gt;
***Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
***Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
***Variables: Names of potential accelerators&lt;br /&gt;
&lt;br /&gt;
**'''accelerator_data_noflag'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
***Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
***Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
&lt;br /&gt;
*'''Cohort Directory &amp;quot;Big Push&amp;quot;'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
**'''Data'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\&lt;br /&gt;
***Description: This folder contains files for each of the accelerators that we searched through from the &amp;quot;List of Preliminary Accelerators&amp;quot;. There are three files per accelerator: 1) The &amp;quot;accelerator name.txt&amp;quot; file which contains each of the variables recorded by all of the McNair Center workers during our big push on the project winter 2016, 2) The .html file for the cohort page if the entry was indeed an accelerator and if the worker could find the cohort page on that accelerator, and 3) a &amp;quot;accelerator name.cohort.txt&amp;quot; file which contains a list of the cohort companies as well as all variables which were easily found alongside the cohort.&lt;br /&gt;
&lt;br /&gt;
**'''List of Python files'''&lt;br /&gt;
***'''parse_accelerator_data'''&lt;br /&gt;
***'''parse_cohort_data'''&lt;br /&gt;
***'''process_locations'''&lt;br /&gt;
***'''wayback_machine'''&lt;br /&gt;
***Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
***Description: These files contain the code which Peter used to categorize the data from the &amp;quot;Data Copy&amp;quot; folder in Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\, which is just a copy of our cleaned data file. From this code, Peter returned for us a list of accelerators categorized by their flag and a compiled list of all the cohort companies as well as the variables recorded by McNair workers.&lt;br /&gt;
&lt;br /&gt;
**'''Cleaned Cohort Data'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
***Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
***Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
&lt;br /&gt;
*'''Refining the List'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
**'''New Crunchbase Accelerators'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
***Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
***Variables: Names of Missing Accelerators&lt;br /&gt;
***Potential Crunchbase Variables&lt;br /&gt;
&lt;br /&gt;
**'''Accelerator_Data'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
***Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
***Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
&lt;br /&gt;
**'''ListofAccs'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
***Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
***Variables: Accelerator name, Whois parser code&lt;br /&gt;
&lt;br /&gt;
*'''Additional Variables'''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
**'''Accelerator_Cohort_Companies'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
***Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
***Variables: Cohort Companies, Accelerator name&lt;br /&gt;
&lt;br /&gt;
**'''Current Matched Data'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
***Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
***Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
&lt;br /&gt;
**'''founders_linkedin'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
***Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
***Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21868</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21868"/>
		<updated>2017-11-14T21:50:44Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* List of All Relevant Files */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
&lt;br /&gt;
*'''Original Search'''&lt;br /&gt;
&lt;br /&gt;
**'''List of Preliminary Accelerators'''&lt;br /&gt;
***Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
***Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
***Variables: Names of potential accelerators&lt;br /&gt;
&lt;br /&gt;
**'''accelerator_data_noflag'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
***Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
***Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
&lt;br /&gt;
*'''Cohort Directory &amp;quot;Big Push&amp;quot;'''&lt;br /&gt;
&lt;br /&gt;
**'''Data'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\&lt;br /&gt;
***Description: This folder contains files for each of the accelerators that we searched through from the &amp;quot;List of Preliminary Accelerators&amp;quot;. There are three files per accelerator: 1) The &amp;quot;accelerator name.txt&amp;quot; file which contains each of the variables recorded by all of the McNair Center workers during our big push on the project winter 2016, 2) The .html file for the cohort page if the entry was indeed an accelerator and if the worker could find the cohort page on that accelerator, and 3) a &amp;quot;accelerator name.cohort.txt&amp;quot; file which contains a list of the cohort companies as well as all variables which were easily found alongside the cohort.&lt;br /&gt;
&lt;br /&gt;
**'''List of Python files'''&lt;br /&gt;
***'''parse_accelerator_data'''&lt;br /&gt;
***'''parse_cohort_data'''&lt;br /&gt;
***'''process_locations'''&lt;br /&gt;
***'''wayback_machine'''&lt;br /&gt;
***Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
***Description: These files contain the code which Peter used to categorize the data from the &amp;quot;Data Copy&amp;quot; folder in Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\, which is just a copy of our cleaned data file. From this code, Peter returned for us a list of accelerators categorized by their flag and a compiled list of all the cohort companies as well as the variables recorded by McNair workers.&lt;br /&gt;
&lt;br /&gt;
**'''Cleaned Cohort Data'''&lt;br /&gt;
***Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
***Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
***Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
&lt;br /&gt;
*'''Refining the List'''&lt;br /&gt;
&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
**Potential Crunchbase Variables&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name, Whois parser code&lt;br /&gt;
&lt;br /&gt;
*'''Additional &lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=21862</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=21862"/>
		<updated>2017-11-14T21:06:18Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* Houston Accelerators */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu, Veeral Shah,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has keywords=Accelerators,Data&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Current Work=&lt;br /&gt;
&lt;br /&gt;
TODO:&lt;br /&gt;
 McNair/Projects/Accelerators/Fall 2017/unfound_founders.txt&lt;br /&gt;
A 0 means we don't have founder data for that accelerator.&lt;br /&gt;
Specs: A tab delimited text file with the following fields:&lt;br /&gt;
 Accelerator   First Name   Last Name   LinkedInURL(if possible)&lt;br /&gt;
Getting the LinkedInURL will ensure accuracy, but will work without it.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Shrey: Find &amp;quot;demo day&amp;quot; keywords, so that we can search AcceleratorName Year Keyword and get back potential demo day pages&lt;br /&gt;
*Joe: Go through Accelerator list (approx 273 accelerators) and mark each by type (see below), building out type list as you go&lt;br /&gt;
&lt;br /&gt;
Type list:&lt;br /&gt;
*Private&lt;br /&gt;
*Corporate&lt;br /&gt;
*Academic&lt;br /&gt;
 Note: if DEAD, noted here.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Other info:&lt;br /&gt;
*nonprofit? (y/n)&lt;br /&gt;
&lt;br /&gt;
*Subtype abbreviations:&lt;br /&gt;
**S: for if a social entrepreneurship initiative&lt;br /&gt;
**I: for if an incubator&lt;br /&gt;
**A: for an angel group&lt;br /&gt;
**F: for foreign&lt;br /&gt;
**C: for in coworking space/hub/etc&lt;br /&gt;
**V: for if part of venture fund&lt;br /&gt;
**G: for if government funded/partnered&lt;br /&gt;
**T: for international&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 Note: subtypes (from individual text files in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data) were only found for 23 of the 270 accelerators.  These accelerators were initially intended to be removed from the master list.  Remaining subtypes are currently being added.&lt;br /&gt;
&lt;br /&gt;
other info: &lt;br /&gt;
&lt;br /&gt;
international offices, founders, industries, org type, program duration, or other interesting, easily accessed variables.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Type list file saved as &lt;br /&gt;
 &amp;quot;Accelerator type list&amp;quot; in E:\McNair\Projects\Accelerators\Fall 2017\Grouping project of ListOfAccs.&lt;br /&gt;
The list of ListofAccs, from which we drew Accelerator type list, should have no matches with any of the flagged accelerators in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data.  There are 23 matches though.  So all subtypes must be searched and entered manually.  Whether it is a nonprofit is listed in E:\McNair\Projects\Accelerators\Fall 2017\Grouping project of ListOfAccs, called &amp;quot;whether nonprofit...&amp;quot;&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in, as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC-backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==MATCHING==&lt;br /&gt;
&lt;br /&gt;
The files we used to match are located in the E drive. We used the matcher to match our portfolio company names from the cohort file located in E:\McNair\Projects\Accelerators. &lt;br /&gt;
*The files used to matching are located E:\McNair\Projects\Accelerators\Matcher&lt;br /&gt;
*Portco is the name of the companies pulled from the cohort file&lt;br /&gt;
*AccCo includes both the cohort company name, along with the name of the accelerator itself&lt;br /&gt;
*In the matcher, the inputs are the PortCo names, as well as the VC data from our pull in SDC&lt;br /&gt;
*The outputs include the AccCo_VC data located in E:\McNair\Projects\Accelerators which give a lot of information on the matches, including:&lt;br /&gt;
:*name of the match itself&lt;br /&gt;
:*number of investments&lt;br /&gt;
:*dates that the company received its investments&lt;br /&gt;
&lt;br /&gt;
==SDC Pull==&lt;br /&gt;
&lt;br /&gt;
We accessed SDC platinum and pulled information on round-based funding that all registered companies received from between the years 1999 to 2017.&lt;br /&gt;
&lt;br /&gt;
The receipt is as follows:&lt;br /&gt;
&lt;br /&gt;
Session Details&lt;br /&gt;
---------------&lt;br /&gt;
Request   Hits    Request Description&lt;br /&gt;
   0        -     DATABASE: Portfolio Companies (VIPC)&lt;br /&gt;
   1     96155    Venture Related Deals: Select All Venture Related Deals&lt;br /&gt;
   2     79572    Round Date: 1/1/1999 to 3/1/2017 (Custom) (Calendar)&lt;br /&gt;
   3              Custom Report: VC Data (Columnar) - Save As:&lt;br /&gt;
                  E:\McNair\Projects\Accelerators\VC Data.txt&lt;br /&gt;
�&lt;br /&gt;
Billing Ref # : 2054025&lt;br /&gt;
Capture File  : riceuniv.2054025&lt;br /&gt;
Session Name  : &lt;br /&gt;
&lt;br /&gt;
The VC data pull includes the following variables: &lt;br /&gt;
&lt;br /&gt;
Company Name                                                           Date Company      Date Company      Company        Company City                           Company Street Address, Line 1               Company Street Address, Line 2            Total Known     Company Industry Sub-Group 3                              Company Industry Major Group     Round          Company Stage Level 3     Round Amt,       Round Amt,&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/ (Does not work anymore)&lt;br /&gt;
&lt;br /&gt;
https://data.crunchbase.com/v3/docs/using-the-api (Has new instructions for application)&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]], Crunchbase, and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4 (Does not work - seems to be replaced by https://angel.co/companies?company_types[]=Incubator )&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
*Crunchbase has its own webpage with instructions for how we retrieve the data&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==SOURCE: Crunchbase==&lt;br /&gt;
*All of the information for the Crunchbase documentation is located in the page [[Crunchbase 2013 Snapshot]] webpage, along with the documentation for how we determined the accelerator information.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;br /&gt;
&lt;br /&gt;
=Kauffman Foundation Incubator Proposal Information=&lt;br /&gt;
&lt;br /&gt;
==Institutions==&lt;br /&gt;
Summary: F6S, Crunchbase, seed-db&lt;br /&gt;
&lt;br /&gt;
Tools: Matcher - used to match lists of potential accelerators with our current list to identify duplicates/new matches (E:\McNair\Projects\Accelerators)&lt;br /&gt;
&lt;br /&gt;
===F6S===&lt;br /&gt;
F6S WebCrawler and F6S Parser - E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
===CrunchBase===&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(All Organizations)'''- E:\McNair\Projects\Accelerators\organizations.xls&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(Potential Accelerators)'''- E:\McNair\Projects\Accelerators\organizations.accdb under &amp;quot;Potential Accelerators query&amp;quot; &lt;br /&gt;
&lt;br /&gt;
*Obtained using keyword matches in the descriptions of the potential accelerators.&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(New Verified Accelerators)''' - E:\McNair\Projects\Accelerators\New CrunchBase Accelerators.xls&lt;br /&gt;
&lt;br /&gt;
We have the Crunchbase 2013 Snapshot which provided lots of new data on accelerators and incubators but we would love to use the Crunchbase API to get a current database snapshot that we could use to cross reference companies and add newly formed accelerator and incubator companies.&lt;br /&gt;
&lt;br /&gt;
===AngelList===&lt;br /&gt;
&lt;br /&gt;
===seed-db===&lt;br /&gt;
&lt;br /&gt;
Obtained through www.seed.db/accelerators&lt;br /&gt;
&lt;br /&gt;
===Global Accelerator Network (GAN)===&lt;br /&gt;
&lt;br /&gt;
GAN Parser- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\scrapeaccel.py&lt;br /&gt;
&lt;br /&gt;
GAN Data- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\GAN Accelerator Data&lt;br /&gt;
*Contains: Company Name, # of Companies Range, % of Companies Funded, Funding Raised by Companies, Employee Range, Exit Funding, Exit Date, Total Company Funding Raised, # of Mentors Range, % Equity, Location, Minimum Seed Capital Investment&lt;br /&gt;
&lt;br /&gt;
==Cohorts==&lt;br /&gt;
&lt;br /&gt;
*Cohorts obtained manually&lt;br /&gt;
*All Cohort txt files are saved under &amp;quot;E:\McNair\Projects\Accelerators\Data  &lt;br /&gt;
*cohort file name = (accelerator name).cohort&lt;br /&gt;
*Most updated Accelerator cohort data: E:\McNair\Projects\Accelerators\Cleaned Cohort Data.xls&lt;br /&gt;
&lt;br /&gt;
Automation for obtaining cohorts??&lt;br /&gt;
&lt;br /&gt;
==Other Information==&lt;br /&gt;
Summary: Whois Parser, Geocode, Tools to determine industry, etc&lt;br /&gt;
&lt;br /&gt;
===Whois Parser===&lt;br /&gt;
&lt;br /&gt;
*Retrieves and parses Whois information. Specifically, takes a file with a column of domain names and populates the corresponding columns with information from the WhoIs API.&lt;br /&gt;
&lt;br /&gt;
*Often used to obtain locations.&lt;br /&gt;
&lt;br /&gt;
===Geocode===&lt;br /&gt;
&lt;br /&gt;
Input: Company Address&lt;br /&gt;
Output: Directional Coordinates&lt;br /&gt;
&lt;br /&gt;
*Used to obtain the locations of different Accelerators and Cohort companies.&lt;br /&gt;
&lt;br /&gt;
===SDC Platinum Pull===&lt;br /&gt;
&lt;br /&gt;
Used to obtain funding information and match companies that have gotten funding with companies that are Accelerator cohorts.&lt;br /&gt;
&lt;br /&gt;
===Desired Information/Variables===&lt;br /&gt;
&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and&lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
&lt;br /&gt;
==Desired Tools/Information==&lt;br /&gt;
&lt;br /&gt;
===Automating the Process of Obtaining Cohorts===&lt;br /&gt;
*Automating this process would save a lot of time and really progress the project.&lt;br /&gt;
&lt;br /&gt;
===Obtaining More Details on Accelerators===&lt;br /&gt;
&lt;br /&gt;
*Having the kind of thorough information on industry, companies, funding, location, exits, mentors, leadership,  that we got for the GAN companies would be fantastic.&lt;br /&gt;
&lt;br /&gt;
===List of Alive/Dead Accelerators===&lt;br /&gt;
&lt;br /&gt;
This is a dream but would be very helpful&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=21861</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=21861"/>
		<updated>2017-11-14T21:05:55Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* Source: http://www.seed-db.com/accelerators/all */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu, Veeral Shah,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has keywords=Accelerators,Data&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Current Work=&lt;br /&gt;
&lt;br /&gt;
TODO:&lt;br /&gt;
 McNair/Projects/Accelerators/Fall 2017/unfound_founders.txt&lt;br /&gt;
A 0 means we don't have founder data for that accelerator.&lt;br /&gt;
Specs: A tab delimited text file with the following fields:&lt;br /&gt;
 Accelerator   First Name   Last Name   LinkedInURL(if possible)&lt;br /&gt;
Getting the LinkedInURL will ensure accuracy, but will work without it.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Shrey: Find &amp;quot;demo day&amp;quot; keywords, so that we can search AcceleratorName Year Keyword and get back potential demo day pages&lt;br /&gt;
*Joe: Go through Accelerator list (approx 273 accelerators) and mark each by type (see below), building out type list as you go&lt;br /&gt;
&lt;br /&gt;
Type list:&lt;br /&gt;
*Private&lt;br /&gt;
*Corporate&lt;br /&gt;
*Academic&lt;br /&gt;
 Note: if DEAD, noted here.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Other info:&lt;br /&gt;
*nonprofit? (y/n)&lt;br /&gt;
&lt;br /&gt;
*Subtype abbreviations:&lt;br /&gt;
**S: for if a social entrepreneurship initiative&lt;br /&gt;
**I: for if an incubator&lt;br /&gt;
**A: for an angel group&lt;br /&gt;
**F: for foreign&lt;br /&gt;
**C: for in coworking space/hub/etc&lt;br /&gt;
**V: for if part of venture fund&lt;br /&gt;
**G: for if government funded/partnered&lt;br /&gt;
**T: for international&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 Note: subtypes (from individual text files in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data) were only found for 23 of the 270 accelerators.  These accelerators were initially intended to be removed from the master list.  Remaining subtypes are currently being added.&lt;br /&gt;
&lt;br /&gt;
other info: &lt;br /&gt;
&lt;br /&gt;
international offices, founders, industries, org type, program duration, or other interesting, easily accessed variables.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Type list file saved as &lt;br /&gt;
 &amp;quot;Accelerator type list&amp;quot; in E:\McNair\Projects\Accelerators\Fall 2017\Grouping project of ListOfAccs.&lt;br /&gt;
The list of ListofAccs, from which we drew Accelerator type list, should have no matches with any of the flagged accelerators in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data.  There are 23 matches though.  So all subtypes must be searched and entered manually.  Whether it is a nonprofit is listed in E:\McNair\Projects\Accelerators\Fall 2017\Grouping project of ListOfAccs, called &amp;quot;whether nonprofit...&amp;quot;&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in, as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC-backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==MATCHING==&lt;br /&gt;
&lt;br /&gt;
The files we used to match are located in the E drive. We used the matcher to match our portfolio company names from the cohort file located in E:\McNair\Projects\Accelerators. &lt;br /&gt;
*The files used to matching are located E:\McNair\Projects\Accelerators\Matcher&lt;br /&gt;
*Portco is the name of the companies pulled from the cohort file&lt;br /&gt;
*AccCo includes both the cohort company name, along with the name of the accelerator itself&lt;br /&gt;
*In the matcher, the inputs are the PortCo names, as well as the VC data from our pull in SDC&lt;br /&gt;
*The outputs include the AccCo_VC data located in E:\McNair\Projects\Accelerators which give a lot of information on the matches, including:&lt;br /&gt;
:*name of the match itself&lt;br /&gt;
:*number of investments&lt;br /&gt;
:*dates that the company received its investments&lt;br /&gt;
&lt;br /&gt;
==SDC Pull==&lt;br /&gt;
&lt;br /&gt;
We accessed SDC platinum and pulled information on round-based funding that all registered companies received from between the years 1999 to 2017.&lt;br /&gt;
&lt;br /&gt;
The receipt is as follows:&lt;br /&gt;
&lt;br /&gt;
Session Details&lt;br /&gt;
---------------&lt;br /&gt;
Request   Hits    Request Description&lt;br /&gt;
   0        -     DATABASE: Portfolio Companies (VIPC)&lt;br /&gt;
   1     96155    Venture Related Deals: Select All Venture Related Deals&lt;br /&gt;
   2     79572    Round Date: 1/1/1999 to 3/1/2017 (Custom) (Calendar)&lt;br /&gt;
   3              Custom Report: VC Data (Columnar) - Save As:&lt;br /&gt;
                  E:\McNair\Projects\Accelerators\VC Data.txt&lt;br /&gt;
�&lt;br /&gt;
Billing Ref # : 2054025&lt;br /&gt;
Capture File  : riceuniv.2054025&lt;br /&gt;
Session Name  : &lt;br /&gt;
&lt;br /&gt;
The VC data pull includes the following variables: &lt;br /&gt;
&lt;br /&gt;
Company Name                                                           Date Company      Date Company      Company        Company City                           Company Street Address, Line 1               Company Street Address, Line 2            Total Known     Company Industry Sub-Group 3                              Company Industry Major Group     Round          Company Stage Level 3     Round Amt,       Round Amt,&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/ (Does not work anymore)&lt;br /&gt;
&lt;br /&gt;
https://data.crunchbase.com/v3/docs/using-the-api (Has new instructions for application)&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]], Crunchbase, and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4 (Does not work - seems to be replaced by https://angel.co/companies?company_types[]=Incubator )&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
*Crunchbase has its own webpage with instructions for how we retrieve the data&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==SOURCE: Crunchbase==&lt;br /&gt;
*All of the information for the Crunchbase documentation is located in the page [[Crunchbase 2013 Snapshot]] webpage, along with the documentation for how we determined the accelerator information.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/8&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;br /&gt;
&lt;br /&gt;
=Kauffman Foundation Incubator Proposal Information=&lt;br /&gt;
&lt;br /&gt;
==Institutions==&lt;br /&gt;
Summary: F6S, Crunchbase, seed-db&lt;br /&gt;
&lt;br /&gt;
Tools: Matcher - used to match lists of potential accelerators with our current list to identify duplicates/new matches (E:\McNair\Projects\Accelerators)&lt;br /&gt;
&lt;br /&gt;
===F6S===&lt;br /&gt;
F6S WebCrawler and F6S Parser - E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
===CrunchBase===&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(All Organizations)'''- E:\McNair\Projects\Accelerators\organizations.xls&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(Potential Accelerators)'''- E:\McNair\Projects\Accelerators\organizations.accdb under &amp;quot;Potential Accelerators query&amp;quot; &lt;br /&gt;
&lt;br /&gt;
*Obtained using keyword matches in the descriptions of the potential accelerators.&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(New Verified Accelerators)''' - E:\McNair\Projects\Accelerators\New CrunchBase Accelerators.xls&lt;br /&gt;
&lt;br /&gt;
We have the Crunchbase 2013 Snapshot which provided lots of new data on accelerators and incubators but we would love to use the Crunchbase API to get a current database snapshot that we could use to cross reference companies and add newly formed accelerator and incubator companies.&lt;br /&gt;
&lt;br /&gt;
===AngelList===&lt;br /&gt;
&lt;br /&gt;
===seed-db===&lt;br /&gt;
&lt;br /&gt;
Obtained through www.seed.db/accelerators&lt;br /&gt;
&lt;br /&gt;
===Global Accelerator Network (GAN)===&lt;br /&gt;
&lt;br /&gt;
GAN Parser- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\scrapeaccel.py&lt;br /&gt;
&lt;br /&gt;
GAN Data- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\GAN Accelerator Data&lt;br /&gt;
*Contains: Company Name, # of Companies Range, % of Companies Funded, Funding Raised by Companies, Employee Range, Exit Funding, Exit Date, Total Company Funding Raised, # of Mentors Range, % Equity, Location, Minimum Seed Capital Investment&lt;br /&gt;
&lt;br /&gt;
==Cohorts==&lt;br /&gt;
&lt;br /&gt;
*Cohorts obtained manually&lt;br /&gt;
*All Cohort txt files are saved under &amp;quot;E:\McNair\Projects\Accelerators\Data  &lt;br /&gt;
*cohort file name = (accelerator name).cohort&lt;br /&gt;
*Most updated Accelerator cohort data: E:\McNair\Projects\Accelerators\Cleaned Cohort Data.xls&lt;br /&gt;
&lt;br /&gt;
Automation for obtaining cohorts??&lt;br /&gt;
&lt;br /&gt;
==Other Information==&lt;br /&gt;
Summary: Whois Parser, Geocode, Tools to determine industry, etc&lt;br /&gt;
&lt;br /&gt;
===Whois Parser===&lt;br /&gt;
&lt;br /&gt;
*Retrieves and parses Whois information. Specifically, takes a file with a column of domain names and populates the corresponding columns with information from the WhoIs API.&lt;br /&gt;
&lt;br /&gt;
*Often used to obtain locations.&lt;br /&gt;
&lt;br /&gt;
===Geocode===&lt;br /&gt;
&lt;br /&gt;
Input: Company Address&lt;br /&gt;
Output: Directional Coordinates&lt;br /&gt;
&lt;br /&gt;
*Used to obtain the locations of different Accelerators and Cohort companies.&lt;br /&gt;
&lt;br /&gt;
===SDC Platinum Pull===&lt;br /&gt;
&lt;br /&gt;
Used to obtain funding information and match companies that have gotten funding with companies that are Accelerator cohorts.&lt;br /&gt;
&lt;br /&gt;
===Desired Information/Variables===&lt;br /&gt;
&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and&lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
&lt;br /&gt;
==Desired Tools/Information==&lt;br /&gt;
&lt;br /&gt;
===Automating the Process of Obtaining Cohorts===&lt;br /&gt;
*Automating this process would save a lot of time and really progress the project.&lt;br /&gt;
&lt;br /&gt;
===Obtaining More Details on Accelerators===&lt;br /&gt;
&lt;br /&gt;
*Having the kind of thorough information on industry, companies, funding, location, exits, mentors, leadership,  that we got for the GAN companies would be fantastic.&lt;br /&gt;
&lt;br /&gt;
===List of Alive/Dead Accelerators===&lt;br /&gt;
&lt;br /&gt;
This is a dream but would be very helpful&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=21860</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=21860"/>
		<updated>2017-11-14T21:05:28Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* Sources */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu, Veeral Shah,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has keywords=Accelerators,Data&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Current Work=&lt;br /&gt;
&lt;br /&gt;
TODO:&lt;br /&gt;
 McNair/Projects/Accelerators/Fall 2017/unfound_founders.txt&lt;br /&gt;
A 0 means we don't have founder data for that accelerator.&lt;br /&gt;
Specs: A tab delimited text file with the following fields:&lt;br /&gt;
 Accelerator   First Name   Last Name   LinkedInURL(if possible)&lt;br /&gt;
Getting the LinkedInURL will ensure accuracy, but will work without it.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Shrey: Find &amp;quot;demo day&amp;quot; keywords, so that we can search AcceleratorName Year Keyword and get back potential demo day pages&lt;br /&gt;
*Joe: Go through Accelerator list (approx 273 accelerators) and mark each by type (see below), building out type list as you go&lt;br /&gt;
&lt;br /&gt;
Type list:&lt;br /&gt;
*Private&lt;br /&gt;
*Corporate&lt;br /&gt;
*Academic&lt;br /&gt;
 Note: if DEAD, noted here.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Other info:&lt;br /&gt;
*nonprofit? (y/n)&lt;br /&gt;
&lt;br /&gt;
*Subtype abbreviations:&lt;br /&gt;
**S: for if a social entrepreneurship initiative&lt;br /&gt;
**I: for if an incubator&lt;br /&gt;
**A: for an angel group&lt;br /&gt;
**F: for foreign&lt;br /&gt;
**C: for in coworking space/hub/etc&lt;br /&gt;
**V: for if part of venture fund&lt;br /&gt;
**G: for if government funded/partnered&lt;br /&gt;
**T: for international&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 Note: subtypes (from individual text files in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data) were only found for 23 of the 270 accelerators.  These accelerators were initially intended to be removed from the master list.  Remaining subtypes are currently being added.&lt;br /&gt;
&lt;br /&gt;
other info: &lt;br /&gt;
&lt;br /&gt;
international offices, founders, industries, org type, program duration, or other interesting, easily accessed variables.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Type list file saved as &lt;br /&gt;
 &amp;quot;Accelerator type list&amp;quot; in E:\McNair\Projects\Accelerators\Fall 2017\Grouping project of ListOfAccs.&lt;br /&gt;
The list of ListofAccs, from which we drew Accelerator type list, should have no matches with any of the flagged accelerators in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data.  There are 23 matches though.  So all subtypes must be searched and entered manually.  Whether it is a nonprofit is listed in E:\McNair\Projects\Accelerators\Fall 2017\Grouping project of ListOfAccs, called &amp;quot;whether nonprofit...&amp;quot;&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in, as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC-backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==MATCHING==&lt;br /&gt;
&lt;br /&gt;
The files we used to match are located in the E drive. We used the matcher to match our portfolio company names from the cohort file located in E:\McNair\Projects\Accelerators. &lt;br /&gt;
*The files used to matching are located E:\McNair\Projects\Accelerators\Matcher&lt;br /&gt;
*Portco is the name of the companies pulled from the cohort file&lt;br /&gt;
*AccCo includes both the cohort company name, along with the name of the accelerator itself&lt;br /&gt;
*In the matcher, the inputs are the PortCo names, as well as the VC data from our pull in SDC&lt;br /&gt;
*The outputs include the AccCo_VC data located in E:\McNair\Projects\Accelerators which give a lot of information on the matches, including:&lt;br /&gt;
:*name of the match itself&lt;br /&gt;
:*number of investments&lt;br /&gt;
:*dates that the company received its investments&lt;br /&gt;
&lt;br /&gt;
==SDC Pull==&lt;br /&gt;
&lt;br /&gt;
We accessed SDC platinum and pulled information on round-based funding that all registered companies received from between the years 1999 to 2017.&lt;br /&gt;
&lt;br /&gt;
The receipt is as follows:&lt;br /&gt;
&lt;br /&gt;
Session Details&lt;br /&gt;
---------------&lt;br /&gt;
Request   Hits    Request Description&lt;br /&gt;
   0        -     DATABASE: Portfolio Companies (VIPC)&lt;br /&gt;
   1     96155    Venture Related Deals: Select All Venture Related Deals&lt;br /&gt;
   2     79572    Round Date: 1/1/1999 to 3/1/2017 (Custom) (Calendar)&lt;br /&gt;
   3              Custom Report: VC Data (Columnar) - Save As:&lt;br /&gt;
                  E:\McNair\Projects\Accelerators\VC Data.txt&lt;br /&gt;
�&lt;br /&gt;
Billing Ref # : 2054025&lt;br /&gt;
Capture File  : riceuniv.2054025&lt;br /&gt;
Session Name  : &lt;br /&gt;
&lt;br /&gt;
The VC data pull includes the following variables: &lt;br /&gt;
&lt;br /&gt;
Company Name                                                           Date Company      Date Company      Company        Company City                           Company Street Address, Line 1               Company Street Address, Line 2            Total Known     Company Industry Sub-Group 3                              Company Industry Major Group     Round          Company Stage Level 3     Round Amt,       Round Amt,&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/ (Does not work anymore)&lt;br /&gt;
&lt;br /&gt;
https://data.crunchbase.com/v3/docs/using-the-api (Has new instructions for application)&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]], Crunchbase, and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4 (Does not work - seems to be replaced by https://angel.co/companies?company_types[]=Incubator )&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
*Crunchbase has its own webpage with instructions for how we retrieve the data&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==SOURCE: Crunchbase==&lt;br /&gt;
*All of the information for the Crunchbase documentation is located in the page [[Crunchbase 2013 Snapshot]] webpage, along with the documentation for how we determined the accelerator information.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators/all==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/8&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;br /&gt;
&lt;br /&gt;
=Kauffman Foundation Incubator Proposal Information=&lt;br /&gt;
&lt;br /&gt;
==Institutions==&lt;br /&gt;
Summary: F6S, Crunchbase, seed-db&lt;br /&gt;
&lt;br /&gt;
Tools: Matcher - used to match lists of potential accelerators with our current list to identify duplicates/new matches (E:\McNair\Projects\Accelerators)&lt;br /&gt;
&lt;br /&gt;
===F6S===&lt;br /&gt;
F6S WebCrawler and F6S Parser - E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
===CrunchBase===&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(All Organizations)'''- E:\McNair\Projects\Accelerators\organizations.xls&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(Potential Accelerators)'''- E:\McNair\Projects\Accelerators\organizations.accdb under &amp;quot;Potential Accelerators query&amp;quot; &lt;br /&gt;
&lt;br /&gt;
*Obtained using keyword matches in the descriptions of the potential accelerators.&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(New Verified Accelerators)''' - E:\McNair\Projects\Accelerators\New CrunchBase Accelerators.xls&lt;br /&gt;
&lt;br /&gt;
We have the Crunchbase 2013 Snapshot which provided lots of new data on accelerators and incubators but we would love to use the Crunchbase API to get a current database snapshot that we could use to cross reference companies and add newly formed accelerator and incubator companies.&lt;br /&gt;
&lt;br /&gt;
===AngelList===&lt;br /&gt;
&lt;br /&gt;
===seed-db===&lt;br /&gt;
&lt;br /&gt;
Obtained through www.seed.db/accelerators&lt;br /&gt;
&lt;br /&gt;
===Global Accelerator Network (GAN)===&lt;br /&gt;
&lt;br /&gt;
GAN Parser- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\scrapeaccel.py&lt;br /&gt;
&lt;br /&gt;
GAN Data- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\GAN Accelerator Data&lt;br /&gt;
*Contains: Company Name, # of Companies Range, % of Companies Funded, Funding Raised by Companies, Employee Range, Exit Funding, Exit Date, Total Company Funding Raised, # of Mentors Range, % Equity, Location, Minimum Seed Capital Investment&lt;br /&gt;
&lt;br /&gt;
==Cohorts==&lt;br /&gt;
&lt;br /&gt;
*Cohorts obtained manually&lt;br /&gt;
*All Cohort txt files are saved under &amp;quot;E:\McNair\Projects\Accelerators\Data  &lt;br /&gt;
*cohort file name = (accelerator name).cohort&lt;br /&gt;
*Most updated Accelerator cohort data: E:\McNair\Projects\Accelerators\Cleaned Cohort Data.xls&lt;br /&gt;
&lt;br /&gt;
Automation for obtaining cohorts??&lt;br /&gt;
&lt;br /&gt;
==Other Information==&lt;br /&gt;
Summary: Whois Parser, Geocode, Tools to determine industry, etc&lt;br /&gt;
&lt;br /&gt;
===Whois Parser===&lt;br /&gt;
&lt;br /&gt;
*Retrieves and parses Whois information. Specifically, takes a file with a column of domain names and populates the corresponding columns with information from the WhoIs API.&lt;br /&gt;
&lt;br /&gt;
*Often used to obtain locations.&lt;br /&gt;
&lt;br /&gt;
===Geocode===&lt;br /&gt;
&lt;br /&gt;
Input: Company Address&lt;br /&gt;
Output: Directional Coordinates&lt;br /&gt;
&lt;br /&gt;
*Used to obtain the locations of different Accelerators and Cohort companies.&lt;br /&gt;
&lt;br /&gt;
===SDC Platinum Pull===&lt;br /&gt;
&lt;br /&gt;
Used to obtain funding information and match companies that have gotten funding with companies that are Accelerator cohorts.&lt;br /&gt;
&lt;br /&gt;
===Desired Information/Variables===&lt;br /&gt;
&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and&lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
&lt;br /&gt;
==Desired Tools/Information==&lt;br /&gt;
&lt;br /&gt;
===Automating the Process of Obtaining Cohorts===&lt;br /&gt;
*Automating this process would save a lot of time and really progress the project.&lt;br /&gt;
&lt;br /&gt;
===Obtaining More Details on Accelerators===&lt;br /&gt;
&lt;br /&gt;
*Having the kind of thorough information on industry, companies, funding, location, exits, mentors, leadership,  that we got for the GAN companies would be fantastic.&lt;br /&gt;
&lt;br /&gt;
===List of Alive/Dead Accelerators===&lt;br /&gt;
&lt;br /&gt;
This is a dream but would be very helpful&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=21859</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=21859"/>
		<updated>2017-11-14T21:04:06Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* Link to Crunchbase API application */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu, Veeral Shah,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has keywords=Accelerators,Data&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Current Work=&lt;br /&gt;
&lt;br /&gt;
TODO:&lt;br /&gt;
 McNair/Projects/Accelerators/Fall 2017/unfound_founders.txt&lt;br /&gt;
A 0 means we don't have founder data for that accelerator.&lt;br /&gt;
Specs: A tab delimited text file with the following fields:&lt;br /&gt;
 Accelerator   First Name   Last Name   LinkedInURL(if possible)&lt;br /&gt;
Getting the LinkedInURL will ensure accuracy, but will work without it.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Shrey: Find &amp;quot;demo day&amp;quot; keywords, so that we can search AcceleratorName Year Keyword and get back potential demo day pages&lt;br /&gt;
*Joe: Go through Accelerator list (approx 273 accelerators) and mark each by type (see below), building out type list as you go&lt;br /&gt;
&lt;br /&gt;
Type list:&lt;br /&gt;
*Private&lt;br /&gt;
*Corporate&lt;br /&gt;
*Academic&lt;br /&gt;
 Note: if DEAD, noted here.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Other info:&lt;br /&gt;
*nonprofit? (y/n)&lt;br /&gt;
&lt;br /&gt;
*Subtype abbreviations:&lt;br /&gt;
**S: for if a social entrepreneurship initiative&lt;br /&gt;
**I: for if an incubator&lt;br /&gt;
**A: for an angel group&lt;br /&gt;
**F: for foreign&lt;br /&gt;
**C: for in coworking space/hub/etc&lt;br /&gt;
**V: for if part of venture fund&lt;br /&gt;
**G: for if government funded/partnered&lt;br /&gt;
**T: for international&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
 Note: subtypes (from individual text files in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data) were only found for 23 of the 270 accelerators.  These accelerators were initially intended to be removed from the master list.  Remaining subtypes are currently being added.&lt;br /&gt;
&lt;br /&gt;
other info: &lt;br /&gt;
&lt;br /&gt;
international offices, founders, industries, org type, program duration, or other interesting, easily accessed variables.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Type list file saved as &lt;br /&gt;
 &amp;quot;Accelerator type list&amp;quot; in E:\McNair\Projects\Accelerators\Fall 2017\Grouping project of ListOfAccs.&lt;br /&gt;
The list of ListofAccs, from which we drew Accelerator type list, should have no matches with any of the flagged accelerators in E:\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data.  There are 23 matches though.  So all subtypes must be searched and entered manually.  Whether it is a nonprofit is listed in E:\McNair\Projects\Accelerators\Fall 2017\Grouping project of ListOfAccs, called &amp;quot;whether nonprofit...&amp;quot;&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in, as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC-backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==MATCHING==&lt;br /&gt;
&lt;br /&gt;
The files we used to match are located in the E drive. We used the matcher to match our portfolio company names from the cohort file located in E:\McNair\Projects\Accelerators. &lt;br /&gt;
*The files used to matching are located E:\McNair\Projects\Accelerators\Matcher&lt;br /&gt;
*Portco is the name of the companies pulled from the cohort file&lt;br /&gt;
*AccCo includes both the cohort company name, along with the name of the accelerator itself&lt;br /&gt;
*In the matcher, the inputs are the PortCo names, as well as the VC data from our pull in SDC&lt;br /&gt;
*The outputs include the AccCo_VC data located in E:\McNair\Projects\Accelerators which give a lot of information on the matches, including:&lt;br /&gt;
:*name of the match itself&lt;br /&gt;
:*number of investments&lt;br /&gt;
:*dates that the company received its investments&lt;br /&gt;
&lt;br /&gt;
==SDC Pull==&lt;br /&gt;
&lt;br /&gt;
We accessed SDC platinum and pulled information on round-based funding that all registered companies received from between the years 1999 to 2017.&lt;br /&gt;
&lt;br /&gt;
The receipt is as follows:&lt;br /&gt;
&lt;br /&gt;
Session Details&lt;br /&gt;
---------------&lt;br /&gt;
Request   Hits    Request Description&lt;br /&gt;
   0        -     DATABASE: Portfolio Companies (VIPC)&lt;br /&gt;
   1     96155    Venture Related Deals: Select All Venture Related Deals&lt;br /&gt;
   2     79572    Round Date: 1/1/1999 to 3/1/2017 (Custom) (Calendar)&lt;br /&gt;
   3              Custom Report: VC Data (Columnar) - Save As:&lt;br /&gt;
                  E:\McNair\Projects\Accelerators\VC Data.txt&lt;br /&gt;
�&lt;br /&gt;
Billing Ref # : 2054025&lt;br /&gt;
Capture File  : riceuniv.2054025&lt;br /&gt;
Session Name  : &lt;br /&gt;
&lt;br /&gt;
The VC data pull includes the following variables: &lt;br /&gt;
&lt;br /&gt;
Company Name                                                           Date Company      Date Company      Company        Company City                           Company Street Address, Line 1               Company Street Address, Line 2            Total Known     Company Industry Sub-Group 3                              Company Industry Major Group     Round          Company Stage Level 3     Round Amt,       Round Amt,&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/ (Does not work anymore)&lt;br /&gt;
&lt;br /&gt;
https://data.crunchbase.com/v3/docs/using-the-api (Has new instructions for application)&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]], Crunchbase, and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
*Crunchbase has its own webpage with instructions for how we retrieve the data&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==SOURCE: Crunchbase==&lt;br /&gt;
*All of the information for the Crunchbase documentation is located in the page [[Crunchbase 2013 Snapshot]] webpage, along with the documentation for how we determined the accelerator information.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators/all==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/8&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;br /&gt;
&lt;br /&gt;
=Kauffman Foundation Incubator Proposal Information=&lt;br /&gt;
&lt;br /&gt;
==Institutions==&lt;br /&gt;
Summary: F6S, Crunchbase, seed-db&lt;br /&gt;
&lt;br /&gt;
Tools: Matcher - used to match lists of potential accelerators with our current list to identify duplicates/new matches (E:\McNair\Projects\Accelerators)&lt;br /&gt;
&lt;br /&gt;
===F6S===&lt;br /&gt;
F6S WebCrawler and F6S Parser - E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
===CrunchBase===&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(All Organizations)'''- E:\McNair\Projects\Accelerators\organizations.xls&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(Potential Accelerators)'''- E:\McNair\Projects\Accelerators\organizations.accdb under &amp;quot;Potential Accelerators query&amp;quot; &lt;br /&gt;
&lt;br /&gt;
*Obtained using keyword matches in the descriptions of the potential accelerators.&lt;br /&gt;
&lt;br /&gt;
CrunchBase 2013 Snapshot '''(New Verified Accelerators)''' - E:\McNair\Projects\Accelerators\New CrunchBase Accelerators.xls&lt;br /&gt;
&lt;br /&gt;
We have the Crunchbase 2013 Snapshot which provided lots of new data on accelerators and incubators but we would love to use the Crunchbase API to get a current database snapshot that we could use to cross reference companies and add newly formed accelerator and incubator companies.&lt;br /&gt;
&lt;br /&gt;
===AngelList===&lt;br /&gt;
&lt;br /&gt;
===seed-db===&lt;br /&gt;
&lt;br /&gt;
Obtained through www.seed.db/accelerators&lt;br /&gt;
&lt;br /&gt;
===Global Accelerator Network (GAN)===&lt;br /&gt;
&lt;br /&gt;
GAN Parser- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\scrapeaccel.py&lt;br /&gt;
&lt;br /&gt;
GAN Data- E:\McNair\Projects\Accelerators\Web Scraping for Accelerators\GAN Accelerator Data&lt;br /&gt;
*Contains: Company Name, # of Companies Range, % of Companies Funded, Funding Raised by Companies, Employee Range, Exit Funding, Exit Date, Total Company Funding Raised, # of Mentors Range, % Equity, Location, Minimum Seed Capital Investment&lt;br /&gt;
&lt;br /&gt;
==Cohorts==&lt;br /&gt;
&lt;br /&gt;
*Cohorts obtained manually&lt;br /&gt;
*All Cohort txt files are saved under &amp;quot;E:\McNair\Projects\Accelerators\Data  &lt;br /&gt;
*cohort file name = (accelerator name).cohort&lt;br /&gt;
*Most updated Accelerator cohort data: E:\McNair\Projects\Accelerators\Cleaned Cohort Data.xls&lt;br /&gt;
&lt;br /&gt;
Automation for obtaining cohorts??&lt;br /&gt;
&lt;br /&gt;
==Other Information==&lt;br /&gt;
Summary: Whois Parser, Geocode, Tools to determine industry, etc&lt;br /&gt;
&lt;br /&gt;
===Whois Parser===&lt;br /&gt;
&lt;br /&gt;
*Retrieves and parses Whois information. Specifically, takes a file with a column of domain names and populates the corresponding columns with information from the WhoIs API.&lt;br /&gt;
&lt;br /&gt;
*Often used to obtain locations.&lt;br /&gt;
&lt;br /&gt;
===Geocode===&lt;br /&gt;
&lt;br /&gt;
Input: Company Address&lt;br /&gt;
Output: Directional Coordinates&lt;br /&gt;
&lt;br /&gt;
*Used to obtain the locations of different Accelerators and Cohort companies.&lt;br /&gt;
&lt;br /&gt;
===SDC Platinum Pull===&lt;br /&gt;
&lt;br /&gt;
Used to obtain funding information and match companies that have gotten funding with companies that are Accelerator cohorts.&lt;br /&gt;
&lt;br /&gt;
===Desired Information/Variables===&lt;br /&gt;
&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and&lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
&lt;br /&gt;
==Desired Tools/Information==&lt;br /&gt;
&lt;br /&gt;
===Automating the Process of Obtaining Cohorts===&lt;br /&gt;
*Automating this process would save a lot of time and really progress the project.&lt;br /&gt;
&lt;br /&gt;
===Obtaining More Details on Accelerators===&lt;br /&gt;
&lt;br /&gt;
*Having the kind of thorough information on industry, companies, funding, location, exits, mentors, leadership,  that we got for the GAN companies would be fantastic.&lt;br /&gt;
&lt;br /&gt;
===List of Alive/Dead Accelerators===&lt;br /&gt;
&lt;br /&gt;
This is a dream but would be very helpful&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21811</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21811"/>
		<updated>2017-11-13T21:32:15Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* List of All Relevant Files */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
&lt;br /&gt;
*'''Cohort Directory &amp;quot;Big Push&amp;quot;'''&lt;br /&gt;
&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
**Potential Crunchbase Variables&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name, Whois parser code&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21810</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21810"/>
		<updated>2017-11-13T21:13:03Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* List of All Relevant Files */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*Cohort Directory &amp;quot;Big Push&amp;quot;&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
**Potential Crunchbase Variables&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name, Whois parser code&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21809</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21809"/>
		<updated>2017-11-13T21:11:38Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* List of All Relevant Files */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*Cohort Directory &amp;quot;Big Push&amp;quot;&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
**Potential Crunchbase Variables&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21725</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21725"/>
		<updated>2017-11-09T21:07:47Z</updated>

		<summary type="html">&lt;p&gt;Shrey: Created page with &amp;quot;{{McNair Projects |Has title=Composite Accelerator Data |Has owner=Matthew Ringheanu, Shrey Agarwal, |Has start date=Fall 2016 |Has deadline=Fall 2017 |Has keywords=Accelerato...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal&amp;diff=21716</id>
		<title>Shrey Agarwal</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal&amp;diff=21716"/>
		<updated>2017-11-09T18:59:42Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{{McNair Staff&lt;br /&gt;
|position=Research Team&lt;br /&gt;
|name=Shrey Agarwal&lt;br /&gt;
|degree=BSc.&lt;br /&gt;
|major=Materials Science and Nanoengineering; Business Minor&lt;br /&gt;
|class=2020,&lt;br /&gt;
|join_date=September 2016,&lt;br /&gt;
|skills=C, C++, Writing, Editing, Excel, Graphic Design&lt;br /&gt;
|interests=Rafting, Hiking, Video Games, Sustainable Energy, Basketball, Economics, Public Policy&lt;br /&gt;
|fun_fact=Traveled to 26 countries&lt;br /&gt;
|email=mailto:sva2@rice.edu&lt;br /&gt;
|skype_name=shrey.agarwal84&lt;br /&gt;
|status=Active&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
==Intro==&lt;br /&gt;
&lt;br /&gt;
Shrey Agarwal is a student currently working as a Research Assistant for the James A. Baker III Institute for Public Policy's McNair Center for Entrepreneurship and Innovation. Shrey is working towards a degree in Materials Science and Nanoengineering from Rice University, along with a minor in business. Shrey is currently a freshman at McMurtry college. Along with research at the McNair center, Shrey is involved with Catalyst, Rice's undergraduate science journal, and the Rice Annual Fund, where he works in the Industrials section. Shrey plans to get a MBA once he completes his undergrad engineering degree.&lt;br /&gt;
&lt;br /&gt;
==Early Life==&lt;br /&gt;
&lt;br /&gt;
Shrey was born in Rourkela, India, where he lived until he was 5. Somehow he ended up at Rice in Houston.&lt;br /&gt;
&lt;br /&gt;
==Time at McNair==&lt;br /&gt;
[[Shrey Agarwal (Work Log)]]&lt;br /&gt;
&amp;lt;!-- null edit dummy --&amp;gt;[[Category:McNair Staff]]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=21666</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=21666"/>
		<updated>2017-11-07T22:17:49Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* Fall 2017 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Shrey Agarwal]] [[Work Logs]] [[Shrey Agarwal (Work Log)|(log page)]&lt;br /&gt;
&lt;br /&gt;
9/19/17 15:00 - 17:00&lt;br /&gt;
*Became reacclimatized with the project, spoke with Ed about the direction for the rest of the semester&lt;br /&gt;
9/20/17 15:00 - 17:00&lt;br /&gt;
*Worked on setting up a new pull for the updated SDC data&lt;br /&gt;
9/21/17 15:00 - 17:00&lt;br /&gt;
*Finished the pull and sorted the data from the updated accelerator list&lt;br /&gt;
9/22/17 15:00 - 17:00&lt;br /&gt;
*Tried to set up the matcher with Matthew; ran into some difficulties on Power Shell, returning a blank file in the output&lt;br /&gt;
9/26/17 15:00 - 17:00&lt;br /&gt;
*Finished the match and created pivot tables to count the number of repetitions (companies going through more than one accelerator)&lt;br /&gt;
9/27/17 15:00 - 17:00&lt;br /&gt;
*Discussed with Matthew the best way to collect the VC data from the repetitions. We tried different matches through our SDC data to no avail&lt;br /&gt;
9/28/17 16:00 - 17:00&lt;br /&gt;
*Continued attempting to match with SDC the different columns. Didn't work without separating the data into individual files, a very tedious process.&lt;br /&gt;
9/29/17 15:00 - 17:00&lt;br /&gt;
*Spoke with Ed about incubators project, will begin as soon as we can time the accelerator startup investments. Ed is expecting us to begin sometime in the next two months, using a similar process as we did for incubators. The process should be handled by a new worker.&lt;br /&gt;
10/02/17 15:00 - 17:00&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the CrunchBase database on SQL and brushed up on SQL code.&lt;br /&gt;
10/03/17 15:00 - 17:00&lt;br /&gt;
*Sifted through the database for Crunchbase investment information.&lt;br /&gt;
10/04/17 15:00 - 17:00&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
10/06/17 15:00 - 17:00&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
10/11/17 15:00 - 17:00&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
10/12/17 15:00 - 17:00&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for identifying the time when a company went through the accelerator. Created a list of VC Companies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
10/16/17 15:00 - 17:00&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
10/17/17 15:00 - 17:00&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
10/18/17 15:00 - 17:00&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
10/19/17 15:00 - 17:00&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
10/20/17 15:00 - 17:00&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
10/23/17 15:00 - 17:00&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
10/24/17 15:00 - 17:00&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
10/25/17 15:00 - 17:00&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
10/26/17 15:00 - 17:00&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
10/30/17 15:00 - 17:00&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
10/31/17 15:00 - 17:00&lt;br /&gt;
*Began compiling data in the column for the dates that a specific company went through an Accelerator.&lt;br /&gt;
11/01/17 15:00 - 17:00&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
11/02/17 15:00 - 17:00&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
11/06/17 15:00 - 17:00&lt;br /&gt;
*Began looking at keywords for identifying the cohort class dates for each company&lt;br /&gt;
11/07/17 15:00 - 17:00&lt;br /&gt;
*Received list from Peter with the accelerator founders matched from the Crunchbase LinkedIn URLs and proceeded to find the links for those founders without a match on Crunchbase. Data found in &amp;quot;Unfound Founders List&amp;quot; in the Fall 2017 folder&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;br /&gt;
3/01/17 14:00 - 16:00&lt;br /&gt;
*Worked with Ben and Matthew to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
3/02/17 14:00 - 16:00&lt;br /&gt;
*Tried to repeat the VC data pull without it crashing from pulling too many entries. Unfortunately, we were unable to finish it&lt;br /&gt;
3/06/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to put final touches on the cohort data to prep it for matching with our VC data&lt;br /&gt;
3/07/17 14:00 - 16:00&lt;br /&gt;
*Finally finished working on the cohort files, will match on the 8th&lt;br /&gt;
3/08/17 14:00 - 16:00&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
3/20/17 14:00 - 16:00&lt;br /&gt;
*Participated in a SQL training session with Ed, learned how to create a database and to pull tab delimited information from text files onto a table&lt;br /&gt;
3/21/17 14:00 - 16:00&lt;br /&gt;
*Met with Ed and arrived at the conclusion of finishing the draft for a report by the end of the semester. Put the initial report information on the accelerator page using the variables that we currently have&lt;br /&gt;
3/22/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to compile tables in our database of the matched VC-portfolio company lists and the overall accelerator cohort information. Found multiple errors in the cohort file which needed to be fixed before finishing the tables and analyzing the data&lt;br /&gt;
3/23/17 14:00 - 16:00&lt;br /&gt;
*Finished cleaning the cohort file once again.&lt;br /&gt;
3/24/17 14:00 - 16:00&lt;br /&gt;
*Continued practicing my SQL and creating the code for compiling the tables&lt;br /&gt;
3/29/17 14:00 - 16:00&lt;br /&gt;
*Worked on the matched data with Matthew. Will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC-backed company names matched to one cohort company name&lt;br /&gt;
3/30/17 14:00 - 16:00&lt;br /&gt;
*Examined the Regex code for the URLs and attempted to filter them out&lt;br /&gt;
4/03/17 14:00 - 16:00&lt;br /&gt;
*Continued learning some SQL from Ed&lt;br /&gt;
4/04/17 14:00 - 16:00&lt;br /&gt;
*Began examining the Crunchbase data; looked through the 2013 snapshot&lt;br /&gt;
*Created a new Crunchbase account with McNair center and examined the basic access, which does not give us much information&lt;br /&gt;
4/05/17 14:00 - 16:00&lt;br /&gt;
*Made the final VC percentage table from our database and previous code with Ed; realized we were missing many accelerators as well as a lot of important cohort data so need to reexamine our previous data.&lt;br /&gt;
4/06/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through Crunchbase to see how we can pull accelerators up until 2013; most likely will use objects to sort the data into accelerators, perhaps keywords from &amp;quot;accelerators&amp;quot;&lt;br /&gt;
4/07/17 14:00 - 16:00&lt;br /&gt;
*Examined SARP and attempted to match their accelerators with the ones from our data, realized that a few of our cohorts were missing as well as a few of the actual accelerators so we need to fix the data in our excel file&lt;br /&gt;
*Began compiling a list of missing accelerators on textpad to later insert into our excel.&lt;br /&gt;
4/10/17 13:00 - 16:00&lt;br /&gt;
*Worked with Ben to find missing accelerators from the Crunchbase data using the keywords. Also, began recording information from some of the big accelerators we were missing&lt;br /&gt;
*Found 228 matches for accelerators, will match from our list to find the similarities&lt;br /&gt;
4/11/17 14:00 - 16:00&lt;br /&gt;
*Finished compiling the accelerator and cohort information for the few we found from SARP, will consult Ed to figure out how to approach the missing accelerators and what to do for the preliminary report&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Egg_Egan&amp;diff=20824</id>
		<title>Egg Egan</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Egg_Egan&amp;diff=20824"/>
		<updated>2017-10-16T21:05:29Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Staff&lt;br /&gt;
|position=Omelet&lt;br /&gt;
|name=Egg Egan&lt;br /&gt;
|user_image=Egg.jpg&lt;br /&gt;
|degree=Medium Well&lt;br /&gt;
|class=Grade A&lt;br /&gt;
|skills=Skillet Maneuvering, cracking under pressure, punctuality&lt;br /&gt;
|status=Cooked&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Ingredients=&lt;br /&gt;
# 3 incubated eggs&lt;br /&gt;
# 1/2 cup deadweight loss&lt;br /&gt;
# 100ml free markets&lt;br /&gt;
# 251g accelerators&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Egg_Egan&amp;diff=20811</id>
		<title>Egg Egan</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Egg_Egan&amp;diff=20811"/>
		<updated>2017-10-16T20:11:50Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* Ingredients */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Staff&lt;br /&gt;
|position=Omelet&lt;br /&gt;
|name=Egg Eggan&lt;br /&gt;
|user_image=Egg.jpg&lt;br /&gt;
|degree=Medium Well&lt;br /&gt;
|class=Grade A&lt;br /&gt;
|skills=Skillet Maneuvering, cracking under pressure&lt;br /&gt;
|status=Cooked&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Ingredients=&lt;br /&gt;
# 3 eggs&lt;br /&gt;
# 1/2 cup deadweight loss&lt;br /&gt;
# 1/3 cup free markets&lt;br /&gt;
# 251g Accelerators&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Egg_Egan&amp;diff=20588</id>
		<title>Egg Egan</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Egg_Egan&amp;diff=20588"/>
		<updated>2017-10-04T03:51:13Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Staff&lt;br /&gt;
|position=Omelet&lt;br /&gt;
|name=Egg Eggan&lt;br /&gt;
|user_image=Egg.jpg&lt;br /&gt;
|degree=Medium Well&lt;br /&gt;
|class=Grade A&lt;br /&gt;
|skills=Skillet Maneuvering, cracking under pressure&lt;br /&gt;
|status=Cooked&lt;br /&gt;
}}&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=20554</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=20554"/>
		<updated>2017-10-03T20:35:16Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;br /&gt;
3/01/17 14:00 - 16:00&lt;br /&gt;
*Worked with Ben and Matthew to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
3/02/17 14:00 - 16:00&lt;br /&gt;
*Tried to repeat the VC data pull without it crashing from pulling too many entries. Unfortunately, we were unable to finish it&lt;br /&gt;
3/06/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to put final touches on the cohort data to prep it for matching with our VC data&lt;br /&gt;
3/07/17 14:00 - 16:00&lt;br /&gt;
*Finally finished working on the cohort files, will match on the 8th&lt;br /&gt;
3/08/17 14:00 - 16:00&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
3/20/17 14:00 - 16:00&lt;br /&gt;
*Participated in a SQL training session with Ed, learned how to create a database and to pull tab delimited information from text files onto a table&lt;br /&gt;
3/21/17 14:00 - 16:00&lt;br /&gt;
*Met with Ed and arrived at the conclusion of finishing the draft for a report by the end of the semester. Put the initial report information on the accelerator page using the variables that we currently have&lt;br /&gt;
3/22/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to compile tables in our database of the matched VC-portfolio company lists and the overall accelerator cohort information. Found multiple errors in the cohort file which needed to be fixed before finishing the tables and analyzing the data&lt;br /&gt;
3/23/17 14:00 - 16:00&lt;br /&gt;
*Finished cleaning the cohort file once again.&lt;br /&gt;
3/24/17 14:00 - 16:00&lt;br /&gt;
*Continued practicing my SQL and creating the code for compiling the tables&lt;br /&gt;
3/29/17 14:00 - 16:00&lt;br /&gt;
*Worked on the matched data with Matthew. Will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC-backed company names matched to one cohort company name&lt;br /&gt;
3/30/17 14:00 - 16:00&lt;br /&gt;
*Examined the Regex code for the URLs and attempted to filter them out&lt;br /&gt;
4/03/17 14:00 - 16:00&lt;br /&gt;
*Continued learning some SQL from Ed&lt;br /&gt;
4/04/17 14:00 - 16:00&lt;br /&gt;
*Began examining the Crunchbase data; looked through the 2013 snapshot&lt;br /&gt;
*Created a new Crunchbase account with McNair center and examined the basic access, which does not give us much information&lt;br /&gt;
4/05/17 14:00 - 16:00&lt;br /&gt;
*Made the final VC percentage table from our database and previous code with Ed; realized we were missing many accelerators as well as a lot of important cohort data so need to reexamine our previous data.&lt;br /&gt;
4/06/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through Crunchbase to see how we can pull accelerators up until 2013; most likely will use objects to sort the data into accelerators, perhaps keywords from &amp;quot;accelerators&amp;quot;&lt;br /&gt;
4/07/17 14:00 - 16:00&lt;br /&gt;
*Examined SARP and attempted to match their accelerators with the ones from our data, realized that a few of our cohorts were missing as well as a few of the actual accelerators so we need to fix the data in our excel file&lt;br /&gt;
*Began compiling a list of missing accelerators on textpad to later insert into our excel.&lt;br /&gt;
4/10/17 13:00 - 16:00&lt;br /&gt;
*Worked with Ben to find missing accelerators from the Crunchbase data using the keywords. Also, began recording information from some of the big accelerators we were missing&lt;br /&gt;
*Found 228 matches for accelerators, will match from our list to find the similarities&lt;br /&gt;
4/11/17 14:00 - 16:00&lt;br /&gt;
*Finished compiling the accelerator and cohort information for the few we found from SARP, will consult Ed to figure out how to approach the missing accelerators and what to do for the preliminary report&lt;br /&gt;
9/19/17 15:00 - 17:00&lt;br /&gt;
*Became reacclimatized with the project, spoke with Ed about the direction for the rest of the semester&lt;br /&gt;
9/20/17 15:00 - 17:00&lt;br /&gt;
*Worked on setting up a new pull for the updated SDC data&lt;br /&gt;
9/21/17 15:00 - 17:00&lt;br /&gt;
*Finished the pull and sorted the data from the updated accelerator list&lt;br /&gt;
9/22/17 15:00 - 17:00&lt;br /&gt;
*Tried to set up the matcher with Matthew; ran into some difficulties on Power Shell, returning a blank file in the output&lt;br /&gt;
9/26/17 15:00 - 17:00&lt;br /&gt;
*Finished the match and created pivot tables to count the number of repetitions (companies going through more than one accelerator)&lt;br /&gt;
9/27/17 15:00 - 17:00&lt;br /&gt;
*Discussed with Matthew the best way to collect the VC data from the repetitions. We tried different matches through our SDC data to no avail&lt;br /&gt;
9/28/17 16:00 - 17:00&lt;br /&gt;
*Continued attempting to match with SDC the different columns. Didn't work without separating the data into individual files, a very tedious process.&lt;br /&gt;
9/29/17 15:00 - 17:00&lt;br /&gt;
*Spoke with Ed about incubators project, will begin as soon as we can time the accelerator startup investments. Ed is expecting us to begin sometime in the next two months, using a similar process as we did for incubators. The process should be handled by a new worker.&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=20339</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=20339"/>
		<updated>2017-09-26T21:03:42Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;br /&gt;
3/01/17 14:00 - 16:00&lt;br /&gt;
*Worked with Ben and Matthew to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
3/02/17 14:00 - 16:00&lt;br /&gt;
*Tried to repeat the VC data pull without it crashing from pulling too many entries. Unfortunately, we were unable to finish it&lt;br /&gt;
3/06/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to put final touches on the cohort data to prep it for matching with our VC data&lt;br /&gt;
3/07/17 14:00 - 16:00&lt;br /&gt;
*Finally finished working on the cohort files, will match on the 8th&lt;br /&gt;
3/08/17 14:00 - 16:00&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
3/20/17 14:00 - 16:00&lt;br /&gt;
*Participated in a SQL training session with Ed, learned how to create a database and to pull tab delimited information from text files onto a table&lt;br /&gt;
3/21/17 14:00 - 16:00&lt;br /&gt;
*Met with Ed and arrived at the conclusion of finishing the draft for a report by the end of the semester. Put the initial report information on the accelerator page using the variables that we currently have&lt;br /&gt;
3/22/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to compile tables in our database of the matched VC-portfolio company lists and the overall accelerator cohort information. Found multiple errors in the cohort file which needed to be fixed before finishing the tables and analyzing the data&lt;br /&gt;
3/23/17 14:00 - 16:00&lt;br /&gt;
*Finished cleaning the cohort file once again.&lt;br /&gt;
3/24/17 14:00 - 16:00&lt;br /&gt;
*Continued practicing my SQL and creating the code for compiling the tables&lt;br /&gt;
3/29/17 14:00 - 16:00&lt;br /&gt;
*Worked on the matched data with Matthew. Will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC-backed company names matched to one cohort company name&lt;br /&gt;
3/30/17 14:00 - 16:00&lt;br /&gt;
*Examined the Regex code for the URLs and attempted to filter them out&lt;br /&gt;
4/03/17 14:00 - 16:00&lt;br /&gt;
*Continued learning some SQL from Ed&lt;br /&gt;
4/04/17 14:00 - 16:00&lt;br /&gt;
*Began examining the Crunchbase data; looked through the 2013 snapshot&lt;br /&gt;
*Created a new Crunchbase account with McNair center and examined the basic access, which does not give us much information&lt;br /&gt;
4/05/17 14:00 - 16:00&lt;br /&gt;
*Made the final VC percentage table from our database and previous code with Ed; realized we were missing many accelerators as well as a lot of important cohort data so need to reexamine our previous data.&lt;br /&gt;
4/06/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through Crunchbase to see how we can pull accelerators up until 2013; most likely will use objects to sort the data into accelerators, perhaps keywords from &amp;quot;accelerators&amp;quot;&lt;br /&gt;
4/07/17 14:00 - 16:00&lt;br /&gt;
*Examined SARP and attempted to match their accelerators with the ones from our data, realized that a few of our cohorts were missing as well as a few of the actual accelerators so we need to fix the data in our excel file&lt;br /&gt;
*Began compiling a list of missing accelerators on textpad to later insert into our excel.&lt;br /&gt;
4/10/17 13:00 - 16:00&lt;br /&gt;
*Worked with Ben to find missing accelerators from the Crunchbase data using the keywords. Also, began recording information from some of the big accelerators we were missing&lt;br /&gt;
*Found 228 matches for accelerators, will match from our list to find the similarities&lt;br /&gt;
4/11/17 14:00 - 16:00&lt;br /&gt;
*Finished compiling the accelerator and cohort information for the few we found from SARP, will consult Ed to figure out how to approach the missing accelerators and what to do for the preliminary report&lt;br /&gt;
9/19/17 15:00 - 17:00&lt;br /&gt;
*Became reacclimatized with the project, spoke with Ed about the direction for the rest of the semester&lt;br /&gt;
9/20/17 15:00 - 17:00&lt;br /&gt;
*Worked on setting up a new pull for the updated SDC data&lt;br /&gt;
9/21/17 15:00 - 17:00&lt;br /&gt;
*Finished the pull and sorted the data from the updated accelerator list&lt;br /&gt;
9/22/17 15:00 - 17:00&lt;br /&gt;
*Tried to set up the matcher with Matthew; ran into some difficulties on Power Shell, returning a blank file in the output&lt;br /&gt;
9/26/17 15:00 - 17:00&lt;br /&gt;
*Finished the match and created pivot tables to count the number of repetitions (companies going through more than one accelerator)&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=17944</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=17944"/>
		<updated>2017-04-19T20:03:57Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
||Has keywords=Accelerators,Data&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in, as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC-backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==MATCHING==&lt;br /&gt;
&lt;br /&gt;
The files we used to match are located in the E drive. We used the matcher to match our portfolio company names from the cohort file located in E:\McNair\Projects\Accelerators. &lt;br /&gt;
*The files used to matching are located E:\McNair\Projects\Accelerators\Matcher&lt;br /&gt;
*Portco is the name of the companies pulled from the cohort file&lt;br /&gt;
*AccCo includes both the cohort company name, along with the name of the accelerator itself&lt;br /&gt;
*In the matcher, the inputs are the PortCo names, as well as the VC data from our pull in SDC&lt;br /&gt;
*The outputs include the AccCo_VC data located in E:\McNair\Projects\Accelerators which give a lot of information on the matches, including:&lt;br /&gt;
:*name of the match itself&lt;br /&gt;
:*number of investments&lt;br /&gt;
:*dates that the company received its investments&lt;br /&gt;
&lt;br /&gt;
==SDC Pull==&lt;br /&gt;
&lt;br /&gt;
We accessed SDC platinum and pulled information on round-based funding that all registered companies received from between the years 1999 to 2017.&lt;br /&gt;
&lt;br /&gt;
The receipt is as follows:&lt;br /&gt;
&lt;br /&gt;
Session Details&lt;br /&gt;
---------------&lt;br /&gt;
Request   Hits    Request Description&lt;br /&gt;
   0        -     DATABASE: Portfolio Companies (VIPC)&lt;br /&gt;
   1     96155    Venture Related Deals: Select All Venture Related Deals&lt;br /&gt;
   2     79572    Round Date: 1/1/1999 to 3/1/2017 (Custom) (Calendar)&lt;br /&gt;
   3              Custom Report: VC Data (Columnar) - Save As:&lt;br /&gt;
                  E:\McNair\Projects\Accelerators\VC Data.txt&lt;br /&gt;
�&lt;br /&gt;
Billing Ref # : 2054025&lt;br /&gt;
Capture File  : riceuniv.2054025&lt;br /&gt;
Session Name  : &lt;br /&gt;
&lt;br /&gt;
The VC data pull includes the following variables: &lt;br /&gt;
&lt;br /&gt;
Company Name                                                           Date Company      Date Company      Company        Company City                           Company Street Address, Line 1               Company Street Address, Line 2            Total Known     Company Industry Sub-Group 3                              Company Industry Major Group     Round          Company Stage Level 3     Round Amt,       Round Amt,&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]], Crunchbase, and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
*Crunchbase has its own webpage with instructions for how we retrieve the data&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==SOURCE: Crunchbase==&lt;br /&gt;
*All of the information for the Crunchbase documentation is located in the page [[Crunchbase 2013 Snapshot]] webpage, along with the documentation for how we determined the accelerator information.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators/all==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/8&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17870</id>
		<title>Crunchbase 2013 Snapshot</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17870"/>
		<updated>2017-04-17T20:59:02Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Username: mcnair@rice.edu&lt;br /&gt;
&lt;br /&gt;
password: amount&lt;br /&gt;
&lt;br /&gt;
==Original Email==&lt;br /&gt;
&lt;br /&gt;
Thank you for submitting a request for Research Access to Crunchbase through our API. We have reviewed your request, and granted you Basic Access. You can now access Crunchbase data in the following ways. &lt;br /&gt;
&lt;br /&gt;
Check out the Open Data Map&lt;br /&gt;
Explore the 2013 Snapshot&lt;br /&gt;
Visit our website for instructions on accessing Crunchbase data. To access the REST API, you'll need your user key: &lt;br /&gt;
&lt;br /&gt;
6d382e4bbdaa297138f32a588b139f53&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
With Basic Access, API use is limited to the Open Data Map and 2013 Snapshot. Access to the full API and latest funding round data requires a license. To learn more check out our offerings. &lt;br /&gt;
&lt;br /&gt;
==Basic Membership==&lt;br /&gt;
*Can not seem to filter results past the first 50 companies&lt;br /&gt;
*Very basic information such as company name, location, industry classification, website, and &amp;quot;Crunchbase ranking&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
==Retrieval==&lt;br /&gt;
&lt;br /&gt;
The data was retrieved by Shrey and Matthew through an application from the Crunchbase Website for the API service. The data took about a month to come in due to a lack of response from Crunchbase itself. Eventually, they gave us basic access.&lt;br /&gt;
&lt;br /&gt;
==Content==&lt;br /&gt;
&lt;br /&gt;
The snapshot contained 2 .tar.qz files, which were extracted into 181/crunchbase using the command&lt;br /&gt;
 tar -zxvf file.tar.gz&lt;br /&gt;
&lt;br /&gt;
The csv files (organizations.csv and people.csv) were copied for access to:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
&lt;br /&gt;
The files (size in bytes) and their contents are&lt;br /&gt;
&lt;br /&gt;
'''crunchbase_2013_snapshot_mysql.tar.gz'''&lt;br /&gt;
*license.txt		 526 &lt;br /&gt;
*cb_objects.sql	 338955612 &lt;br /&gt;
*cb_offices.sql	 14850092 &lt;br /&gt;
*cb_people.sql		 13253952 &lt;br /&gt;
*cb_ipos.sql		 178397 &lt;br /&gt;
*cb_milestones.sql	 10498840 &lt;br /&gt;
*cb_funds.sql		 385010 &lt;br /&gt;
*cb_relationships.sql	 48655529 &lt;br /&gt;
*cb_degrees.sql	 13829471 &lt;br /&gt;
*cb_investments.sql	 6185134 &lt;br /&gt;
*cb_acquisitions.sql	 2309393 &lt;br /&gt;
*cb_funding_rounds.sql	 14681705 &lt;br /&gt;
&lt;br /&gt;
'''odm.csv.tar.gz'''&lt;br /&gt;
*organizations.csv	 212013301&lt;br /&gt;
**459916 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***primary_role&lt;br /&gt;
***name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***homepage_domain&lt;br /&gt;
***homepage_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***stock_symbol&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***short_description&lt;br /&gt;
*people.csv	 	 188924229&lt;br /&gt;
**521634 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***first_name&lt;br /&gt;
***last_name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***title&lt;br /&gt;
***organization&lt;br /&gt;
***organization_crunchbase_url&lt;br /&gt;
*crunchbase_license.txt 487&lt;br /&gt;
&lt;br /&gt;
==Changing MYSQL to PostgreSQL==&lt;br /&gt;
&lt;br /&gt;
The SQL files were generated in MySQL. We need to convert them to PostgreSQL. See: https://en.wikibooks.org/wiki/Converting_MySQL_to_PostgreSQL and http://stackoverflow.com/questions/1942586/comparison-of-database-column-types-in-mysql-postgresql-and-sqlite-cross-map&lt;br /&gt;
&lt;br /&gt;
The key changes are:&lt;br /&gt;
&lt;br /&gt;
 MYSQL          POSTGRESQL&lt;br /&gt;
 -----          ----------&lt;br /&gt;
 LOCK           --comment out as no need but LOCK [ TABLE ] [ ONLY ] name [ * ] [, ...] [ IN lockmode MODE ] [ NOWAIT ]&lt;br /&gt;
 UNLOCK         --comment out&lt;br /&gt;
 decimal(x,y)   real (might work as is)&lt;br /&gt;
 datetime       timestamp&lt;br /&gt;
 KEY            --comment out as no need but FOREIGN KEY ( column_name [, ... ] ) REFERENCES reftable [ ( refcolumn [, ... ] ) ]&lt;br /&gt;
&lt;br /&gt;
==Documentation and File Locations==&lt;br /&gt;
The Crunchbase information were broken down into two different files:&lt;br /&gt;
*The &amp;quot;organizations&amp;quot; Excel file contains: crunchbase_uuid	type	primary_role	name	crunchbase_url	homepage_domain	homepage_url	profile_image_url	facebook_url	twitter_url	linkedin_url	stock_symbol	location_city	location_region	location_country_code	short_description&lt;br /&gt;
:* Located in E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
*The &amp;quot;people&amp;quot;  Excel file contains: crunchbase_uuid	type	first_name	last_name	crunchbase_url	profile_image_url	facebook_url	twitter_url	linkedin_url	location_city	location_region	location_country_code	title	organization	organization_crunchbase_url&lt;br /&gt;
:* Located in E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
&lt;br /&gt;
==Obtaining Accelerators from the &amp;quot;Organizations&amp;quot; file==&lt;br /&gt;
#Created new columns in the data labeled &amp;quot;match: blah&amp;quot;, where blah is the word we're searching for in the descriptions&lt;br /&gt;
#Added: &amp;quot;=if(isnumber(search(&amp;quot;blah&amp;quot;,B2))=TRUE,1,0)&amp;quot;, where blah is the substring (what you're searching for), B2 is the string (what your searching in) and 1 represents that it's present and 0 means it isn't.&lt;br /&gt;
#Added: &amp;quot;=sum(A1:C1) This just sums the cells from A1 to C1&amp;quot;&lt;br /&gt;
#Compiled a list of potential accelerators depending on the number of matches (the sum)&lt;br /&gt;
:# File is labeled &amp;quot;PotentialAccelerators&amp;quot; which just has the list of accelerators we were considering based on their match number; located in E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
&lt;br /&gt;
==Appending Crunchbase Accelerators Information to Old Accelerator Data==&lt;br /&gt;
Matthew created a file called &amp;quot;Crunchbase Potential Accelerators&amp;quot;, which repeats the names of all the accelerators in the Crunchbase folder, but includes a note as to whether the accelerator is already included in our data, or whether we need to add the file to our data before the semester ends.&lt;br /&gt;
*Adding the accelerator consists of adding the accelerator text file with basic information, the cohort html file, and most importantly, the cohort text file so we can calculate the VC raise rate&lt;br /&gt;
&lt;br /&gt;
[[category:internal]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Talk:Accelerator_Seed_List_(Data)&amp;diff=17860</id>
		<title>Talk:Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Talk:Accelerator_Seed_List_(Data)&amp;diff=17860"/>
		<updated>2017-04-17T20:44:27Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Rank on VC&lt;br /&gt;
*Getting a VC percentage for each Accelerator&lt;br /&gt;
&lt;br /&gt;
Also categorize&lt;br /&gt;
*Age&lt;br /&gt;
*Nonprofit or not&lt;br /&gt;
*Location&lt;br /&gt;
&lt;br /&gt;
RegEx Code for repeating data down for the round data from SDC:&lt;br /&gt;
&lt;br /&gt;
\n([^\t]+\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t[^\t]*\t)(.*)\n\t\t\t\t\t\t\t\t\t\t&lt;br /&gt;
&lt;br /&gt;
\n\1\2\n\1&lt;br /&gt;
&lt;br /&gt;
=if(isnumber(search(&amp;quot;blah&amp;quot;,B2))=TRUE,1,0)&lt;br /&gt;
where blah is the substring (what you're searching for), B2 is the string (what your searching in) and 1 represents that it's present and 0 means it isn't.&lt;br /&gt;
&lt;br /&gt;
=sum(A1:C1)&lt;br /&gt;
This just sums the cells from A1 to C1&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=17859</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=17859"/>
		<updated>2017-04-17T20:37:51Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
||Has keywords=Accelerators,Data&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in, as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]], Crunchbase, and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
*Crunchbase has its own webpage with instructions for how we retrieve the data&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==SOURCE: Crunchbase==&lt;br /&gt;
*All of the information for the Crunchbase documentation is located in the page [[Crunchbase 2013 Snapshot]] webpage, along with the documentation for how we determined the accelerator information.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators/all==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/8&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=17692</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=17692"/>
		<updated>2017-04-12T20:47:22Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;br /&gt;
3/01/17 14:00 - 16:00&lt;br /&gt;
*Worked with Ben and Matthew to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
3/02/17 14:00 - 16:00&lt;br /&gt;
*Tried to repeat the VC data pull without it crashing from pulling too many entries. Unfortunately, we were unable to finish it&lt;br /&gt;
3/06/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to put final touches on the cohort data to prep it for matching with our VC data&lt;br /&gt;
3/07/17 14:00 - 16:00&lt;br /&gt;
*Finally finished working on the cohort files, will match on the 8th&lt;br /&gt;
3/08/17 14:00 - 16:00&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
3/20/17 14:00 - 16:00&lt;br /&gt;
*Participated in a SQL training session with Ed, learned how to create a database and to pull tab delimited information from text files onto a table&lt;br /&gt;
3/21/17 14:00 - 16:00&lt;br /&gt;
*Met with Ed and arrived at the conclusion of finishing the draft for a report by the end of the semester. Put the initial report information on the accelerator page using the variables that we currently have&lt;br /&gt;
3/22/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to compile tables in our database of the matched VC-portfolio company lists and the overall accelerator cohort information. Found multiple errors in the cohort file which needed to be fixed before finishing the tables and analyzing the data&lt;br /&gt;
3/23/17 14:00 - 16:00&lt;br /&gt;
*Finished cleaning the cohort file once again.&lt;br /&gt;
3/24/17 14:00 - 16:00&lt;br /&gt;
*Continued practicing my SQL and creating the code for compiling the tables&lt;br /&gt;
3/29/17 14:00 - 16:00&lt;br /&gt;
*Worked on the matched data with Matthew. Will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC-backed company names matched to one cohort company name&lt;br /&gt;
3/30/17 14:00 - 16:00&lt;br /&gt;
*Examined the Regex code for the URLs and attempted to filter them out&lt;br /&gt;
4/03/17 14:00 - 16:00&lt;br /&gt;
*Continued learning some SQL from Ed&lt;br /&gt;
4/04/17 14:00 - 16:00&lt;br /&gt;
*Began examining the Crunchbase data; looked through the 2013 snapshot&lt;br /&gt;
*Created a new Crunchbase account with McNair center and examined the basic access, which does not give us much information&lt;br /&gt;
4/05/17 14:00 - 16:00&lt;br /&gt;
*Made the final VC percentage table from our database and previous code with Ed; realized we were missing many accelerators as well as a lot of important cohort data so need to reexamine our previous data.&lt;br /&gt;
4/06/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through Crunchbase to see how we can pull accelerators up until 2013; most likely will use objects to sort the data into accelerators, perhaps keywords from &amp;quot;accelerators&amp;quot;&lt;br /&gt;
4/07/17 14:00 - 16:00&lt;br /&gt;
*Examined SARP and attempted to match their accelerators with the ones from our data, realized that a few of our cohorts were missing as well as a few of the actual accelerators so we need to fix the data in our excel file&lt;br /&gt;
*Began compiling a list of missing accelerators on textpad to later insert into our excel.&lt;br /&gt;
4/10/17 13:00 - 16:00&lt;br /&gt;
*Worked with Ben to find missing accelerators from the Crunchbase data using the keywords. Also, began recording information from some of the big accelerators we were missing&lt;br /&gt;
*Found 228 matches for accelerators, will match from our list to find the similarities&lt;br /&gt;
4/11/17 14:00 - 16:00&lt;br /&gt;
*Finished compiling the accelerator and cohort information for the few we found from SARP, will consult Ed to figure out how to approach the missing accelerators and what to do for the preliminary report&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=17568</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=17568"/>
		<updated>2017-04-07T19:26:01Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;br /&gt;
3/01/17 14:00 - 16:00&lt;br /&gt;
*Worked with Ben and Matthew to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
3/02/17 14:00 - 16:00&lt;br /&gt;
*Tried to repeat the VC data pull without it crashing from pulling too many entries. Unfortunately, we were unable to finish it&lt;br /&gt;
3/06/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to put final touches on the cohort data to prep it for matching with our VC data&lt;br /&gt;
3/07/17 14:00 - 16:00&lt;br /&gt;
*Finally finished working on the cohort files, will match on the 8th&lt;br /&gt;
3/08/17 14:00 - 16:00&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
3/20/17 14:00 - 16:00&lt;br /&gt;
*Participated in a SQL training session with Ed, learned how to create a database and to pull tab delimited information from text files onto a table&lt;br /&gt;
3/21/17 14:00 - 16:00&lt;br /&gt;
*Met with Ed and arrived at the conclusion of finishing the draft for a report by the end of the semester. Put the initial report information on the accelerator page using the variables that we currently have&lt;br /&gt;
3/22/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to compile tables in our database of the matched VC-portfolio company lists and the overall accelerator cohort information. Found multiple errors in the cohort file which needed to be fixed before finishing the tables and analyzing the data&lt;br /&gt;
3/23/17 14:00 - 16:00&lt;br /&gt;
*Finished cleaning the cohort file once again.&lt;br /&gt;
3/24/17 14:00 - 16:00&lt;br /&gt;
*Continued practicing my SQL and creating the code for compiling the tables&lt;br /&gt;
3/29/17 14:00 - 16:00&lt;br /&gt;
*Worked on the matched data with Matthew. Will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC-backed company names matched to one cohort company name&lt;br /&gt;
3/30/17 14:00 - 16:00&lt;br /&gt;
*Examined the Regex code for the URLs and attempted to filter them out&lt;br /&gt;
4/03/17 14:00 - 16:00&lt;br /&gt;
*Continued learning some SQL from Ed&lt;br /&gt;
4/04/17 14:00 - 16:00&lt;br /&gt;
*Began examining the Crunchbase data; looked through the 2013 snapshot&lt;br /&gt;
*Created a new Crunchbase account with McNair center and examined the basic access, which does not give us much information&lt;br /&gt;
4/05/17 14:00 - 16:00&lt;br /&gt;
*Made the final VC percentage table from our database and previous code with Ed; realized we were missing many accelerators as well as a lot of important cohort data so need to reexamine our previous data.&lt;br /&gt;
4/06/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through Crunchbase to see how we can pull accelerators up until 2013; most likely will use objects to sort the data into accelerators, perhaps keywords from &amp;quot;accelerators&amp;quot;&lt;br /&gt;
4/07/17 14:00 - 16:00&lt;br /&gt;
*Examined SARP and attempted to match their accelerators with the ones from our data, realized that a few of our cohorts were missing as well as a few of the actual accelerators so we need to fix the data in our excel file&lt;br /&gt;
*Began compiling a list of missing accelerators on textpad to later insert into our excel.&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Emerging_Ecosystems&amp;diff=17487</id>
		<title>Emerging Ecosystems</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Emerging_Ecosystems&amp;diff=17487"/>
		<updated>2017-04-05T21:06:23Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Emerging Entrepreneurship Ecosystems&lt;br /&gt;
|Has owner=Eliza Martin&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
}}&lt;br /&gt;
Tentative timeline for six blog posts to be completed by end of the semester: &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
St. Louis: completed by 3/24 and sent for peer editing. &lt;br /&gt;
&lt;br /&gt;
Washington DC: already completed, but needs editing (on 3/27). Send for peer editing 3/27. &lt;br /&gt;
&lt;br /&gt;
Houston: complete by 3/30 and send for peer editing. &lt;br /&gt;
&lt;br /&gt;
Chicago: complete by 4/7 and send for peer editing. &lt;br /&gt;
&lt;br /&gt;
Las Vegas: complete by 4/14 and send for peer editing. &lt;br /&gt;
&lt;br /&gt;
Nashville: complete by 4/21 and send for peer editing. &lt;br /&gt;
&lt;br /&gt;
Denver: complete by 4/25 and send for peer editing. &lt;br /&gt;
&lt;br /&gt;
File with cities and accelerators called &amp;quot;City Accelerators Eliza&amp;quot; located in E:\McNair\Projects\Accelerators&lt;br /&gt;
*Should be able to copy and paste into Excel to look at all of the data, except for the last few entries because we found those on the web&lt;br /&gt;
 &lt;br /&gt;
==Project Overview==&lt;br /&gt;
According to a 2015 Global Entrepreneurship Monitor Report, approximately 11.9% of United States adults are starting and running new businesses. The United States Bureau of Labor Statistics reports an increase in the number of businesses less than one year old and jobs created by businesses less than one year old. While entrepreneurship in the United States is strong, there are significant challenges to maintaining growth. &lt;br /&gt;
&lt;br /&gt;
Important for continued US entrepreneurship growth is the development of entrepreneurial ecosystems. Accelerators, angel investors, hubs, strong crowd funding and micro-finance availability, encouraging regulatory environment, and availability of venture capital are all components on successful ecosystems. Silicon Valley (Palo Alto, California), Route 128 (Massachusetts), and The Research Triangle (North-Carolina) have facilitated the success of many companies such as Facebook, Google, Integrated Computer Solutions, and Digital Equipment Corporation. Across the country, many new entrepreneurship ecosystems are emerging. &lt;br /&gt;
&lt;br /&gt;
This project will examine the emerging ecosystems throughout the United States in a blog post format. &lt;br /&gt;
&lt;br /&gt;
==List of Emerging Ecosystems==&lt;br /&gt;
*Austin (completed: [[Austin TX Emerging Ecosystems (Blog Post)| Austin, TX: Emerging Ecosystems]])&lt;br /&gt;
*Denver&lt;br /&gt;
*Cincinnati (completed: http://mcnair.bakerinstitute.org/wiki/Cincinnati_Ecosystem) &lt;br /&gt;
*Chicago&lt;br /&gt;
*Las Vegas&lt;br /&gt;
*Minneapolis-St. Paul&lt;br /&gt;
*St. Louis (in progress: http://mcnair.bakerinstitute.org/wiki/St._Louis:_Emerging_Ecosystems) &lt;br /&gt;
*Kansas City&lt;br /&gt;
*Nashville&lt;br /&gt;
*Miami&lt;br /&gt;
*San Diego&lt;br /&gt;
*Phoenix&lt;br /&gt;
*Washington DC (completed but not published)&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17376</id>
		<title>Crunchbase 2013 Snapshot</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17376"/>
		<updated>2017-04-04T20:48:08Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Username: mcnair@rice.edu&lt;br /&gt;
&lt;br /&gt;
password: amount&lt;br /&gt;
&lt;br /&gt;
==Original Email==&lt;br /&gt;
&lt;br /&gt;
Thank you for submitting a request for Research Access to Crunchbase through our API. We have reviewed your request, and granted you Basic Access. You can now access Crunchbase data in the following ways. &lt;br /&gt;
&lt;br /&gt;
Check out the Open Data Map&lt;br /&gt;
Explore the 2013 Snapshot&lt;br /&gt;
Visit our website for instructions on accessing Crunchbase data. To access the REST API, you'll need your user key: &lt;br /&gt;
&lt;br /&gt;
6d382e4bbdaa297138f32a588b139f53&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
With Basic Access, API use is limited to the Open Data Map and 2013 Snapshot. Access to the full API and latest funding round data requires a license. To learn more check out our offerings. &lt;br /&gt;
&lt;br /&gt;
==Basic Membership==&lt;br /&gt;
*Can not seem to filter results past the first 50 companies&lt;br /&gt;
*Very basic information such as company name, location, industry classification, website, and &amp;quot;Crunchbase ranking&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
==Retrieval==&lt;br /&gt;
&lt;br /&gt;
The data was retrieved by Shrey and Matthew through an application from the Crunchbase Website for the API service. The data took about a month to come in due to a lack of response from Crunchbase itself. Eventually, they gave us basic access.&lt;br /&gt;
&lt;br /&gt;
==Content==&lt;br /&gt;
&lt;br /&gt;
The snapshot contained 2 .tar.qz files, which were extracted into 181/crunchbase using the command&lt;br /&gt;
 tar -zxvf file.tar.gz&lt;br /&gt;
&lt;br /&gt;
The csv files (organizations.csv and people.csv) were copied for access to:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
&lt;br /&gt;
The files (size in bytes) and their contents are&lt;br /&gt;
&lt;br /&gt;
'''crunchbase_2013_snapshot_mysql.tar.gz'''&lt;br /&gt;
*license.txt		 526 &lt;br /&gt;
*cb_objects.sql	 338955612 &lt;br /&gt;
*cb_offices.sql	 14850092 &lt;br /&gt;
*cb_people.sql		 13253952 &lt;br /&gt;
*cb_ipos.sql		 178397 &lt;br /&gt;
*cb_milestones.sql	 10498840 &lt;br /&gt;
*cb_funds.sql		 385010 &lt;br /&gt;
*cb_relationships.sql	 48655529 &lt;br /&gt;
*cb_degrees.sql	 13829471 &lt;br /&gt;
*cb_investments.sql	 6185134 &lt;br /&gt;
*cb_acquisitions.sql	 2309393 &lt;br /&gt;
*cb_funding_rounds.sql	 14681705 &lt;br /&gt;
&lt;br /&gt;
'''odm.csv.tar.gz'''&lt;br /&gt;
*organizations.csv	 212013301&lt;br /&gt;
**459916 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***primary_role&lt;br /&gt;
***name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***homepage_domain&lt;br /&gt;
***homepage_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***stock_symbol&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***short_description&lt;br /&gt;
*people.csv	 	 188924229&lt;br /&gt;
**521634 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***first_name&lt;br /&gt;
***last_name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***title&lt;br /&gt;
***organization&lt;br /&gt;
***organization_crunchbase_url&lt;br /&gt;
*crunchbase_license.txt 487&lt;br /&gt;
&lt;br /&gt;
==Changing MYSQL to PostgreSQL==&lt;br /&gt;
&lt;br /&gt;
The SQL files were generated in MySQL. We need to convert them to PostgreSQL. See: https://en.wikibooks.org/wiki/Converting_MySQL_to_PostgreSQL and http://stackoverflow.com/questions/1942586/comparison-of-database-column-types-in-mysql-postgresql-and-sqlite-cross-map&lt;br /&gt;
&lt;br /&gt;
The key changes are:&lt;br /&gt;
&lt;br /&gt;
 MYSQL          POSTGRESQL&lt;br /&gt;
 -----          ----------&lt;br /&gt;
 LOCK           --comment out as no need but LOCK [ TABLE ] [ ONLY ] name [ * ] [, ...] [ IN lockmode MODE ] [ NOWAIT ]&lt;br /&gt;
 UNLOCK         --comment out&lt;br /&gt;
 decimal(x,y)   real (might work as is)&lt;br /&gt;
 datetime       timestamp&lt;br /&gt;
 KEY            --comment out as no need but FOREIGN KEY ( column_name [, ... ] ) REFERENCES reftable [ ( refcolumn [, ... ] ) ]&lt;br /&gt;
&lt;br /&gt;
[[category:internal]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17374</id>
		<title>Crunchbase 2013 Snapshot</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17374"/>
		<updated>2017-04-04T20:44:21Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Username: mcnair@rice.edu&lt;br /&gt;
&lt;br /&gt;
password: amount&lt;br /&gt;
&lt;br /&gt;
==Original Email==&lt;br /&gt;
&lt;br /&gt;
Thank you for submitting a request for Research Access to Crunchbase through our API. We have reviewed your request, and granted you Basic Access. You can now access Crunchbase data in the following ways. &lt;br /&gt;
&lt;br /&gt;
Check out the Open Data Map&lt;br /&gt;
Explore the 2013 Snapshot&lt;br /&gt;
Visit our website for instructions on accessing Crunchbase data. To access the REST API, you'll need your user key: &lt;br /&gt;
&lt;br /&gt;
6d382e4bbdaa297138f32a588b139f53&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
With Basic Access, API use is limited to the Open Data Map and 2013 Snapshot. Access to the full API and latest funding round data requires a license. To learn more check out our offerings. &lt;br /&gt;
&lt;br /&gt;
==Retrieval==&lt;br /&gt;
&lt;br /&gt;
The data was retrieved by Shrey and Matthew through an application from the Crunchbase Website for the API service. The data took about a month to come in due to a lack of response from Crunchbase itself. Eventually, they gave us basic access.&lt;br /&gt;
&lt;br /&gt;
==Content==&lt;br /&gt;
&lt;br /&gt;
The snapshot contained 2 .tar.qz files, which were extracted into 181/crunchbase using the command&lt;br /&gt;
 tar -zxvf file.tar.gz&lt;br /&gt;
&lt;br /&gt;
The csv files (organizations.csv and people.csv) were copied for access to:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
&lt;br /&gt;
The files (size in bytes) and their contents are&lt;br /&gt;
&lt;br /&gt;
'''crunchbase_2013_snapshot_mysql.tar.gz'''&lt;br /&gt;
*license.txt		 526 &lt;br /&gt;
*cb_objects.sql	 338955612 &lt;br /&gt;
*cb_offices.sql	 14850092 &lt;br /&gt;
*cb_people.sql		 13253952 &lt;br /&gt;
*cb_ipos.sql		 178397 &lt;br /&gt;
*cb_milestones.sql	 10498840 &lt;br /&gt;
*cb_funds.sql		 385010 &lt;br /&gt;
*cb_relationships.sql	 48655529 &lt;br /&gt;
*cb_degrees.sql	 13829471 &lt;br /&gt;
*cb_investments.sql	 6185134 &lt;br /&gt;
*cb_acquisitions.sql	 2309393 &lt;br /&gt;
*cb_funding_rounds.sql	 14681705 &lt;br /&gt;
&lt;br /&gt;
'''odm.csv.tar.gz'''&lt;br /&gt;
*organizations.csv	 212013301&lt;br /&gt;
**459916 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***primary_role&lt;br /&gt;
***name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***homepage_domain&lt;br /&gt;
***homepage_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***stock_symbol&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***short_description&lt;br /&gt;
*people.csv	 	 188924229&lt;br /&gt;
**521634 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***first_name&lt;br /&gt;
***last_name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***title&lt;br /&gt;
***organization&lt;br /&gt;
***organization_crunchbase_url&lt;br /&gt;
*crunchbase_license.txt 487&lt;br /&gt;
&lt;br /&gt;
==Changing MYSQL to PostgreSQL==&lt;br /&gt;
&lt;br /&gt;
The SQL files were generated in MySQL. We need to convert them to PostgreSQL. See: https://en.wikibooks.org/wiki/Converting_MySQL_to_PostgreSQL and http://stackoverflow.com/questions/1942586/comparison-of-database-column-types-in-mysql-postgresql-and-sqlite-cross-map&lt;br /&gt;
&lt;br /&gt;
The key changes are:&lt;br /&gt;
&lt;br /&gt;
 MYSQL          POSTGRESQL&lt;br /&gt;
 -----          ----------&lt;br /&gt;
 LOCK           --comment out as no need but LOCK [ TABLE ] [ ONLY ] name [ * ] [, ...] [ IN lockmode MODE ] [ NOWAIT ]&lt;br /&gt;
 UNLOCK         --comment out&lt;br /&gt;
 decimal(x,y)   real (might work as is)&lt;br /&gt;
 datetime       timestamp&lt;br /&gt;
 KEY            --comment out as no need but FOREIGN KEY ( column_name [, ... ] ) REFERENCES reftable [ ( refcolumn [, ... ] ) ]&lt;br /&gt;
&lt;br /&gt;
[[category:internal]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17369</id>
		<title>Crunchbase 2013 Snapshot</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17369"/>
		<updated>2017-04-04T20:35:43Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Username: mcnair@rice.edu&lt;br /&gt;
password: amount&lt;br /&gt;
&lt;br /&gt;
==Original Email==&lt;br /&gt;
&lt;br /&gt;
Thank you for submitting a request for Research Access to Crunchbase through our API. We have reviewed your request, and granted you Basic Access. You can now access Crunchbase data in the following ways. &lt;br /&gt;
&lt;br /&gt;
Check out the Open Data Map&lt;br /&gt;
Explore the 2013 Snapshot&lt;br /&gt;
Visit our website for instructions on accessing Crunchbase data. To access the REST API, you'll need your user key: &lt;br /&gt;
&lt;br /&gt;
6d382e4bbdaa297138f32a588b139f53&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
With Basic Access, API use is limited to the Open Data Map and 2013 Snapshot. Access to the full API and latest funding round data requires a license. To learn more check out our offerings. &lt;br /&gt;
&lt;br /&gt;
==Retrieval==&lt;br /&gt;
&lt;br /&gt;
The data was retrieved by Shrey and Matthew through an application from the Crunchbase Website for the API service. The data took about a month to come in due to a lack of response from Crunchbase itself. Eventually, they gave us basic access.&lt;br /&gt;
&lt;br /&gt;
==Content==&lt;br /&gt;
&lt;br /&gt;
The snapshot contained 2 .tar.qz files, which were extracted into 181/crunchbase using the command&lt;br /&gt;
 tar -zxvf file.tar.gz&lt;br /&gt;
&lt;br /&gt;
The csv files (organizations.csv and people.csv) were copied for access to:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
&lt;br /&gt;
The files (size in bytes) and their contents are&lt;br /&gt;
&lt;br /&gt;
'''crunchbase_2013_snapshot_mysql.tar.gz'''&lt;br /&gt;
*license.txt		 526 &lt;br /&gt;
*cb_objects.sql	 338955612 &lt;br /&gt;
*cb_offices.sql	 14850092 &lt;br /&gt;
*cb_people.sql		 13253952 &lt;br /&gt;
*cb_ipos.sql		 178397 &lt;br /&gt;
*cb_milestones.sql	 10498840 &lt;br /&gt;
*cb_funds.sql		 385010 &lt;br /&gt;
*cb_relationships.sql	 48655529 &lt;br /&gt;
*cb_degrees.sql	 13829471 &lt;br /&gt;
*cb_investments.sql	 6185134 &lt;br /&gt;
*cb_acquisitions.sql	 2309393 &lt;br /&gt;
*cb_funding_rounds.sql	 14681705 &lt;br /&gt;
&lt;br /&gt;
'''odm.csv.tar.gz'''&lt;br /&gt;
*organizations.csv	 212013301&lt;br /&gt;
**459916 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***primary_role&lt;br /&gt;
***name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***homepage_domain&lt;br /&gt;
***homepage_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***stock_symbol&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***short_description&lt;br /&gt;
*people.csv	 	 188924229&lt;br /&gt;
**521634 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***first_name&lt;br /&gt;
***last_name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***title&lt;br /&gt;
***organization&lt;br /&gt;
***organization_crunchbase_url&lt;br /&gt;
*crunchbase_license.txt 487&lt;br /&gt;
&lt;br /&gt;
==Changing MYSQL to PostgreSQL==&lt;br /&gt;
&lt;br /&gt;
The SQL files were generated in MySQL. We need to convert them to PostgreSQL. See: https://en.wikibooks.org/wiki/Converting_MySQL_to_PostgreSQL and http://stackoverflow.com/questions/1942586/comparison-of-database-column-types-in-mysql-postgresql-and-sqlite-cross-map&lt;br /&gt;
&lt;br /&gt;
The key changes are:&lt;br /&gt;
&lt;br /&gt;
 MYSQL          POSTGRESQL&lt;br /&gt;
 -----          ----------&lt;br /&gt;
 LOCK           --comment out as no need but LOCK [ TABLE ] [ ONLY ] name [ * ] [, ...] [ IN lockmode MODE ] [ NOWAIT ]&lt;br /&gt;
 UNLOCK         --comment out&lt;br /&gt;
 decimal(x,y)   real (might work as is)&lt;br /&gt;
 datetime       timestamp&lt;br /&gt;
 KEY            --comment out as no need but FOREIGN KEY ( column_name [, ... ] ) REFERENCES reftable [ ( refcolumn [, ... ] ) ]&lt;br /&gt;
&lt;br /&gt;
[[category:internal]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17366</id>
		<title>Crunchbase 2013 Snapshot</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17366"/>
		<updated>2017-04-04T20:24:58Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Original Email==&lt;br /&gt;
&lt;br /&gt;
Thank you for submitting a request for Research Access to Crunchbase through our API. We have reviewed your request, and granted you Basic Access. You can now access Crunchbase data in the following ways. &lt;br /&gt;
&lt;br /&gt;
Check out the Open Data Map&lt;br /&gt;
Explore the 2013 Snapshot&lt;br /&gt;
Visit our website for instructions on accessing Crunchbase data. To access the REST API, you'll need your user key: &lt;br /&gt;
&lt;br /&gt;
6d382e4bbdaa297138f32a588b139f53&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
With Basic Access, API use is limited to the Open Data Map and 2013 Snapshot. Access to the full API and latest funding round data requires a license. To learn more check out our offerings. &lt;br /&gt;
&lt;br /&gt;
==Retrieval==&lt;br /&gt;
&lt;br /&gt;
The data was retrieved by Shrey and Matthew through an application from the Crunchbase Website for the API service. The data took about a month to come in due to a lack of response from Crunchbase itself. Eventually, they gave us basic access.&lt;br /&gt;
&lt;br /&gt;
==Content==&lt;br /&gt;
&lt;br /&gt;
The snapshot contained 2 .tar.qz files, which were extracted into 181/crunchbase using the command&lt;br /&gt;
 tar -zxvf file.tar.gz&lt;br /&gt;
&lt;br /&gt;
The csv files (organizations.csv and people.csv) were copied for access to:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
&lt;br /&gt;
The files (size in bytes) and their contents are&lt;br /&gt;
&lt;br /&gt;
'''crunchbase_2013_snapshot_mysql.tar.gz'''&lt;br /&gt;
*license.txt		 526 &lt;br /&gt;
*cb_objects.sql	 338955612 &lt;br /&gt;
*cb_offices.sql	 14850092 &lt;br /&gt;
*cb_people.sql		 13253952 &lt;br /&gt;
*cb_ipos.sql		 178397 &lt;br /&gt;
*cb_milestones.sql	 10498840 &lt;br /&gt;
*cb_funds.sql		 385010 &lt;br /&gt;
*cb_relationships.sql	 48655529 &lt;br /&gt;
*cb_degrees.sql	 13829471 &lt;br /&gt;
*cb_investments.sql	 6185134 &lt;br /&gt;
*cb_acquisitions.sql	 2309393 &lt;br /&gt;
*cb_funding_rounds.sql	 14681705 &lt;br /&gt;
&lt;br /&gt;
'''odm.csv.tar.gz'''&lt;br /&gt;
*organizations.csv	 212013301&lt;br /&gt;
**459916 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***primary_role&lt;br /&gt;
***name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***homepage_domain&lt;br /&gt;
***homepage_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***stock_symbol&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***short_description&lt;br /&gt;
*people.csv	 	 188924229&lt;br /&gt;
**521634 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***first_name&lt;br /&gt;
***last_name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***title&lt;br /&gt;
***organization&lt;br /&gt;
***organization_crunchbase_url&lt;br /&gt;
*crunchbase_license.txt 487&lt;br /&gt;
&lt;br /&gt;
==Changing MYSQL to PostgreSQL==&lt;br /&gt;
&lt;br /&gt;
The SQL files were generated in MySQL. We need to convert them to PostgreSQL. See: https://en.wikibooks.org/wiki/Converting_MySQL_to_PostgreSQL and http://stackoverflow.com/questions/1942586/comparison-of-database-column-types-in-mysql-postgresql-and-sqlite-cross-map&lt;br /&gt;
&lt;br /&gt;
The key changes are:&lt;br /&gt;
&lt;br /&gt;
 MYSQL          POSTGRESQL&lt;br /&gt;
 -----          ----------&lt;br /&gt;
 LOCK           --comment out as no need but LOCK [ TABLE ] [ ONLY ] name [ * ] [, ...] [ IN lockmode MODE ] [ NOWAIT ]&lt;br /&gt;
 UNLOCK         --comment out&lt;br /&gt;
 decimal(x,y)   real (might work as is)&lt;br /&gt;
 datetime       timestamp&lt;br /&gt;
 KEY            --comment out as no need but FOREIGN KEY ( column_name [, ... ] ) REFERENCES reftable [ ( refcolumn [, ... ] ) ]&lt;br /&gt;
&lt;br /&gt;
[[category:internal]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17361</id>
		<title>Crunchbase 2013 Snapshot</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Crunchbase_2013_Snapshot&amp;diff=17361"/>
		<updated>2017-04-04T20:21:28Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Original Email==&lt;br /&gt;
&lt;br /&gt;
Thank you for submitting a request for Research Access to Crunchbase through our API. We have reviewed your request, and granted you Basic Access. You can now access Crunchbase data in the following ways. &lt;br /&gt;
&lt;br /&gt;
Check out the Open Data Map&lt;br /&gt;
Explore the 2013 Snapshot&lt;br /&gt;
Visit our website for instructions on accessing Crunchbase data. To access the REST API, you'll need your user key: &lt;br /&gt;
&lt;br /&gt;
6d382e4bbdaa297138f32a588b139f53&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
With Basic Access, API use is limited to the Open Data Map and 2013 Snapshot. Access to the full API and latest funding round data requires a license. To learn more check out our offerings. &lt;br /&gt;
&lt;br /&gt;
==Retrieval==&lt;br /&gt;
&lt;br /&gt;
The data was retrieved by Shrey and Matthew - STATE HOW AND FROM WHERE&lt;br /&gt;
&lt;br /&gt;
==Content==&lt;br /&gt;
&lt;br /&gt;
The snapshot contained 2 .tar.qz files, which were extracted into 181/crunchbase using the command&lt;br /&gt;
 tar -zxvf file.tar.gz&lt;br /&gt;
&lt;br /&gt;
The csv files (organizations.csv and people.csv) were copied for access to:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Crunchbase Snapshot&lt;br /&gt;
&lt;br /&gt;
The files (size in bytes) and their contents are&lt;br /&gt;
&lt;br /&gt;
'''crunchbase_2013_snapshot_mysql.tar.gz'''&lt;br /&gt;
*license.txt		 526 &lt;br /&gt;
*cb_objects.sql	 338955612 &lt;br /&gt;
*cb_offices.sql	 14850092 &lt;br /&gt;
*cb_people.sql		 13253952 &lt;br /&gt;
*cb_ipos.sql		 178397 &lt;br /&gt;
*cb_milestones.sql	 10498840 &lt;br /&gt;
*cb_funds.sql		 385010 &lt;br /&gt;
*cb_relationships.sql	 48655529 &lt;br /&gt;
*cb_degrees.sql	 13829471 &lt;br /&gt;
*cb_investments.sql	 6185134 &lt;br /&gt;
*cb_acquisitions.sql	 2309393 &lt;br /&gt;
*cb_funding_rounds.sql	 14681705 &lt;br /&gt;
&lt;br /&gt;
'''odm.csv.tar.gz'''&lt;br /&gt;
*organizations.csv	 212013301&lt;br /&gt;
**459916 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***primary_role&lt;br /&gt;
***name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***homepage_domain&lt;br /&gt;
***homepage_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***stock_symbol&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***short_description&lt;br /&gt;
*people.csv	 	 188924229&lt;br /&gt;
**521634 records with the following fields: &lt;br /&gt;
***crunchbase_uuid&lt;br /&gt;
***type&lt;br /&gt;
***first_name&lt;br /&gt;
***last_name&lt;br /&gt;
***crunchbase_url&lt;br /&gt;
***profile_image_url&lt;br /&gt;
***facebook_url&lt;br /&gt;
***twitter_url&lt;br /&gt;
***linkedin_url&lt;br /&gt;
***location_city&lt;br /&gt;
***location_region&lt;br /&gt;
***location_country_code&lt;br /&gt;
***title&lt;br /&gt;
***organization&lt;br /&gt;
***organization_crunchbase_url&lt;br /&gt;
*crunchbase_license.txt 487&lt;br /&gt;
&lt;br /&gt;
==Changing MYSQL to PostgreSQL==&lt;br /&gt;
&lt;br /&gt;
The SQL files were generated in MySQL. We need to convert them to PostgreSQL. See: https://en.wikibooks.org/wiki/Converting_MySQL_to_PostgreSQL and http://stackoverflow.com/questions/1942586/comparison-of-database-column-types-in-mysql-postgresql-and-sqlite-cross-map&lt;br /&gt;
&lt;br /&gt;
The key changes are:&lt;br /&gt;
&lt;br /&gt;
 MYSQL          POSTGRESQL&lt;br /&gt;
 -----          ----------&lt;br /&gt;
 LOCK           --comment out as no need but LOCK [ TABLE ] [ ONLY ] name [ * ] [, ...] [ IN lockmode MODE ] [ NOWAIT ]&lt;br /&gt;
 UNLOCK         --comment out&lt;br /&gt;
 decimal(x,y)   real (might work as is)&lt;br /&gt;
 datetime       timestamp&lt;br /&gt;
 KEY            --comment out as no need but FOREIGN KEY ( column_name [, ... ] ) REFERENCES reftable [ ( refcolumn [, ... ] ) ]&lt;br /&gt;
&lt;br /&gt;
[[category:internal]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=16863</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=16863"/>
		<updated>2017-03-23T19:41:33Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;br /&gt;
3/01/17 14:00 - 16:00&lt;br /&gt;
*Worked with Ben and Matthew to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
3/02/17 14:00 - 16:00&lt;br /&gt;
*Tried to repeat the VC data pull without it crashing from pulling too many entries. Unfortunately, we were unable to finish it&lt;br /&gt;
3/06/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to put final touches on the cohort data to prep it for matching with our VC data&lt;br /&gt;
3/07/17 14:00 - 16:00&lt;br /&gt;
*Finally finished working on the cohort files, will match on the 8th&lt;br /&gt;
3/08/17 14:00 - 16:00&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
3/20/17 14:00 - 16:00&lt;br /&gt;
*Participated in a SQL training session with Ed, learned how to create a database and to pull tab delimited information from text files onto a table&lt;br /&gt;
3/21/17 14:00 - 16:00&lt;br /&gt;
*Met with Ed and arrived at the conclusion of finishing the draft for a report by the end of the semester. Put the initial report information on the accelerator page using the variables that we currently have&lt;br /&gt;
3/22/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to compile tables in our database of the matched VC-portfolio company lists and the overall accelerator cohort information. Found multiple errors in the cohort file which needed to be fixed before finishing the tables and analyzing the data&lt;br /&gt;
3/23/17 14:00 - 16:00&lt;br /&gt;
Finished cleaning the cohort file once again.&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=16815</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=16815"/>
		<updated>2017-03-22T19:25:53Z</updated>

		<summary type="html">&lt;p&gt;Shrey: /* End of Semester Report */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
||Has keywords=Accelerators,Data&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in, as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do  stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]] and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators/all==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/8&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=16478</id>
		<title>Accelerator Seed List (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Seed_List_(Data)&amp;diff=16478"/>
		<updated>2017-03-21T20:48:16Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Accelerator Seed List (Data)&lt;br /&gt;
|Has owner=Shrey Agarwal, Matthew Ringheanu&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has keywords=Accelerators&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Industry Classifier&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=End of Semester Report=&lt;br /&gt;
The end of semester report will focus on ranking accelerators and environments based on the variables we have gathered. Our primary form of categorization will be ranking individual accelerators based on their venture capital raise rate. We can probably generate information over time for accelerators and the amount of VC they raised to get a sense of what locations have developed in the past five years from the dates of transactions recorded by SDC. To obtain these rankings, we will identify which cohorts companies were trained in as well as complete details of the accelerator and the details of cohort companies. We will focus only on accelerators because there are many other entities in each ecosystem. We will also utilize information on IPO or acquisition by companies, obtained through Crunchbase, to gain some sense of how successful startups emerging from a particular accelerator are. To obtain the data over time, we will need to fill out the cohort date information column in our cohort data, which will require the help of either Crunchbase or the Wayback machine for older accelerators. In ranking the accelerators across regions, we can also track industry-specific hotspots for accelerators such as medicine in Memphis or technology in San Francisco.&lt;br /&gt;
&lt;br /&gt;
To complete the report, we need to fill information in:&lt;br /&gt;
*Industry and focus&lt;br /&gt;
*Location&lt;br /&gt;
*Name, description&lt;br /&gt;
*Matched VC data&lt;br /&gt;
*Founder information (maybe)&lt;br /&gt;
&lt;br /&gt;
=Overview=&lt;br /&gt;
This project is developing broad and near-population data on accelerators and their cohort companies. The objective is to identify which cohorts of which accelerators a cohort company was trained in, obtain details of the accelerators, and obtain details of the cohort companies, including information about any venture capital investment that the cohort company might have received and any IPO or acquisition the company may have experienced.&lt;br /&gt;
&lt;br /&gt;
The primary use of this data is for an academic paper detailed on the [[Matching Entrepreneurs to Accelerators and VCs (Academic Paper)]] page. &lt;br /&gt;
&lt;br /&gt;
However, this project can also provide useful data to other academic papers ([[Urban Start-up Agglomeration]], [[Hubs (Academic Paper)]], and [[Hubs Scorecard (Academic Paper)]]), projects ([[Houston Entrepreneurship]]) and blog posts (under the [[Emerging Ecosystems]] umbrella project).&lt;br /&gt;
&lt;br /&gt;
This project needs the results of the [[Industry Classifier]], [[Whois Parser]], and other tools.&lt;br /&gt;
&lt;br /&gt;
=Current Project Write-Up=&lt;br /&gt;
&lt;br /&gt;
==Things To Do==&lt;br /&gt;
*Obtain all URLs for accelerators in order to run through the Wayback Machine to find out when they started.&lt;br /&gt;
*Match Crunchbase Data with our Accelerator List to see if they have any accelerators that we do not.&lt;br /&gt;
*Obtain an example of accelerator that started early and has multiple companies but does not separate them into cohorts and figure out a way to determine which companies went through each cohort.&lt;br /&gt;
&lt;br /&gt;
==What Each File in the &amp;quot;Accelerator&amp;quot; Folder on the RDP Contains==&lt;br /&gt;
*&amp;quot;Accelerator List Sources&amp;quot; (Folder) - This folder contains most of the sources that we pulled accelerator names from at the very beginning of the project.&lt;br /&gt;
*&amp;quot;Code+Final_Data&amp;quot; (Folder) - This folder contains Peter's code for pulling the data from the text files in the &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Crunchbase Snapshot&amp;quot; (Folder) - This folder contains the data we obtained from Crunchbase. There is a massive amount of data which we will need to sort through to find useful information and hopefully match that data with our current cohort data.&lt;br /&gt;
*&amp;quot;Data&amp;quot; (Folder) - This folder contains all of our data on accelerators including cohort information and the html files of each cohort page. I would estimate that it is about 95% clean currently.&lt;br /&gt;
*&amp;quot;Data - Copy&amp;quot; (Folder) - This is just a copy of our current &amp;quot;Data&amp;quot; folder.&lt;br /&gt;
*&amp;quot;Data_Copy&amp;quot; (Folder) - This is a copy of our original &amp;quot;Data&amp;quot; folder before we did any manual cleaning.&lt;br /&gt;
*&amp;quot;Enclosing_Circle&amp;quot; (Folder) - This folder seems to contain some data on VC but I'm not sure how it pertains to the Accelerator project.&lt;br /&gt;
*&amp;quot;F6S Accelerator HTMLs&amp;quot; (Folder) - This folder contains the HTML pages of all the pages on the F6S website. We used it to add more potential accelerators to our list.&lt;br /&gt;
*&amp;quot;Google_SiteSearch&amp;quot; (Folder) - This folder contains Python code for Google searches.&lt;br /&gt;
*&amp;quot;Industry_Classifier&amp;quot; (Folder) - This folder seems to contain Python code but I'm not sure what for.&lt;br /&gt;
*&amp;quot;Matcher&amp;quot; (Folder) - This folder contains the Matcher.&lt;br /&gt;
*&amp;quot;Python WebCrawler&amp;quot; (Folder) - This folder contains code that is a work in progress for pulling descriptions from accelerator websites. It is Jeemin's project.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data Copy&amp;quot; (Excel File) - This file contains a copy of our cleaned cohort data.&lt;br /&gt;
*&amp;quot;Cleaned Cohort Data&amp;quot; (Excel File) - This file contains the most current, completely cleaned data on cohort company information.&lt;br /&gt;
*&amp;quot;NormalizeFixedWidth&amp;quot; (PL File) - This is the normalizer.&lt;br /&gt;
*&amp;quot;PortCoNames&amp;quot; (TXT File) - This file contains all of the names of the cohort companies as well as the accelerator they went through.&lt;br /&gt;
*&amp;quot;VC Data&amp;quot; (Excel File) - This file contains all of the names of the companies that have ever received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data&amp;quot; (TXT File) - This file contains that non-normalized data of all of the VC information.&lt;br /&gt;
*&amp;quot;VC_Data_Names&amp;quot; (TXT File) - This file contains all of the names of companies that have received VC funding.&lt;br /&gt;
*&amp;quot;VC_Data_Names_Matched_PortCoNames&amp;quot; (Excel File) - This file contains all of the cohort companies that have also received VC funding. Still needs to be sorted through.&lt;br /&gt;
&lt;br /&gt;
==Process==&lt;br /&gt;
After accumulating the massive amount of data on accelerators, their cohorts, and their html files, we began cleaning those text files, which are located in the &amp;quot;Data&amp;quot; folder within &amp;quot;Accelerators&amp;quot;. After going through the first round of cleaning, we ran a code through the cohort data which put all of that information into an Excel document called &amp;quot;Cleaned Cohort Data&amp;quot;. There were still some mistakes in the cohort information unfortunately, which we fixed within the Excel file itself. Therefore, there are some text files within the &amp;quot;Data&amp;quot; folder that do not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file. If we were to run the cohort code through the &amp;quot;Data&amp;quot; folder, we would get something that does not match with the &amp;quot;Cleaned Cohort Data&amp;quot; file, which is problematic. The solution to this (other than manually cleaning the text files again) would be to write a code from the &amp;quot;Cleaned Cohort Data&amp;quot; file which would allow us to clean the data in the &amp;quot;Data&amp;quot; folder through the format of the Excel file. We have also matched all of the cohort companies with our list of all companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
=Current To Do=&lt;br /&gt;
&lt;br /&gt;
#Work on the [[Crunchbase 2013 Snapshot]]&lt;br /&gt;
#Match cohort companies to VC backed portfolio companies&lt;br /&gt;
#Refine our data to work out which cohort each cohort company was a member of, cohort start dates and locations, etc.&lt;br /&gt;
#Make a list of top accelerator lists (e.g., http://tech.co/top-startup-accelerators-ranked-2012-08) and check that we have those accelerators&lt;br /&gt;
&lt;br /&gt;
=End of Semester Notes=&lt;br /&gt;
&lt;br /&gt;
*We have compiled a very long list of accelerators from many different databases. For the past couple of weeks, everyone in the center has been going through this list, 20 at a time, classifying each one as an accelerator or not an accelerator, and then proceeding to gather data on the accelerator using the process outlined below. This process went very smoothly. We have successfully gone through about 80% of the list. We are still missing information on the last hundred or so names. All of the collected data is located on the RDP, within the &amp;quot;Accelerators&amp;quot; folder under &amp;quot;Data&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
=Data Collection Notes=&lt;br /&gt;
&lt;br /&gt;
==3 files==&lt;br /&gt;
&lt;br /&gt;
For each accelerator in the list, put files in E:\Projects\Accelerators\Data&lt;br /&gt;
*AcceleratorName.txt - copy and paste the variables below into a (tab-delimited) txt file and complete&lt;br /&gt;
*AcceleratorName.cohort - your cohort text file (see below)&lt;br /&gt;
*AcceleratorName.html (possibly automatically with a folder too) - save a copy of the html of the cohort page&lt;br /&gt;
&lt;br /&gt;
==.txt Variables==&lt;br /&gt;
&lt;br /&gt;
 Name	&lt;br /&gt;
 Score	&lt;br /&gt;
 Flag	&lt;br /&gt;
 CohortURL	&lt;br /&gt;
 Address	&lt;br /&gt;
 Duration	&lt;br /&gt;
 Vintage		&lt;br /&gt;
 Industry	&lt;br /&gt;
 Description	&lt;br /&gt;
 Equity	&lt;br /&gt;
 NonProfit	 &lt;br /&gt;
 Notes	&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Try to get '''Name, Score, Flag, Cohort URL and Address''' for all. ONLY GRAB OTHER VARIABLES IF EASY. Just leave things blank if you can't find them quickly.&lt;br /&gt;
&lt;br /&gt;
'''If the score is 0, or the flag is S, I, A, or F just stop''' - don't bother downloading a cohort list, saving an HTML file, etc. If possible, do  stick a very brief description of the problem in the notes field.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Score: is 0-1 where 0 is definitely not an accelerator, 1 is definitely an accelerator&lt;br /&gt;
*Flag: (leave blank if not needed), if multiple then separate by comma&lt;br /&gt;
**S for social entrep&lt;br /&gt;
**I for incubator&lt;br /&gt;
**A for an angel group&lt;br /&gt;
**F is for foreign&lt;br /&gt;
**C for in coworking space/hub/etc&lt;br /&gt;
**V for if part of venture fund&lt;br /&gt;
**D is for Dead&lt;br /&gt;
*Put just the root URL in Cohort URL if there isn't a Cohort page&lt;br /&gt;
*Duration: in wks (months x 4.33 and round)&lt;br /&gt;
*Vintage is year of first cohort if possible&lt;br /&gt;
*Industry is industry focus but only if clear focus&lt;br /&gt;
*Equity is a number (don't put %) or Y/N&lt;br /&gt;
*Notes is only there if need it. Particularly try to use this field to note discards.&lt;br /&gt;
&lt;br /&gt;
==.cohort files==&lt;br /&gt;
&lt;br /&gt;
Your .cohort files must:&lt;br /&gt;
*Be tab delimited txt&lt;br /&gt;
*Have a header&lt;br /&gt;
*The first column must be the portfolio company name&lt;br /&gt;
*Grab as many columns as you can easily (and name them)&lt;br /&gt;
&lt;br /&gt;
==Standardized format for text files==&lt;br /&gt;
&lt;br /&gt;
Information Text file&lt;br /&gt;
*1 tab only after each category&lt;br /&gt;
*No spaces after commas for flags or industry&lt;br /&gt;
*For duration put only a number in weeks but do not write &amp;quot;weeks&amp;quot;&lt;br /&gt;
*Equity is either only a number (no percent sign) or a Y/N&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Cohort Text file&lt;br /&gt;
*1 tab between each column&lt;br /&gt;
*Titles of each column on top&lt;br /&gt;
*Make a new category for &amp;quot;Cohort Number&amp;quot; and write either &amp;quot;1 2 3 4 etc.&amp;quot;&lt;br /&gt;
*Matthew: 1-225 (done) Shrey: 226-550 (done)&lt;br /&gt;
&lt;br /&gt;
==Link to Crunchbase API application==&lt;br /&gt;
&lt;br /&gt;
https://about.crunchbase.com/forms/research-access-apply/&lt;br /&gt;
&lt;br /&gt;
==Sign-Ups==&lt;br /&gt;
&lt;br /&gt;
 Ed - 1-10 (done)&lt;br /&gt;
 Carlin -  11-20 (done)&lt;br /&gt;
 Carlin - 21-40 (done)&lt;br /&gt;
 Christy - 41-60 (done)&lt;br /&gt;
 Avesh - 61-80 (done)&lt;br /&gt;
 Eliza - 81-100 (done)&lt;br /&gt;
 Meghana - 101-120 (done)&lt;br /&gt;
 Peter - 121-140 (done)&lt;br /&gt;
 Ramee - 141-160 (done)&lt;br /&gt;
 Will - 161-180 (done)&lt;br /&gt;
 Matthew - 181-200 (done)&lt;br /&gt;
 Julia - 201-220 (done)&lt;br /&gt;
 Peter - 221-240 (done)&lt;br /&gt;
 Shrey - 241-260 (done)&lt;br /&gt;
 Matthew - 261-280 (done)&lt;br /&gt;
 Eliza - 281-300 (done)&lt;br /&gt;
 Julia - 301-320 (done)&lt;br /&gt;
 Shrey - 321-340 (done)&lt;br /&gt;
 Carlin - 341-361 (done)&lt;br /&gt;
 Julia - 362-380 (done)&lt;br /&gt;
 Dylan - 381-393 (done)&lt;br /&gt;
 Jake - 394-404 (done)&lt;br /&gt;
 Dylan - 405-410 (done)&lt;br /&gt;
 Avesh - 411-415 (done)&lt;br /&gt;
 Dylan - 416-423 (done)&lt;br /&gt;
 Peter - 424-460(done)&lt;br /&gt;
 Carlin - 461-480 (done)&lt;br /&gt;
 Peter - 481-490(done)&lt;br /&gt;
 Julia - 491-510 (done)&lt;br /&gt;
 Peter - 511-515 (done)&lt;br /&gt;
 Julia - 516-529 (done)&lt;br /&gt;
 Ben - 530-540 (done)&lt;br /&gt;
 Shrey - 541-551 (done)&lt;br /&gt;
&lt;br /&gt;
=List of Accelerators=&lt;br /&gt;
#10Xelerator&lt;br /&gt;
#1440&lt;br /&gt;
#33entrepreneurs&lt;br /&gt;
#500 Startups&lt;br /&gt;
#9Mile Labs&lt;br /&gt;
#AIA Accelerator&lt;br /&gt;
#ARK Challenge&lt;br /&gt;
#AT&amp;amp;T Aspire Accelerator&lt;br /&gt;
#ATDC Community&lt;br /&gt;
#AZ TechCelerator&lt;br /&gt;
#AccelFoods&lt;br /&gt;
#Acceleprise&lt;br /&gt;
#Accelerate Baltimore&lt;br /&gt;
#Accelerate Genius&lt;br /&gt;
#Accelerate Tectoria Accelerator&lt;br /&gt;
#Accelerator Centre&lt;br /&gt;
#Advanced Technology Development Center (ATDC)&lt;br /&gt;
#Airbus BizLab&lt;br /&gt;
#Alchemist Accelerator&lt;br /&gt;
#AlphaLab&lt;br /&gt;
#Amplify.LA&lt;br /&gt;
#Angel Capital&lt;br /&gt;
#Angelcube&lt;br /&gt;
#Angelpad&lt;br /&gt;
#Annual Business BootCamp&lt;br /&gt;
#Arizona Center for Innovation&lt;br /&gt;
#Arizona Furnace&lt;br /&gt;
#Arrowhead Tech Incubator 2016&lt;br /&gt;
#Aspire 3 Accelerator 2017&lt;br /&gt;
#Atlanta Ventures Accelerator &lt;br /&gt;
#AutoXLR8R&lt;br /&gt;
#Awesome Inc.&lt;br /&gt;
#Axel Springer Plug and Play&lt;br /&gt;
#B 4 Change Impact Accelerator&lt;br /&gt;
#B2B Acceleration Program&lt;br /&gt;
#B4C Social Venture Accelerator&lt;br /&gt;
#BBC Worldwide Labs&lt;br /&gt;
#BMW Startup Garage&lt;br /&gt;
#Brandcelerate&lt;br /&gt;
#Bunker Labs&lt;br /&gt;
#Bank of Ireland Accelerator Programme&lt;br /&gt;
#Bantunium Labs Accelerator&lt;br /&gt;
#Barclays Accelerator&lt;br /&gt;
#Barclays New York Summer 2015&lt;br /&gt;
#Berkley Ventures&lt;br /&gt;
#Bessemer Business Incubation System&lt;br /&gt;
#Beta-i&lt;br /&gt;
#Beta.MN&lt;br /&gt;
#BetaFactory&lt;br /&gt;
#BetaSpring&lt;br /&gt;
#Betablox&lt;br /&gt;
#Betaspring RevUp  (DUPLICATE)&lt;br /&gt;
#Bethnal Green Ventures&lt;br /&gt;
#BioAccel&lt;br /&gt;
#BioInspire&lt;br /&gt;
#Bir 2015&lt;br /&gt;
#BitAngel Engagement Level&lt;br /&gt;
#BitAngels Startup Summer Program of 2013&lt;br /&gt;
#Bizdom&lt;br /&gt;
#Black Forest Accelerator&lt;br /&gt;
#Blue Startups&lt;br /&gt;
#Blueprint Health&lt;br /&gt;
#Bolt Boston&lt;br /&gt;
#Bonnier Accelerator&lt;br /&gt;
#BoomStartup&lt;br /&gt;
#BoomStartup Winter 2017 (DUPLICATE)&lt;br /&gt;
#Boomtown Accelerator&lt;br /&gt;
#Boomtown Health Tech (DUPLICATE)&lt;br /&gt;
#Boost VC&lt;br /&gt;
#BootupLabs&lt;br /&gt;
#Brandery&lt;br /&gt;
#Brooklyn Beta Summer Camp&lt;br /&gt;
#Budweiser Dream Brewery&lt;br /&gt;
#Buildit&lt;br /&gt;
#BuiltinPGH Companies&lt;br /&gt;
#Business Innovation Center&lt;br /&gt;
#Business Opportunity Academy 2017&lt;br /&gt;
#Business Technology Development Center (BizTech)&lt;br /&gt;
#CLT Joules Energy Accelerator 2014&lt;br /&gt;
#CWI Ventures&lt;br /&gt;
#CWI Ventures Application (DUPLICATE)&lt;br /&gt;
#CableLabs Technology Tours 2016&lt;br /&gt;
#Capital Factory&lt;br /&gt;
#Capital Innovators&lt;br /&gt;
#Capital Investment Network (Startups)&lt;br /&gt;
#Caroline Plouff&lt;br /&gt;
#Catalyst Partners&lt;br /&gt;
#Cause Collective : Social Innovation Lab&lt;br /&gt;
#Center for Entrepreneurial Innovation&lt;br /&gt;
#Chain Reaction Innovations 2017&lt;br /&gt;
#Chemical Angel Network&lt;br /&gt;
#Chinaccelerator&lt;br /&gt;
#Cisco Entrepreneurs in Residence&lt;br /&gt;
#Citi Accelerator&lt;br /&gt;
#Citrix Startup Accelerator&lt;br /&gt;
#Claremont/Upland Makerspace Fablab&lt;br /&gt;
#Climate Ventures 2.0 Accelerator&lt;br /&gt;
#Co.Lab accelerator&lt;br /&gt;
#Code for America Accelerator&lt;br /&gt;
#Cohab's Traxtion Point&lt;br /&gt;
#Collision Conference Investors&lt;br /&gt;
#Common Bond&lt;br /&gt;
#Communitech Hyperdrive&lt;br /&gt;
#Conquer Accelerator&lt;br /&gt;
#Coolhouse Labs&lt;br /&gt;
#CuriousMinds Incubator / Accelerator&lt;br /&gt;
#CyberTECH San Diego&lt;br /&gt;
#DBS Accelerator&lt;br /&gt;
#DPD Last Mile labs&lt;br /&gt;
#DV X Labs&lt;br /&gt;
#Dat Ventures&lt;br /&gt;
#Decatur-Morgan County Entrepreneurial Center&lt;br /&gt;
#Deep Space Ventures&lt;br /&gt;
#Demo Accelerator 2016- 2017&lt;br /&gt;
#DeveloperTown&lt;br /&gt;
#Difference Engine&lt;br /&gt;
#Digital Malaysia Corporate Accelerator Program&lt;br /&gt;
#Digital Media Zone Incubator/Accelerator&lt;br /&gt;
#Disney Accelerator&lt;br /&gt;
#DogFish Accelerator&lt;br /&gt;
#Domi Station&lt;br /&gt;
#Dotforge accelerator&lt;br /&gt;
#Dream Funded&lt;br /&gt;
#DreamIT Health&lt;br /&gt;
#DreamStart - Free Mentoring Program&lt;br /&gt;
#Dreamit Ventures (DUPLICATE)&lt;br /&gt;
#Ducky Diggy Lloyd &lt;br /&gt;
#E-Capital Summit&lt;br /&gt;
#EC Mentor Skills Inventory&lt;br /&gt;
#EIGERlab&lt;br /&gt;
#ETRAC&lt;br /&gt;
#EY Startup Challenge&lt;br /&gt;
#Eco Holding&lt;br /&gt;
#Eleven Startup Accelerator&lt;br /&gt;
#Emerge Xcelerate&lt;br /&gt;
#EnterpriseWorks Incubation Program&lt;br /&gt;
#Entrepreneur Development Center&lt;br /&gt;
#Entrepreneurs Roundtable Accelerator&lt;br /&gt;
#Environmental Business Cluster&lt;br /&gt;
#Equity Legal&lt;br /&gt;
#Excelerate Labs&lt;br /&gt;
#Execution Labs&lt;br /&gt;
#Exhilarator&lt;br /&gt;
#Extreme Startups&lt;br /&gt;
#Extreme University&lt;br /&gt;
#FOOD-X&lt;br /&gt;
#Factory45&lt;br /&gt;
#Fargo Startup House 2014-2015&lt;br /&gt;
#FastTrack Propero Healthcare&lt;br /&gt;
#FbFund&lt;br /&gt;
#Female Propeller for High Flyers&lt;br /&gt;
#FinTech Innovation Lab&lt;br /&gt;
#FinTech Studios 2015&lt;br /&gt;
#Fintech Founders Club #2&lt;br /&gt;
#First Growth Venture Network&lt;br /&gt;
#Fishbowl Labs AOL&lt;br /&gt;
#Flagship Enterprise Center&lt;br /&gt;
#FlashStarts&lt;br /&gt;
#Flashpoint&lt;br /&gt;
#Flat6 Labs&lt;br /&gt;
#Fledge9&lt;br /&gt;
#Flextronics Lab IX&lt;br /&gt;
#Food Future Scale-up Accelerator 2017&lt;br /&gt;
#Food System 6 (FS6) Accelerator&lt;br /&gt;
#FoodForwardX&lt;br /&gt;
#Fortify Ventures&lt;br /&gt;
#Founder Institute&lt;br /&gt;
#FounderFuel&lt;br /&gt;
#FoundersPad&lt;br /&gt;
#Fownders Accelerator&lt;br /&gt;
#French Accelerator 2016&lt;br /&gt;
#Fund the Food&lt;br /&gt;
#Fuse Corps Host&lt;br /&gt;
#GAKKEN Accelerator Program&lt;br /&gt;
#Gainesville Technology Enterprise Center&lt;br /&gt;
#Game CoLab Incubator Program 2014&lt;br /&gt;
#GameFounders&lt;br /&gt;
#GammaRebels&lt;br /&gt;
#Gazelle Lab&lt;br /&gt;
#Gener8tor&lt;br /&gt;
#German Accelerator Life Sciences&lt;br /&gt;
#German Accelerator Tech&lt;br /&gt;
#Global Accelerator Network 2015&lt;br /&gt;
#Good Works Houston Lab&lt;br /&gt;
#GoodCompany Ventures&lt;br /&gt;
#Google Launchpad Accelerator&lt;br /&gt;
#Grants4Apps Accelerator&lt;br /&gt;
#GreenStart&lt;br /&gt;
#Greenlite Labs&lt;br /&gt;
#GrowLab&lt;br /&gt;
#Growth Hacking Accelerator 2015&lt;br /&gt;
#Gulf Coast Center for Innovation and Entrepreneurship&lt;br /&gt;
#H-Farm Ventures&lt;br /&gt;
#HACKT Mission for International Founders&lt;br /&gt;
#HAXLR8R&lt;br /&gt;
#HCC Entrepreneurship Launchpad&lt;br /&gt;
#HIGHLINE Academy&lt;br /&gt;
#HUB&lt;br /&gt;
#HUBB Accelerator&lt;br /&gt;
#HUBB GTLA 2016&lt;br /&gt;
#HackFWD&lt;br /&gt;
#Hatch&lt;br /&gt;
#Health Wildcatters&lt;br /&gt;
#Health accelerator&lt;br /&gt;
#Healthbox&lt;br /&gt;
#Hero City Co-Working Space&lt;br /&gt;
#High Street Startups Accelerator&lt;br /&gt;
#Highway1&lt;br /&gt;
#Honda Xcelerator &lt;br /&gt;
#Houston Technology Center&lt;br /&gt;
#Hub Ventures&lt;br /&gt;
#HugeThing&lt;br /&gt;
#I/O ventures&lt;br /&gt;
#ICONYC labs&lt;br /&gt;
#IDC Elevator&lt;br /&gt;
#INcubes Funnel and Accelerator 2014/2015&lt;br /&gt;
#INcubes Online Form&lt;br /&gt;
#INcubes Startup Visa&lt;br /&gt;
#Illumina Accelerator&lt;br /&gt;
#Illuminator,  New York Accelerator 2015&lt;br /&gt;
#Imagine K12&lt;br /&gt;
#Immokalee Business Development Center&lt;br /&gt;
#Impact Engine&lt;br /&gt;
#Impact USA - 2017&lt;br /&gt;
#Incubate Miami&lt;br /&gt;
#Infuse Accelerator&lt;br /&gt;
#Ingenuity Partner Program&lt;br /&gt;
#InnoSpring&lt;br /&gt;
#Innov&amp;amp;Connect&lt;br /&gt;
#Innov8 for Health&lt;br /&gt;
#Innova Memphis&lt;br /&gt;
#InnovateOC&lt;br /&gt;
#Innovation Depot&lt;br /&gt;
#Innovation Pavilion&lt;br /&gt;
#Innovation Showcase Winter 2017&lt;br /&gt;
#Insight Accelerator Labs&lt;br /&gt;
#Intel Education Accelerator&lt;br /&gt;
#Investment Preparedness Lab&lt;br /&gt;
#Invoke Collective&lt;br /&gt;
#Iowa Startup Accelerator&lt;br /&gt;
#JFDI.Asia&lt;br /&gt;
#JFE Accelerator SF&lt;br /&gt;
#JLAB&lt;br /&gt;
#Jaguar Land Rover Tech Incubator&lt;br /&gt;
#Jolt&lt;br /&gt;
#JumpSchool &lt;br /&gt;
#JumpStart Foundry&lt;br /&gt;
#Jumpstart! Boulder&lt;br /&gt;
#JusticeXL&lt;br /&gt;
#Kairos Boston Spring Program&lt;br /&gt;
#Kaplan EdTech&lt;br /&gt;
#Kick&lt;br /&gt;
#Kick Boise&lt;br /&gt;
#Kick LA&lt;br /&gt;
#Kick Victoria&lt;br /&gt;
#Kicklabs&lt;br /&gt;
#Kinetiq Labs&lt;br /&gt;
#L-SPARK Accelerator&lt;br /&gt;
#LAUNCH incubator&lt;br /&gt;
#LAUNCHub&lt;br /&gt;
#LI TechCOMETS&lt;br /&gt;
#LabFunding Project Accelerator 2014&lt;br /&gt;
#Labs Venture Accelerator&lt;br /&gt;
#Launch Chapel Hill&lt;br /&gt;
#Launch Memphis&lt;br /&gt;
#LaunchBox Digital&lt;br /&gt;
#LaunchHouse&lt;br /&gt;
#LaunchPad PEI&lt;br /&gt;
#LaunchSpot&lt;br /&gt;
#Launch_Academy&lt;br /&gt;
#Launchpad Digital Health, LLC&lt;br /&gt;
#Launchpad LA&lt;br /&gt;
#Launchpad Long Island&lt;br /&gt;
#Le Camping&lt;br /&gt;
#Leading Entrepreneurial Accelerator Program&lt;br /&gt;
#Lean Launch Ventures&lt;br /&gt;
#LearnLaunchX&lt;br /&gt;
#Lemnos Labs&lt;br /&gt;
#Life Changing Labs&lt;br /&gt;
#LiftOff Health Incubator&lt;br /&gt;
#Lightbank Start&lt;br /&gt;
#LightningLab&lt;br /&gt;
#Lowe's Accelerator&lt;br /&gt;
#MACH37&lt;br /&gt;
#MACH37 Spring&lt;br /&gt;
#MIT SA+P venture accelerator&lt;br /&gt;
#MITA Institute Accelerator&lt;br /&gt;
#MTGx MediaFactory&lt;br /&gt;
#Mac6&lt;br /&gt;
#Madworks Governance Accelerator&lt;br /&gt;
#Maine Center for Entrepreneurial Development - Top Gun Program&lt;br /&gt;
#Matter&lt;br /&gt;
#Maven Ventures Fund &amp;amp; Incubator&lt;br /&gt;
#Media Camp&lt;br /&gt;
#Melbourne Accelerator Program&lt;br /&gt;
#Memphis BioWorks&lt;br /&gt;
#Merck Accelerator&lt;br /&gt;
#MergeLane 2017 Accelerator&lt;br /&gt;
#Mergelane&lt;br /&gt;
#Metavallon&lt;br /&gt;
#Microsoft Accelerator&lt;br /&gt;
#MindTheBridge&lt;br /&gt;
#Momentum&lt;br /&gt;
#MuckerLab&lt;br /&gt;
#Muru-D&lt;br /&gt;
#My5ive Accelerator 2016&lt;br /&gt;
#N-Motion (DUPLICATE)&lt;br /&gt;
#NDRC (LaunchPad / VentureLab)&lt;br /&gt;
#NEXT Dashboard&lt;br /&gt;
#NMotion&lt;br /&gt;
#NY Digital Health Accelerator&lt;br /&gt;
#NY Fashion Tech Lab 2017&lt;br /&gt;
#NYC ACRE&lt;br /&gt;
#NYC SeedStart&lt;br /&gt;
#Nashville Entrepreneur Center&lt;br /&gt;
#Nebula Shift&lt;br /&gt;
#Nephoscale IaaS&lt;br /&gt;
#Nest New York &lt;br /&gt;
#New Ventures Group&lt;br /&gt;
#New York Digital Health Accelerator (DUPLICATE)&lt;br /&gt;
#NewME Accelerator PopUps &lt;br /&gt;
#NewMe&lt;br /&gt;
#Next media accelerator&lt;br /&gt;
#NextHIT&lt;br /&gt;
#NextStart&lt;br /&gt;
#Nike+ Accelerator&lt;br /&gt;
#Northern Arizona Center for Entrepreneurship and Technology (NACET)&lt;br /&gt;
#Northern England&lt;br /&gt;
#Nxtp.labs&lt;br /&gt;
#OCTANe&lt;br /&gt;
#Oasis 500&lt;br /&gt;
#OpenFund&lt;br /&gt;
#Orange Fab&lt;br /&gt;
#Orange Works&lt;br /&gt;
#Orion Startups&lt;br /&gt;
#Oxygen Accelerator&lt;br /&gt;
#PIE&lt;br /&gt;
#Patriot Boot Camp&lt;br /&gt;
#Pearson Catalyst for Education&lt;br /&gt;
#Pipeline H2O&lt;br /&gt;
#Pitney Bowes Inc&lt;br /&gt;
#Plarium Labs&lt;br /&gt;
#Plug In South LA &lt;br /&gt;
#Plug and Play&lt;br /&gt;
#Plum Alley Investments 2016&lt;br /&gt;
#Points of Light Accelerator&lt;br /&gt;
#PowerHaus&lt;br /&gt;
#Preccelerator® Program 2016&lt;br /&gt;
#ProSiebenSat.1 Accelerator&lt;br /&gt;
#Project Entrepreneur 2016/17&lt;br /&gt;
#Project Healtchare&lt;br /&gt;
#Project Lift&lt;br /&gt;
#Project Music&lt;br /&gt;
#Project Skyway&lt;br /&gt;
#Propeller Venture Accelerator&lt;br /&gt;
#Prosper Capital Accelerator&lt;br /&gt;
#Proton Enterprises&lt;br /&gt;
#Pushstart Accelerator&lt;br /&gt;
#Qualcomm Robotics Accelerator&lt;br /&gt;
#Queen Creek Business Incubator&lt;br /&gt;
#R/GA Accelerator&lt;br /&gt;
#RAIN Incubator/Accelerator&lt;br /&gt;
#RJI Investment Group&lt;br /&gt;
#Reach&lt;br /&gt;
#RetailXelerator&lt;br /&gt;
#Rock Health&lt;br /&gt;
#Rocket Fuel Labs&lt;br /&gt;
#Rockstart Accelerator&lt;br /&gt;
#RunUp Labs&lt;br /&gt;
#Runway IoT Accelerator 2015&lt;br /&gt;
#SAP Startup Focus Program&lt;br /&gt;
#SKTA Innopartners Innovation Accelerator&lt;br /&gt;
#SPACELAB Tech Accelerator&lt;br /&gt;
#SPARK&lt;br /&gt;
#SPH Plug and Play&lt;br /&gt;
#SURF Incubator&lt;br /&gt;
#SaltMines Group Start-Up Studio&lt;br /&gt;
#ScaleTown&lt;br /&gt;
#Seamless IoT 2016&lt;br /&gt;
#Searchcamp&lt;br /&gt;
#Seed Hatchery&lt;br /&gt;
#SeedSpot&lt;br /&gt;
#SeedStartup&lt;br /&gt;
#SeedSumo&lt;br /&gt;
#Seedcamp&lt;br /&gt;
#Seedrocket&lt;br /&gt;
#Seeqnce&lt;br /&gt;
#Sequoia Apps&lt;br /&gt;
#Serval Ventures&lt;br /&gt;
#Shenzhen Valley Ventures Incubator&lt;br /&gt;
#Shoals Entrepreneurial Center&lt;br /&gt;
#Shopper Futures Accelerator&lt;br /&gt;
#Shotput Ventures&lt;br /&gt;
#Sid Martin Biotechnology Institute&lt;br /&gt;
#SigmaLabs Accelerator&lt;br /&gt;
#Silicon Valley Incubator &amp;amp; Accelerator&lt;br /&gt;
#SixThirty&lt;br /&gt;
#Sixers Innovation Lab&lt;br /&gt;
#Skywalker Accelerator&lt;br /&gt;
#SmartHealth Activator&lt;br /&gt;
#Smashd Labs&lt;br /&gt;
#SoCo Nexus Accelerator Spring 2017&lt;br /&gt;
#Social Enterprise Challenge&lt;br /&gt;
#Socratic Labs&lt;br /&gt;
#SparkLabs&lt;br /&gt;
#Sparkgap&lt;br /&gt;
#Sports Tank&lt;br /&gt;
#Springboard&lt;br /&gt;
#Sprint Accelerator&lt;br /&gt;
#Sprint Mobile Health Accelerator&lt;br /&gt;
#SproutBox&lt;br /&gt;
#SproutCamp&lt;br /&gt;
#Starburst Aerospace Accelerator&lt;br /&gt;
#Start Path Europe&lt;br /&gt;
#Start'inPost&lt;br /&gt;
#StartEngine&lt;br /&gt;
#StartFast Venture Accelerator&lt;br /&gt;
#Starta Accelerator Winter 2017&lt;br /&gt;
#Startl&lt;br /&gt;
#Startmate&lt;br /&gt;
#Startup Accelerator (DUPLICATE)&lt;br /&gt;
#Startup Front&lt;br /&gt;
#Startup Next &amp;amp; GAN&lt;br /&gt;
#Startup Orange County Accelerator&lt;br /&gt;
#Startup Runway&lt;br /&gt;
#Startup Wise Guys&lt;br /&gt;
#Startup Zone PEI&lt;br /&gt;
#Startup52X Accelerator&lt;br /&gt;
#StartupCity&lt;br /&gt;
#StartupHighway&lt;br /&gt;
#StartupHouse Foundry program&lt;br /&gt;
#StartupMinds Accelerator &lt;br /&gt;
#StartupYard&lt;br /&gt;
#Startupbootcamp&lt;br /&gt;
#Straight Shot&lt;br /&gt;
#Summer@Highland&lt;br /&gt;
#Surge&lt;br /&gt;
#SynBio axlr8r&lt;br /&gt;
#TEB Incubation &amp;amp; Acceleration Center&lt;br /&gt;
#THRIVE Accelerator III&lt;br /&gt;
#THRIVE Open Innovation (DUPLICATE)&lt;br /&gt;
#TIM#WCAP Accelerator&lt;br /&gt;
#TLabs&lt;br /&gt;
#TMCx Accelerator Digital Health 2017&lt;br /&gt;
#Tallwave&lt;br /&gt;
#Tampa Bay Innovation Center&lt;br /&gt;
#Tampa Bay Wave&lt;br /&gt;
#Tandem Mobile Accelerator&lt;br /&gt;
#Tech Nexus&lt;br /&gt;
#Tech Wildcatters&lt;br /&gt;
#Tech2020&lt;br /&gt;
#TechLaunch&lt;br /&gt;
#TechRanch&lt;br /&gt;
#TechSquareLabs&lt;br /&gt;
#Techstars&lt;br /&gt;
#Techstars Music&lt;br /&gt;
#Telenet Idealabs&lt;br /&gt;
#Telluride Venture Accelerator&lt;br /&gt;
#TenX&lt;br /&gt;
#The Alchemist Accelerator (DUPLICATE)&lt;br /&gt;
#The Ark&lt;br /&gt;
#The Bakery&lt;br /&gt;
#The Batchery&lt;br /&gt;
#The Brandery&lt;br /&gt;
#The Bridge&lt;br /&gt;
#The Center For Technology Enterprise &amp;amp; Development&lt;br /&gt;
#The Chaser&lt;br /&gt;
#The Company Lab (CO.LAB)&lt;br /&gt;
#The Draper FinTech Connection&lt;br /&gt;
#The Factory&lt;br /&gt;
#The Greatest Pitch&lt;br /&gt;
#The Harbor Accelerator&lt;br /&gt;
#The Incubator&lt;br /&gt;
#The Iron Yard&lt;br /&gt;
#The Mediapreneur Incubator&lt;br /&gt;
#The Morpheus&lt;br /&gt;
#The New York Venture Summit&lt;br /&gt;
#The Next Step: from idea to startup&lt;br /&gt;
#The Refinery&lt;br /&gt;
#The Unilever Foundry&lt;br /&gt;
#The Venture Center's Pre-Accelerator I&lt;br /&gt;
#The Vine OC&lt;br /&gt;
#The Vogt Awards&lt;br /&gt;
#The Yield Lab&lt;br /&gt;
#The eFactory Accelerator&lt;br /&gt;
#Think Big Partners Accelerator&lt;br /&gt;
#TiE Angels&lt;br /&gt;
#Tigerlabs Digital Health Accelerator&lt;br /&gt;
#Tolstoy Summer Camp&lt;br /&gt;
#TopSeedsLab&lt;br /&gt;
#Travel Startups Incubator&lt;br /&gt;
#Travelport Labs Accelerator&lt;br /&gt;
#Travelport Labs Incubator&lt;br /&gt;
#Triangle Startup Factory&lt;br /&gt;
#Tumml&lt;br /&gt;
#Tune Labs&lt;br /&gt;
#Twin Cities Accelerator 2016&lt;br /&gt;
#UW-Whitewater Launch Pad Accelerator&lt;br /&gt;
#Unbank.ventures FinTech Incubator&lt;br /&gt;
#University Technology Park&lt;br /&gt;
#Unreasonable Institute&lt;br /&gt;
#UpTech&lt;br /&gt;
#Upstart Accelerator&lt;br /&gt;
#Upstart Labs&lt;br /&gt;
#Upstart Memphis&lt;br /&gt;
#Uptima Business Bootcamp&lt;br /&gt;
#Upwest Labs&lt;br /&gt;
#VANTEC&lt;br /&gt;
#VC FinTech Accelerator&lt;br /&gt;
#Velocity Indiana Accelerator&lt;br /&gt;
#Venture Catalyst Partners&lt;br /&gt;
#Venture Hive&lt;br /&gt;
#Venture I&lt;br /&gt;
#VentureOut's  Enterprise Tech Expedition&lt;br /&gt;
#Venturegeeks&lt;br /&gt;
#Vet-Tech Accelerator&lt;br /&gt;
#VictorySpark&lt;br /&gt;
#Village88 Techlab&lt;br /&gt;
#Volkswagen ERL Technology Accelerator&lt;br /&gt;
#WHLabs&lt;br /&gt;
#Wasabi Ventures Academy&lt;br /&gt;
#Wayra&lt;br /&gt;
#Wellness Accelerator&lt;br /&gt;
#Wells Fargo Startup Accelerator&lt;br /&gt;
#Wireless IoT&lt;br /&gt;
#Women Innovate Mobile&lt;br /&gt;
#XLerateHealth&lt;br /&gt;
#XTRATOS&lt;br /&gt;
#Xlerate Health&lt;br /&gt;
#Y Combinator&lt;br /&gt;
#Y&amp;amp;R SparkPlug 2017&lt;br /&gt;
#YEurope&lt;br /&gt;
#YLE Media Startup Accelerator Program&lt;br /&gt;
#Yahoo Ad Tech Program&lt;br /&gt;
#Yangler (online accelerator)&lt;br /&gt;
#Year of the Startup&lt;br /&gt;
#Yetizen Accelerator&lt;br /&gt;
#You Is Now&lt;br /&gt;
#Z80 Labs&lt;br /&gt;
#ZIP Launchpad Admission&lt;br /&gt;
#ZeroTo510&lt;br /&gt;
#Zone Startups Calgary&lt;br /&gt;
#designX 2017&lt;br /&gt;
#eMerging Ventures&lt;br /&gt;
#ezone&lt;br /&gt;
#iStart Jax (DUPLICATE)&lt;br /&gt;
#iStart Valley&lt;br /&gt;
#iVentures10&lt;br /&gt;
#ignite100&lt;br /&gt;
#innovyz start&lt;br /&gt;
#tekMountain Accelerator&lt;br /&gt;
&lt;br /&gt;
=Project Summary=&lt;br /&gt;
This project will be used to determine which accelerators are the most effective at churning out successful startups, as well as what characteristics are exhibited by these accelerators. First, we need to gather as much data as we can about as many accelerators as we can in order to look at factors that differentiate successful vs. unsuccessful ventures. Next, we need to create a web crawling program which will gather information about accelerators across the world by accessing their websites and extracting information. I believe that our overall goal with this research project is to gain insight into the methods of successful accelerators, as well as to find out what exactly differentiates very successful accelerators from dead accelerators.&lt;br /&gt;
&lt;br /&gt;
Helpful Links: http://seedrankings.com/&lt;br /&gt;
&lt;br /&gt;
=Sources=&lt;br /&gt;
&lt;br /&gt;
Summary: These are sources obtained from [[List of Accelerators]] and other Google searches. We will evaluate these sources by looking at the number of accelerators they supply (as most of them are lists) and then also taking a look at the type of information they provide about each accelerator. Key data points are cohort-related data, startup-related data, and logistics of the accelerator. Better sources supply more information that the URL alone.&lt;br /&gt;
&lt;br /&gt;
(Obtained from [[List of Accelerators]] and various Google searches)&lt;br /&gt;
*http://seedrankings.com/&lt;br /&gt;
*http://www.acceleratorinfo.com/see-all.html&lt;br /&gt;
*http://www.seed-db.com/accelerators&lt;br /&gt;
*http://gust.com/usa-canada-accelerator-report-2015/?utm_content=35401577&amp;amp;utm_medium=social&amp;amp;utm_source=twitter&lt;br /&gt;
*https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/&lt;br /&gt;
*http://www.builtinnyc.com/2016/06/03/accelerators-incubators-nyc&lt;br /&gt;
*http://www.represent.la/&lt;br /&gt;
*http://www.launch.co/blog/complete-list-of-incubators-and-accelerators-like-y-combinat.html&lt;br /&gt;
*https://angel.co/accelerator-4&lt;br /&gt;
&lt;br /&gt;
(Obtained from Google search: &amp;quot;Accelerator Database&amp;quot;)&lt;br /&gt;
*seed-db is the first result that pops up&lt;br /&gt;
*https://www.corporate-accelerators.net/database/&lt;br /&gt;
*https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json&lt;br /&gt;
*By the 5th or 6th search result, the utility diminished greatly&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2015/03/17/the-best-startup-accelerators-of-2015-powering-a-tech-boom/#2f52fa7e34e4&lt;br /&gt;
*http://www.inc.com/will-yakowicz/the-15-best-startup-accelerators-in-the-us.html&lt;br /&gt;
*http://www.forbes.com/sites/briansolomon/2016/03/11/the-best-startup-accelerators-of-2016/#74086a7724f2&lt;br /&gt;
*https://techcrunch.com/2015/03/17/these-are-the-top-20-us-accelerators/&lt;br /&gt;
*https://www.nexpcb.com/blogs/news/the-hardware-incubators-accelerators-list&lt;br /&gt;
&lt;br /&gt;
Other ways used to find Accelerators (listed below &amp;quot;List of Sources Obtained from Various Google Searches&amp;quot;):&lt;br /&gt;
*Type in generic location + &amp;quot;accelerators&amp;quot; (e.g. Houston Accelerators)&lt;br /&gt;
:*Looked at roughly the first 20 results&lt;br /&gt;
:*Used three locations as examples of accelerators that pop up&lt;br /&gt;
*Type in a specific state + &amp;quot;accelerator&amp;quot; + &amp;quot;list&amp;quot; (e.g. Texas accelerator list) to search for more relevant lists&lt;br /&gt;
:*Once again, looked at roughly the first 20 results&lt;br /&gt;
&lt;br /&gt;
=Source Evaluations=&lt;br /&gt;
&lt;br /&gt;
Summary: These evaluations couple with each of the sources above. The evaluations provide instructions for obtaining the information listed, as well as a general review of how useful the data seems. The review serves to determine whether a crawler would be suitable for obtaining information from the source autonomously.&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.acceleratorinfo.com/see-all.html==&lt;br /&gt;
#Opened source website&lt;br /&gt;
#Copied Information under &amp;quot;All Accelerator Programs&amp;quot; to TextPad, already sorted. Returned 190 results&lt;br /&gt;
#Each link on parent list leads to individual '''home page url''' of accelerator&lt;br /&gt;
:*Used sample size of 20 links, determined 16 to be accelerators, 2 to be incubators, 2 to be inactive or broken links&lt;br /&gt;
:*Many accelerators do not include founding date, most recent accelerators from around 2013-2014 (as determined from home page)&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for specific URLs to older accelerators, not very helpful for more specific information.&lt;br /&gt;
*Web crawling seems improbable because information is not readily available from source. Can potentially mine staff information or contact information from associated &amp;quot;about&amp;quot; page in the home url&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators/all==&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 235 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes:&lt;br /&gt;
::# &amp;quot;state&amp;quot;&lt;br /&gt;
::# &amp;quot;company name&amp;quot;&lt;br /&gt;
::# &amp;quot;website and CrunchBase links&amp;quot;&lt;br /&gt;
::# &amp;quot;cohort date&amp;quot;&lt;br /&gt;
::#&amp;quot;exit value&amp;quot;&lt;br /&gt;
::#&amp;quot;funding&amp;quot;. &lt;br /&gt;
:::Many entries for &amp;quot;exit value&amp;quot; are missing, some values for &amp;quot;funding&amp;quot; are missing&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators out of 235 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the following:&lt;br /&gt;
::#Status&lt;br /&gt;
::#Program (name)&lt;br /&gt;
::#Location&lt;br /&gt;
::#Country&lt;br /&gt;
::#Number of companies&lt;br /&gt;
::#Cumulative exit values&lt;br /&gt;
::#Cumulative funding &lt;br /&gt;
::#Average funding for startups&lt;br /&gt;
::#Median funding for startups&lt;br /&gt;
:::Many entries for &amp;quot;median funding&amp;quot; are left empty, as well as entries for all types of funding on the bottom half of the table&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, but after cross-referencing from other sources shows that seed-db is lacking many newer accelerators; list is not all-inclusive.&lt;br /&gt;
*Includes regional distributions for accelerator groups as well. For example, rather than just &amp;quot;Techstars&amp;quot;, the group is broken into Austin, Berlin, Boston, Boulder, etc.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://www.seed-db.com/accelerators==&lt;br /&gt;
:Very similar to &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;, but contains large regional accelerators as groups, rather than individual accelerators. For example, Techstars appears only once.&lt;br /&gt;
#Copied &amp;quot;Seed Accelerators&amp;quot; table to TextPad, data sorted itself into lines. Returned 239 results.&lt;br /&gt;
#Clicking on the accelerator name itself links to a page with all of its associated startups, up until 6/2016 cohort&lt;br /&gt;
::*Startup table includes same information as previous source, &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot;. However, accelerators spanning across multiple regions have their startups located under one category on this webpage.&lt;br /&gt;
:On original seed-db webpage, each accelerator has a link to its associated home page url&lt;br /&gt;
::*From the table, each listed entry was an accelerator, although 24 accelerators/groups out of 239 were classified as &amp;quot;dead&amp;quot;&lt;br /&gt;
::*Along with the home url, each accelerator table includes the same information as the &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; source&lt;br /&gt;
===Review===&lt;br /&gt;
*Reliable source for accelerators, includes list of accelerators both dead and active, as well as their associated start-ups&lt;br /&gt;
*Web crawling potential is promising; startup table is located within the source for each webpage. Can also mine any category from the accelerator table&lt;br /&gt;
*Overall very extensive data for accelerators that are included on the list, includes large groups as well as individual accelerators. It seems that some accelerators missing from &amp;quot;http://www.seed-db.com/accelerators/all&amp;quot; are located here, since there are 239 returns rather than 235.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.f6s.com/programs?type==&lt;br /&gt;
#On the webpage, set &amp;quot;Type&amp;quot; to &amp;quot;Accelerator/Program&amp;quot;, set &amp;quot;Location&amp;quot; to &amp;quot;North America&amp;quot;, and set &amp;quot;Invest in Country&amp;quot; to &amp;quot;United States&amp;quot; to return results&lt;br /&gt;
#Highlighted results and scrolled down until all results found; copied results to TextPad&lt;br /&gt;
#In TextPad, sorted out lines with &amp;quot;by&amp;quot;, as well as miscellaneous categories such as dates and dollar signs through Regular Expressions&lt;br /&gt;
#Using the &amp;quot;More Info&amp;quot; line which held constant through the entire list, assigned a sequential number to the line (in order to determine the number of results)&lt;br /&gt;
::*Obtained a grand total of 1467 results from the list&lt;br /&gt;
::*Along with the name of the program/accelerator, the data included:&lt;br /&gt;
::#Dollar value per team&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Application Site&lt;br /&gt;
::#Accelerator URL&lt;br /&gt;
::*Many entries are not accelerators, from a quick glance through the results, there were various conferences, 3-5 days events, and written literature pertaining to accelerators as well&lt;br /&gt;
::*From a sample size of the first 30 entries, determined 10 to be valid accelerators, 3 incubators, 6 conferences/weekends, and the rest to be miscellaneous entries such as startup events or &amp;quot;studios&amp;quot; (perhaps useful but not relevant to search)&lt;br /&gt;
::*As we go down the list, the number of accelerators proportionately decreases. Can comfortably say that overall accelerator turnout from this website is much less than 33%, probably closer to 10-15%.&lt;br /&gt;
===Review===&lt;br /&gt;
*Potentially useful website if crawler could remove the clutter and target solely the accelerators; very useful for identifying new accelerators since data automatically sorted by date and location.&lt;br /&gt;
*Large list of sources includes many irrelevant results, such as conferences or weekends which are difficult to identify. The name of the sorting category itself, &amp;quot;Accelerator/Program&amp;quot; suggests that many of the results fall under the &amp;quot;Program&amp;quot; section rather than being valid accelerators.&lt;br /&gt;
*Potential site for identifying accelerators, but limited by in-site sorting; useful for URL and perhaps equity, but not very detailed information relating to the accelerator/program.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Source: http://gust.com/usa-canada-accelerator-report-2015/==&lt;br /&gt;
#Selected region of US and Canada&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Top 20 Active Accelerators&amp;quot; and selected &amp;quot;see the full list&amp;quot; near the bottom of the listed accelerators&lt;br /&gt;
#Copied resulting entries into TextPad and sorted out the numbers to leave only the name of the accelerator&lt;br /&gt;
::*Obtained 100 results for different accelerators&lt;br /&gt;
::*Accelerator lists included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Number of Start-ups funded (2015 only)&lt;br /&gt;
::*Accelerator list limited to 2015&lt;br /&gt;
===Review===&lt;br /&gt;
*Website provides its own evaluation of an accelerator's success based on various factors and provides data for larger trends.&lt;br /&gt;
*Usefulness is questionable because website does not provide much except the URL, and all of the entries are based on success in 2015.&lt;br /&gt;
*Other interesting data within website such as &amp;quot;Hot Markets&amp;quot;, investment breakdowns by state, etc. All of this data is also limited to 2015.&lt;br /&gt;
&lt;br /&gt;
==Source: https://bostonstartupsguide.com/guide/every-boston-startup-accelerator-incubator/==&lt;br /&gt;
#Scrolled down to the section labeled &amp;quot;Startup accelerators in Boston&amp;quot;&lt;br /&gt;
#Copied text beginning from &amp;quot;MassChallenge&amp;quot; (the first paragraph was just a general definition of startups) and continued to copy until &amp;quot;Startup Incubators in Boston&amp;quot;&lt;br /&gt;
#After pasting in TextPad, I sorted the data to delete any characters after the &amp;quot;-&amp;quot; and added a sequential number at the beginning of each line&lt;br /&gt;
::*Returned a total of 17 results for startups in Boston&lt;br /&gt;
::*Accelerator list included:&lt;br /&gt;
::#Name and URL&lt;br /&gt;
::#Capital requirements&lt;br /&gt;
::#Application periods and requirements&lt;br /&gt;
::#Paragraph describing accelerator and its goals&lt;br /&gt;
===Review===&lt;br /&gt;
*Although the guide is dated, useful for identifying strong accelerator programs in Boston&lt;br /&gt;
*Limitation: only focuses on Boston, but the description is helpful in identifying the role of the accelerator&lt;br /&gt;
*Limited information on accelerator, not very useful by itself without information from the accelerator URL&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.corporate-accelerators.net/database/==&lt;br /&gt;
#Copied and pasted table into Microsoft Excel (Data was already sorted into categories so no need for TextPad)&lt;br /&gt;
#Table returned 72 references (but there was a link to the bottom to a larger database)&lt;br /&gt;
::*The table itself includes:&lt;br /&gt;
::#Major Company&lt;br /&gt;
::#Accelerator&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Website&lt;br /&gt;
::#Details&lt;br /&gt;
::*The &amp;quot;Details&amp;quot; link led to a variety of other information including:&lt;br /&gt;
::#Status (Active or Inactive)&lt;br /&gt;
::#Locations&lt;br /&gt;
::#Funding&lt;br /&gt;
::#Equity&lt;br /&gt;
::#Term&lt;br /&gt;
::#Cohort Based? (Regular or Irregular)&lt;br /&gt;
::#Pitch Day&lt;br /&gt;
::#Office Space&lt;br /&gt;
::#Powered by&lt;br /&gt;
::#Support Offered?&lt;br /&gt;
::#Launch year&lt;br /&gt;
::#Focus Areas&lt;br /&gt;
::#General Description&lt;br /&gt;
::*Also Included a variety of data regarding the host company as well&lt;br /&gt;
===Review===&lt;br /&gt;
*Solid list for corporate accelerators and also includes a variety of information about the accelerator, the cohorts, etc. Some of the entries are international accelerators however so need to filter them out&lt;br /&gt;
*Only limited to 72 accelerators from major companies&lt;br /&gt;
&lt;br /&gt;
==Source: https://github.com/florianheinemann/www-corporate-accelerators-net/blob/master/_data/Accelerators.json==&lt;br /&gt;
#This source is a .json file from the previous database&lt;br /&gt;
#After placing into TextPad, replaced each space with a ###, replaced each new line with a tab, and replaced each ### with a new line. Ultimately returned 80 results&lt;br /&gt;
::*From the file, the .json includes:&lt;br /&gt;
::#NAICS and NAICS sector &lt;br /&gt;
::#Classification&lt;br /&gt;
::#Sector Description&lt;br /&gt;
::#Term&lt;br /&gt;
::#Goal&lt;br /&gt;
::#Partner&lt;br /&gt;
::*Also includes most of the information from the previous source, since they are undoubtedly linked&lt;br /&gt;
===Review===&lt;br /&gt;
*Another solid list for corporate accelerators with some more information, but ultimately very similar to the previous source.&lt;br /&gt;
&lt;br /&gt;
==Source: https://www.quora.com/Where-can-I-find-a-comprehensive-list-of-startup-incubators-and-accelerators-in-the-US==&lt;br /&gt;
#Since we already looked at the first listed source (seed-db), I clicked on the second link &amp;quot;(by Robert Shedd) http://blog.shedd.us/321987608/&amp;quot; which took me to a page headed &amp;quot;Help for Startups! – A semi-complete list of startup accelerator programs&amp;quot; created by a blogger, Robert Shedd&lt;br /&gt;
#List included 102 entries by the blogger, each of which do look like an accelerator&lt;br /&gt;
::*Upon immediate overview, noticed many results from previous sources were missing. Immediately noticed lack of &amp;quot;OwlSpark&amp;quot;, the accelerator from Rice.&lt;br /&gt;
::*Shedd only offers us the accelerator name plus its URL&lt;br /&gt;
===Review===&lt;br /&gt;
*Nice list to cross-reference with other sources but does not offer much new insight compared to more powerful engines such as seed-db\&lt;br /&gt;
&lt;br /&gt;
=List of Sources Obtained from Various Google Searches=&lt;br /&gt;
&lt;br /&gt;
Summary: These accelerators are taken from a specific Google search rather than a list. The idea is to compile a list of Google searches that return relevant results of accelerators. This will aid in the creation of a future web crawler.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;Location + Accelerator&amp;quot;(Only individual results, not lists)==&lt;br /&gt;
===Houston Accelerators===&lt;br /&gt;
*Examples of single accelerators found&lt;br /&gt;
:#TMCx: http://www.tmc.edu/innovation/innovation-programs/tmcx/&lt;br /&gt;
:#RED labs: http://redlabs.uh.edu/8&lt;br /&gt;
:#SURGE accelerator: https://kirkcoburn.com/&lt;br /&gt;
:#OwlSpark: http://owlspark.com/&lt;br /&gt;
:#NextHIT: http://www.houstonhealthventures.com/nexthit-accelerator-program-application/&lt;br /&gt;
===Los Angeles Accelerators===&lt;br /&gt;
:#Amplify: http://amplify.la/&lt;br /&gt;
:#Y Combinator: https://www.ycombinator.com/&lt;br /&gt;
:#Chicklabs: https://www.chicklabsllc.com/&lt;br /&gt;
:#Disney Accelerator: https://disneyaccelerator.com/&lt;br /&gt;
:#Launchpad: https://launchpad.la/&lt;br /&gt;
===New York Accelerators===&lt;br /&gt;
:#DreamIT Ventures: http://www.dreamit.com/#meaningful-experience&lt;br /&gt;
:#Women Innovate Mobile: http://www.wim.co/&lt;br /&gt;
:#Techstars NYC: http://www.techstars.com/programs/nyc-program/&lt;br /&gt;
:#Entrepreneurs Roundtable: http://eranyc.com/&lt;br /&gt;
:#FirstGrowthVC: http://venturecrush.com/fg/&lt;br /&gt;
:#New York Digital Health Accelerator: http://digitalhealthaccelerator.com/&lt;br /&gt;
:#Grand Central Tech: http://www.grandcentraltech.com/&lt;br /&gt;
:#Accelerator Corp: http://www.acceleratorcorp.com/&lt;br /&gt;
:#New York Startup Lab: http://nystartuplab.com/&lt;br /&gt;
===Review===&lt;br /&gt;
*Some locations return more viable results for a similar sample size. For example, New York returned 9 valid accelerators, whereas Los Angeles and Houston both returned 5 actual accelerators out of the first 20 results: an 80% difference. Some optimization may come from identifying which locations return more accelerators upon searching.&lt;br /&gt;
&lt;br /&gt;
==From &amp;quot;State+Accelerator+List&amp;quot;==&lt;br /&gt;
===New York Accelerator List===&lt;br /&gt;
*http://www.ongridventures.com/resources/new-york-silicon-alley-resources/newyorkaccelerators/ (Ranks 14 accelerators)&lt;br /&gt;
*http://under30ceo.com/11-new-york-tech-incubators-and-accelerators-for-entrepreneurs/ (Ranks 11 accelerators)&lt;br /&gt;
===California Accelerator List===&lt;br /&gt;
*http://www.socaltech.com/the_complete_guide_to_southern_california_accelerators_and_incubators_part_i/s-0040924.html (Lists accelerators in Southern Cali)&lt;br /&gt;
*http://barberacorporatelaw.com/blog/2014/4/8/28-business-incubators-in-the-los-angeles-area (List of 24 accelerators near the LA area)&lt;br /&gt;
===Texas Accelerator List===&lt;br /&gt;
*http://www.austinstartuplist.com/incubators (List of accelerators in Austin, &amp;lt;5 results)&lt;br /&gt;
*http://www.siliconhillsnews.com/2016/09/02/the-top-texas-healthcare-accelerators-and-incubators/ (Modest list of accelerators aiding in healthcare)&lt;br /&gt;
*http://realfoodmba.com/food-startup-accelerators/ (List of food-based accelerators, some of which are in Austin, others of which are international)&lt;br /&gt;
===Colorado Accelerator List===&lt;br /&gt;
*http://www.builtincolorado.com/2015/01/14/best-colorado-accelerators-your-startup (8 results)&lt;br /&gt;
*https://www.quora.com/What-accelerator-programs-are-located-in-Colorado (Quora inquiry yielding modest results)&lt;br /&gt;
===Washington Accelerator List===&lt;br /&gt;
*http://www.geekwire.com/2015/mapping-seattles-incubators-accelerators-and-co-working-spaces/ (Returns 14 results)&lt;br /&gt;
===Oregon Accelerator List===&lt;br /&gt;
*http://www.bizjournals.com/portland/subscriber-only/2016/01/15/incubators-and-accelerators.html (Returns list of 5 accelerators and details)&lt;br /&gt;
*http://www.oregon4biz.com/Innovate-&amp;amp;-Create/R&amp;amp;D-Business/Incubators/ (Returns list of 26 accelerators and incubators)&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
*Seed-DB appears for almost all of the search results&lt;br /&gt;
*Acceleratorinfo appears for most of the search results&lt;br /&gt;
*There are multiple cumulative reports of incubators per location, but not for accelerators&lt;br /&gt;
*Most regionalized accelerator lists deal with either an article or a ranking of a particular amount of accelerators in the area&lt;br /&gt;
*Many results returned nationally ranked lists of accelerators, such as the Forbes list of &amp;quot;Top Accelerators&amp;quot; or something along the lines of &amp;quot;Best Accelerators in the US&amp;quot;. The connection is that perhaps one accelerator mentioned on the list may be located within the searched state.&lt;br /&gt;
*There are also a few results for actual particle accelerators that must be sorted out (i.e. superconducting super collider)&lt;br /&gt;
&lt;br /&gt;
==Found through google searching accelerators found previously==&lt;br /&gt;
'''Found from googling YLE Media Startup Accelerator'''&lt;br /&gt;
*https://www.corporate-accelerators.net/database/index.html (DB of Corporate Accelerators 71-79 entries)&lt;br /&gt;
*http://startupaccelerator.vc/accelerator-corporate-innovation-sig/ (Database of Accelerators and Corporate Innovation 92 entries)&lt;br /&gt;
neither of these have had their entries added to list of accelerators&lt;br /&gt;
&lt;br /&gt;
=Individual Accelerator Evaluations=&lt;br /&gt;
Summary: The purpose of this section is to create instructions for each accelerator on how to find cohort information from their URLs. Along with specific instructions for obtaining the cohorts for each accelerator chosen, there should be a list of easy-to-obtain and relevant statistics regarding the accelerator, such as information about its team, location, etc. The variable statistics list is cumulative, whereas the cohort directions are unique per the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerators Chosen (Format = Name (source))==&lt;br /&gt;
#Blue Startups (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Launchpad LA (http://www.acceleratorinfo.com/see-all.html)&lt;br /&gt;
#Y Combinator (http://www.seed-db.com/accelerators)&lt;br /&gt;
#FlashPoint (http://www.seed-db.com/accelerators/all)&lt;br /&gt;
#Prosper Accelerator (https://www.f6s.com/programs?type)&lt;br /&gt;
#Axel Springer Plug and Play (http://www.axelspringerplugandplay.com/)&lt;br /&gt;
#Techstars (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Startmate (http://www.seed-db.com/accelerators)&lt;br /&gt;
#Capital Factory (http://blog.shedd.us/321987608/)&lt;br /&gt;
#OwlSpark (Google search: &amp;quot;Houston + accelerators&amp;quot;)&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Blue Startups (http://bluestartups.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Track Record&amp;quot; page under the &amp;quot;Home&amp;quot; tab; found total number of graduated cohorts to be 7&lt;br /&gt;
#Navigated to &amp;quot;Portfolio&amp;quot; tab. Tab includes list of all seven graduated cohorts along with companies emerging from each one. Each cohort is listed under a separate page (ex. &amp;quot;Cohort 1&amp;quot;, &amp;quot;Cohort 2&amp;quot;, etc) and at the bottom of each cohort page, there is a link to the other 6. Each company has a short description along with its URL.&lt;br /&gt;
#An &amp;quot;Alumni News&amp;quot; page at the bottom of &amp;quot;Portfolio&amp;quot; includes articles pertinent to graduated startups.&lt;br /&gt;
#Unfortunately does not include the date and year of each cohort class, but perhaps could cross-reference with other sources.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Launchpad LA (http://launchpad.la/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Companies&amp;quot; in the top of the homepage&lt;br /&gt;
#&amp;quot;Companies&amp;quot; returns all companies backed by Launchpad LA based on their class year and number (cohort)&lt;br /&gt;
#:*Also sorted by active startups vs. inactive startups&lt;br /&gt;
#At the bottom of the &amp;quot;Companies&amp;quot; tab, there is a statistical layout returning values for the number of companies started by Launchpad during its time as an accelerator (2012-present), as well as the total funding funneled into the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Y Combinator (http://www.ycombinator.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Scrolled down on the home page and clicked on a link entitled &amp;quot;See all companies&amp;quot;.&lt;br /&gt;
#Navigated to a drop down menu named &amp;quot;All Batches&amp;quot;, and clicked on it to expand the list.&lt;br /&gt;
#List is made up of dates ranging from 2005-2016, and these dates return lists of launched companies including most but not all of their URL's, as well as their launch year.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Flashpoint (http://flashpoint.gatech.edu/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#On upper right corner after animation, there is a tab sign which lets you navigate to a page labeled &amp;quot;Teams&amp;quot;&lt;br /&gt;
#The &amp;quot;Team&amp;quot; page has each batch of companies emerging from Georgia Tech, although it does not include the dates or cohorts of these companies. For example, &amp;quot;Batch 1&amp;quot; at the top of the page just lists the companies in the batch without URLs or any additional information.&lt;br /&gt;
#On the &amp;quot;Application&amp;quot; page on the tab near the top, there is information regarding Batch 7, which begins early 2017. Suggests that batch 6 either ended spring 2016 or fall 2016.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Prosper Women Entrepreneurs (http://www.prosperstl.com)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Navigated to &amp;quot;Accelerator&amp;quot; tab and clicked &amp;quot;Companies&amp;quot; when prompted with the drop down menu.&lt;br /&gt;
#This tab returned all of the launched company logos which then redirected to the company's home page when clicked.&lt;br /&gt;
#No other relevant form of information such as date launched or cohort was included on this page.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Axel Springer Plug and Play(http://www.axelspringerplugandplay.com/)==&lt;br /&gt;
Finding the cohort:&lt;br /&gt;
#Clicked on the &amp;quot;Companies&amp;quot; tab on the home page and was directed to the middle of the page which included a short list of current companies.&lt;br /&gt;
#Clicked on the &amp;quot;All Companies&amp;quot; link which returned a page filled with startup logos and brief descriptions of those startups. When clicked, each logo serves to redirect to that startup's home page.&lt;br /&gt;
#Companies were not sorted by cohort or in any other relevant way.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Techstars (http://www.techstars.com)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the Accelerators tabs and clicked &amp;quot;Companies&amp;quot; on the drop down menu.&lt;br /&gt;
#Firstly, this returns a table comprised of a long list of different classes from different areas separated by years.&lt;br /&gt;
#Upon scrolling down further, each of these classes is broken down by the startups that graduated from them. It also includes information such as how much was invested in each startup, as well as whether or not the startup was acquired, is active, or failed.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Startmate (http://www.startmate.com.au)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startups&amp;quot; tab, which returned a page of all startups that have graduated from Startmate.&lt;br /&gt;
#Startups are separated by year of graduation, and each company is linked on this page.&lt;br /&gt;
#It appears as if each year, 1 cohort is taken through the accelerator.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: Capital Factory (https://capitalfactory.com/accelerate/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the startups tab, which returned a long list of companies that were accelerated by Capital Factory.&lt;br /&gt;
#Each logo for the startups served as a link to their respective websites.&lt;br /&gt;
#There was no evidence or mention of any cohorts.&lt;br /&gt;
&lt;br /&gt;
==Accelerator: OwlSpark (http://entrepreneurship.rice.edu/accelerator/)==&lt;br /&gt;
Finding the cohorts:&lt;br /&gt;
#Navigated to the &amp;quot;Startup Teams&amp;quot; tab, which returned a page that included links to 4 &amp;quot;Classes&amp;quot;.&lt;br /&gt;
#Each class link i.e. (Class 1, Class 2, Class 3, Class 4) returned links to each startup that graduated from the program.&lt;br /&gt;
#These classes signify cohorts.&lt;br /&gt;
&lt;br /&gt;
==List of Promising Variables==&lt;br /&gt;
*Key People (founders, lead entrepreneurs, strategists, etc.)&lt;br /&gt;
*Total number of launched companies&lt;br /&gt;
*A FAQ for application details, accelerator vision, and &lt;br /&gt;
*Funds raised per company (average)&lt;br /&gt;
*Features offered by accelerator (perks, space, tools, etc)&lt;br /&gt;
*General events hosted by the accelerator&lt;br /&gt;
*(Success) stories for graduated start-ups&lt;br /&gt;
&lt;br /&gt;
=E-R Diagram (in list form) for Identifying Attributes to Pull from Accelerators=&lt;br /&gt;
Summary: I will look at different entities within the accelerator page (e.g accelerators, cohorts, founders) and then find potential attributes that can be codified from those entities. Along with the attribute, we list a potential method for pulling that particular attribute. &lt;br /&gt;
&lt;br /&gt;
Format: &lt;br /&gt;
:&amp;lt;u&amp;gt;Entity&amp;lt;/u&amp;gt;&lt;br /&gt;
:*Attribute - Possible sources/ways to get&lt;br /&gt;
&lt;br /&gt;
Ed: &amp;quot;Be creative with finding new attributes to pull!&amp;quot;&lt;br /&gt;
&lt;br /&gt;
==List==&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
*Accelerator Name - Website, external database&lt;br /&gt;
*Contact Form - General contact section in each website &lt;br /&gt;
*Industry focus - can be pulled from description&lt;br /&gt;
*Description - pulled from website itself&lt;br /&gt;
*Takes equity? - Database or from &amp;quot;about&amp;quot; page&lt;br /&gt;
*Non-profit? - Database&lt;br /&gt;
*URL - Already have way of obtaining&lt;br /&gt;
*DNS Registration Date - Already have way of obtaining&lt;br /&gt;
*Address - Google Maps, maybe the website&lt;br /&gt;
*Founding Date - Google Maps, website, server registration&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Features&amp;lt;/u&amp;gt;&lt;br /&gt;
*Mentorship? - Description in website&lt;br /&gt;
*Space Offered - Google Maps, Website description&lt;br /&gt;
*Partnerships - Angel list, Same section as mentorship or events&lt;br /&gt;
*Hosted Events - Calender&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
*Name - Founders or Team Page&lt;br /&gt;
*Title - Directly underneath or next to name&lt;br /&gt;
*PhD? - Biography, webpage under name&lt;br /&gt;
*Serial - Biography&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot; in &amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt; (n) has (n) &amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt; &lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Ventures&amp;lt;/u&amp;gt;&lt;br /&gt;
*Other Companies - Biography, webpage&lt;br /&gt;
*Previous Companies - Biography&lt;br /&gt;
*Net Worth - Forbes, Biography&lt;br /&gt;
*Link back to &amp;quot;Name&amp;quot; in &amp;lt;u&amp;gt;Founders&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Accelerators&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt;&lt;br /&gt;
*Date + Accelerator = Cohort ID - Database or Website&lt;br /&gt;
*Number of Startups - Website, count from &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Cohort Number - Categorization on website, external database&lt;br /&gt;
*Link back to &amp;quot;Accelerator Name&amp;quot;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Cohorts&amp;lt;/u&amp;gt; (1) has (n) &amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;u&amp;gt;Startups&amp;lt;/u&amp;gt;&lt;br /&gt;
*Names - Website, external database&lt;br /&gt;
*State of Inc - Angel List&lt;br /&gt;
*URL - Angel List, website&lt;br /&gt;
*Founding Date - Registration database, Angel List&lt;br /&gt;
*Industry - startup description&lt;br /&gt;
*Founding Location - Angel List&lt;br /&gt;
*Current Location - Angel List&lt;br /&gt;
*VC Raised to Date - SDC Platinum&lt;br /&gt;
*Angel Funds Raised to date - Angel List&lt;br /&gt;
&lt;br /&gt;
==Variables which Distinguish Accelerator Websites==&lt;br /&gt;
*The word &amp;quot;Accelerator&amp;quot;&lt;br /&gt;
**This word appears at least one time on the home page of the vast majority of accelerator websites. The word &amp;quot;Accelerator&amp;quot; appears either as a link to another page on the website or in a title on the homepage of the website. Not many other websites contain this word on their homepage, especially not if one Googles something generic such as &amp;quot;Accelerators in the US&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
*Fixed Term&lt;br /&gt;
**Accelerators normally work with their cohorts for 3 months. This is a major factor which differentiates between an accelerator and any other member of a startup ecosystem. If on their website they mention either &amp;quot;3 months&amp;quot; or &amp;quot;12 weeks&amp;quot;, it is extremely likely that the website belongs to an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Cohorts, Portfolio, Class, or Companies&lt;br /&gt;
**This is a potential variable that could link the websites of many different accelerators. The problem with the word &amp;quot;portfolio&amp;quot; is also used by numerous venture capital firms, which could potentially cause complications when attempting to pull only the sites of accelerators from a Google search. The word &amp;quot;cohort&amp;quot;, however, would have an extremely high probability of identifying the website as belonging to an accelerator. The words &amp;quot;class&amp;quot; and &amp;quot;companies&amp;quot; are promising but do not offer certainty.&lt;br /&gt;
&lt;br /&gt;
*Equity, Investment&lt;br /&gt;
**Although by itself, equity does not mean much, when paired with any of these other terms, it could potentially point to an accelerator. Most accelerators take equity in the form of common stock (6-8%), or they will ask for some alternate form of stake in the company.&lt;br /&gt;
&lt;br /&gt;
*Education and Mentorship&lt;br /&gt;
**Accelerators differ from incubators and angel investors in that they emphasize the education of the potential startup. They offer advice and intense mentorship from more experienced entrepreneurs within their staff, as well as many networking opportunities with the outside world. This variable is more difficult to find on the website of the accelerator, but I believe that if the website includes numerous keywords such as &amp;quot;education&amp;quot;, &amp;quot;mentorship&amp;quot;, or &amp;quot;networking opportunities&amp;quot;, it would be somewhat safe to assume that the website is owned by an accelerator.&lt;br /&gt;
&lt;br /&gt;
*Demo Day&lt;br /&gt;
**This variable does not have tremendous potential in terms of crawling websites, but I feel that it is worth mentioning. Most accelerators &amp;quot;graduate&amp;quot; their cohorts with a demo day, which is a day when the startups present their company to potential investors. If the website contains the words &amp;quot;demo day&amp;quot;, which is fairly uncommon, it could be a good source of accelerator identification.&lt;br /&gt;
&lt;br /&gt;
A combination of any of these variables would certainly identify the current website as belonging to an accelerator.&lt;br /&gt;
&lt;br /&gt;
==Comprehensive List of Accelerators==&lt;br /&gt;
&lt;br /&gt;
All text files saved in &amp;quot;Accelerators&amp;quot; project on the McNair RPD. &lt;br /&gt;
&lt;br /&gt;
*Acc.Info: 190&lt;br /&gt;
*SeedDB: 240&lt;br /&gt;
*SARP: 59&lt;br /&gt;
*Corp: 79&lt;br /&gt;
*Total: 568 results&lt;br /&gt;
&lt;br /&gt;
After removing duplicates and locations: 363 results&lt;br /&gt;
&lt;br /&gt;
Doesn't count f6s, which returns 1170 results, roughly only 300 of which were accelerators. We created a crawler to sift through the webpages and parse HTML so we could identify the accelerators. Program and HTML saved on the Desktop.&lt;br /&gt;
&lt;br /&gt;
==Randomly Chosen Accelerators==&lt;br /&gt;
*TLabs&lt;br /&gt;
*BetaSpring&lt;br /&gt;
*The Unilever Foundry&lt;br /&gt;
*AIA Accelerator&lt;br /&gt;
*R/GA Accelerator&lt;br /&gt;
*Zeroto510&lt;br /&gt;
*Hub:raum&lt;br /&gt;
*Orange Fab&lt;br /&gt;
*Furnace&lt;br /&gt;
*Launch Chapel Hill&lt;br /&gt;
&lt;br /&gt;
===Determining whether or not these are accelerators===&lt;br /&gt;
Googled name of Accelerator and clicked on the first link&lt;br /&gt;
&lt;br /&gt;
Looked for Variables which Distinguish Accelerator Websites&lt;br /&gt;
*TLabs: Homepage states: &amp;quot;Leading Indian Tech Accelerator&amp;quot;; TLabs is an accelerator, but it is located in India.&lt;br /&gt;
*Betaspring: Under the &amp;quot;About Betaspring&amp;quot; tab,  it states that &amp;quot;Betaspring was among the first ten startup accelerators to launch worldwide&amp;quot;.&lt;br /&gt;
*The Unilever Foundry: Does not claim to be an accelerator, nor does it have information on the website about cohorts. This name was pulled from the source Corporate Accelerators.&lt;br /&gt;
*AIA Accelerator: The word &amp;quot;accelerator&amp;quot; is included in the name. Under the &amp;quot;Overview&amp;quot; tab, it states that startups have received mentorship.&lt;br /&gt;
*R/GA Accelerator: Under the &amp;quot;Overview&amp;quot; tab it states that the &amp;quot;R/GA Accelerator is designed for startups and... it is a three month, immersive, mentorship driven program&amp;quot;.&lt;br /&gt;
*Zeroto510: Website contains a &amp;quot;Portfolio Companies&amp;quot; tab which divides up the companies into cohorts. This identifies Zeroto510 as an accelerator.&lt;br /&gt;
*Hub:raum: Offers accelerator and incubator programs; however, none are located in North America.&lt;br /&gt;
*Orange Fab: States on the main page that &amp;quot;We're a 3-month accelerator program&amp;quot;.&lt;br /&gt;
*Furnace: &amp;quot;About&amp;quot; tab states that Furnace is &amp;quot;an innovative startup accelerator designed to form, incubate, and launch new companies&amp;quot;. Concludes with a Demo Day&lt;br /&gt;
*Launch Chapel Hill: Homepage states that they are &amp;quot;a startup accelerator&amp;quot;. Also included on the homepage is a line that states &amp;quot;Applications for Cohort 7 are now open&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
7/10 are accelerators located in the US.&lt;br /&gt;
&lt;br /&gt;
2/10 are accelerators not located in the US.&lt;br /&gt;
&lt;br /&gt;
1/10 is not an accelerator.&lt;br /&gt;
&lt;br /&gt;
===Steps for Extracting Cohort Information===&lt;br /&gt;
*TLabs: Clicked on the &amp;quot;Startup&amp;quot; tab and located a drop down menu entitled &amp;quot;Showing Startups from:&amp;quot;. This menu separates startups into Batches ranging from 1-9. These batches are cohorts.&lt;br /&gt;
*Betaspring: This website does not have a &amp;quot;Companies&amp;quot; or &amp;quot;Startups&amp;quot; tab. I clicked on their &amp;quot;Who&amp;quot; tab and noticed that within this section were two links called &amp;quot;Our portfolio&amp;quot; and &amp;quot;Our companies&amp;quot; which both linked to the same place. This place contained a list of the startups that Betaspring has funded, as well as links to each of the startup websites. The list was not separated into cohorts.&lt;br /&gt;
*The Unilever Foundry: Does not have a &amp;quot;Startups&amp;quot; or &amp;quot;Companies&amp;quot; link on the website.&lt;br /&gt;
*AIA Accelerator: Clicked on the &amp;quot;Startups&amp;quot; tab which returned a page with 5 companies and a bit of information on each of these companies. Also included the URL to each startup. However, the companies were not separated into cohorts, probably because there are so few of them.&lt;br /&gt;
*R/GA Accelerator: Clicked on the &amp;quot;Alumni&amp;quot; tab and navigated down the webpage. Startups are separated by class, which means cohort in this case. Startup info contains link to demo day presentation as well as the startup url.&lt;br /&gt;
*Zeroto510: Hovered over the &amp;quot;About Us&amp;quot; drop down menu and clicked on the &amp;quot;Portfolio Companies&amp;quot; link. Startups are separated by cohort, one for each year, starting from 2013. &lt;br /&gt;
*Hub:raum: Clicked on the &amp;quot;Portfolio&amp;quot; tab. Directed to a page with many names of startups, as well as a brief description of what their company is about. Also includes a link to each startup's website. Startups are not separated into cohorts, but rather by investment by location, current participants, and alumni.&lt;br /&gt;
*Orange Fab: Clicked on the &amp;quot;Startups&amp;quot; tab and was directed to a different page. Startups are not only separated into cohorts named &amp;quot;Seasons&amp;quot;, but they are also separated by industry.&lt;br /&gt;
*Furnace: Clicked on &amp;quot;Portfolio&amp;quot; tab, but unfortunately the website is broken and it returned an error in code.&lt;br /&gt;
*Launch Chapel Hill: Clicked on the &amp;quot;Ventures&amp;quot; tab and was directed to a page in which all startups were separated into cohorts, and a brief description of the startup was provided underneath their logo.&lt;br /&gt;
&lt;br /&gt;
=Code=&lt;br /&gt;
&lt;br /&gt;
The directory for all data related to this project is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==F6S Web Crawler==&lt;br /&gt;
&lt;br /&gt;
This is a python script using the selenium library that retrieves the html content of each page on F6S's North American Accelerator search results. The script is located in:&lt;br /&gt;
&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs &lt;br /&gt;
&lt;br /&gt;
The script is titled f6s_crawler_gentle.py&lt;br /&gt;
&lt;br /&gt;
When run, the script visits the F6S search page for North American Accelerator's and begins retrieving the HTML of each page in that search list. &lt;br /&gt;
NOTE: Timing must be spaced out between all interactions with the browser. F6S has Captcha, and the program will fail if the site receives too many hit requests, or has any inkling that it is being probed by a bot.&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files are stored in: &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files&lt;br /&gt;
&lt;br /&gt;
The Accelerator HTML files stored as text files are stored in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs\Accelerator_HTML_files_text&lt;br /&gt;
&lt;br /&gt;
==F6S Parser==&lt;br /&gt;
The next step is to take the HTML files retrieved by the crawler and to parse them for necessary information. This parser should also determine whether or not the site is an accelerator site. &lt;br /&gt;
&lt;br /&gt;
The code for the parser is located in &lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
It is titled f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
To run the code, open the file in Komodo and press play. &lt;br /&gt;
If running from the command line, change to the correct directory and run the following comand:&lt;br /&gt;
 python f6s_parser.py&lt;br /&gt;
&lt;br /&gt;
The list of accelerators that passed through the parser is in the same directory:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\F6S Accelerator HTMLs&lt;br /&gt;
&lt;br /&gt;
The tab delimited text file is named AcceleratorList.&lt;br /&gt;
The file contains the names of the accelerators that had the keywords listed in the file. Also, the file contains the run dates and location of the accelerator if it was listed on the f6s page.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==F6S API==&lt;br /&gt;
F6S has an API, but we have had no success getting a key to the API. The link to get a key to the API is on [https://www.f6s.com/developers/apis/deal-feed this page].&lt;br /&gt;
&lt;br /&gt;
I (Peter) have emailed F6S to ask for a key directly at support@f6s.com. As of the end of the Fall 2016 Semester, they have not responded.&lt;br /&gt;
&lt;br /&gt;
FUN FACT (MASS-RENAME FILES USING WINDOWS POWER SHELL):&lt;br /&gt;
&lt;br /&gt;
The following command allowed me to append &amp;quot;.txt&amp;quot; to all files in a folder once in the proper directory:&lt;br /&gt;
 Get-ChildItem * | Rename-Item -NewName { $_.name + '.txt'}&lt;br /&gt;
&lt;br /&gt;
To change file formats, Microsoft suggests:&lt;br /&gt;
 Get-ChildItem *.txt | Rename-Item -NewName { $_.name -Replace '\.txt', '.log'}&lt;br /&gt;
&lt;br /&gt;
==Final Data==&lt;br /&gt;
The Parser for parsing the text files of accelerator data is located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
The Parser for parsing the cohort files of accelerator data is also located in:&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data&lt;br /&gt;
&lt;br /&gt;
This folder contains the Python parsers. The Final_data folder contains the tab-delimited text files of parsed data. final_accelerator_data.txt contains the generalized data saved in .txt files and final_cohort_data.txt contains the cohort data saved in .cohort.txt files.&lt;br /&gt;
&lt;br /&gt;
All the files entitled accelerator_data are subsets of the final_accelerator_data.txt file, but each file contains only the accelerators that matched to the flag specified in the file title.&lt;br /&gt;
&lt;br /&gt;
find_headers .py finds a set of the headers for all the cohort files from the seed list project.&lt;br /&gt;
&lt;br /&gt;
==Google SiteSearch==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Google_SiteSearch&lt;br /&gt;
This folder contains code for a google search parser. The script sitesearch.py will search for a queried company and return a likely web address for that company.&lt;br /&gt;
&lt;br /&gt;
==Way Back Machine Parser==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\wayback_machine.py&lt;br /&gt;
This script takes URLs and returns a timestamp for the oldest documented webpage under that URL courtesy of the Way Back Machine Archive.&lt;br /&gt;
&lt;br /&gt;
==Process Locations==&lt;br /&gt;
 E:\McNair\Projects\Accelerators\Code+Final_Data\process_locations.py&lt;br /&gt;
This script takes a physical address and converts it into latitude and longitude coordinates. Should be used in conjunction with the Enclosing Circle program to find the concentration of accelerators.&lt;br /&gt;
 E:\McNair\Software\CodeBase\EnclosingCircle.py&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=16427</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=16427"/>
		<updated>2017-03-21T19:58:24Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
[[Category:Internal]]&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;br /&gt;
3/01/17 14:00 - 16:00&lt;br /&gt;
*Worked with Ben and Matthew to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
3/02/17 14:00 - 16:00&lt;br /&gt;
*Tried to repeat the VC data pull without it crashing from pulling too many entries. Unfortunately, we were unable to finish it&lt;br /&gt;
3/06/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to put final touches on the cohort data to prep it for matching with our VC data&lt;br /&gt;
3/07/17 14:00 - 16:00&lt;br /&gt;
*Finally finished working on the cohort files, will match on the 8th&lt;br /&gt;
3/08/17 14:00 - 16:00&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
3/20/17 14:00 - 16:00&lt;br /&gt;
*Participated in a SQL training session with Ed, learned how to create a database and to pull tab delimited information from text files onto a table&lt;br /&gt;
3/21/17 14:00 - 16:00&lt;br /&gt;
*Met with Ed and arrived at the conclusion of finishing the draft for a report by the end of the semester. Put the initial report information on the accelerator page using the variables that we currently have&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=14409</id>
		<title>Shrey Agarwal (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Shrey_Agarwal_(Work_Log)&amp;diff=14409"/>
		<updated>2017-02-28T21:50:06Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;09/27/2016 14:00 - 17:00: &lt;br /&gt;
*Set up personal and work log pages, accessed Remote Desktop. &lt;br /&gt;
*Compiled list of accelerators from Wiki&lt;br /&gt;
09/29/2016 14:00 - 16:15; 16:45 - 17:30:&lt;br /&gt;
*Created new project: [[Accelerator Seed List (Data)]] and worked with Dr. Egan to create schematic for data entry.&lt;br /&gt;
*Evaluated 3 sources and logged data. Sources were taken from [[List of Accelerators]]. Logged each step onto project page and identified categories that would be suitable for web crawling sometime in the future.&lt;br /&gt;
10/11/2016 14:00 - 17:30;&lt;br /&gt;
*Explored how to use regular expressions in TextPad to aid with data sorting (need to review expressions with Dr. Egan in future)&lt;br /&gt;
*Continued evaluating sources from [[List of Accelerators]] and recorded steps onto project page, as before. Finished evaluating the six sources from initial list. (All work done in [[Accelerator Seed List (Data)]])&lt;br /&gt;
10/13/2016 14:00 - 17:00;&lt;br /&gt;
*All work done in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Talked to Dr. Egan about project going forward. Need to pick out 10-15 accelerators from the sources listed on my project page and identify a reliable method for obtaining cohort information, as well as other variables&lt;br /&gt;
*Used google searches to identify more sources, and evaluated three databases with the help of TextPad&lt;br /&gt;
*Began working on more generic google searches. Was able to go through &amp;quot;Location+accelerator&amp;quot;-type searches today. Will continue next time.&lt;br /&gt;
[[Category:Internal]]&lt;br /&gt;
10/18/2016 14:00 - 17:30;&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Took a sample size of 10 accelerators and detailed how to extract cohort information, as well as what other information is readily available from accelerator URLs.&lt;br /&gt;
*Brought Matthew up to speed on accelerator project, added summaries to each section so they became easier to follow, and worked with him to finish up extracting cohort information&lt;br /&gt;
10/20/16 14:30 - 17:30:&lt;br /&gt;
*Work continued in [[Accelerator Seed List (Data)]]&lt;br /&gt;
*Finished up the list of instructions for finding the cohort. Continued compiling the list of variables for each of the accelerators within the sample size.&lt;br /&gt;
*Consulted Peter on prospects of creating a web crawler with the information we currently have compiled. Determined it was possible, although beyond the scope of Peter's knowledge.&lt;br /&gt;
10/25/16 14:00 - 17:00&lt;br /&gt;
*Consulted Ed with next step for project.&lt;br /&gt;
*Began listing the E-R diagram onto the accelerator database page where entities were potential categories and each entity had its associated attributes&lt;br /&gt;
10/27/16 14:00 - 17:00&lt;br /&gt;
*Continued working with Matthew to identify elements in the E-R diagram for pulling information on accelerators. &lt;br /&gt;
*Found sources to obtain/cross-reference information (ie. Angel List)&lt;br /&gt;
11/08/16 14:00 - 18:00&lt;br /&gt;
*Identified possible keywords to filter results through for accelerators&lt;br /&gt;
*Began compiling a comprehensive list of accelerators based on the data we have already sifted through.&lt;br /&gt;
*Learned how to use regular expressions from Ben to sort names individually and alphabetically.&lt;br /&gt;
11/10/16 14:00 - 18:00&lt;br /&gt;
*Began sorting through accelerator list and removing duplicates, as well as identifying more places to pull names from.&lt;br /&gt;
*Worked with Peter to create a crawl for f6s because the website does not return only accelerators.&lt;br /&gt;
11/15/16 14:00 - 18:00&lt;br /&gt;
*Took a break from f6s to locate more lists based on individual google searches such as &amp;quot;city+accelerator+list&amp;quot;&lt;br /&gt;
*Put Seed DB information into an excel file on the remote desktop&lt;br /&gt;
11/17/16 14:00 - 16:00&lt;br /&gt;
*Continued filling out information for the random Google Searches&lt;br /&gt;
*Organized TextPad files on the RDP into coherent excel spreadsheets with proper headers on the table&lt;br /&gt;
*Noticed problem with f6s: it seems although all of the html coding was protected by a captcha so the crawler did not actually extract any information; it was all blocked.&lt;br /&gt;
11/22/16 14:00 - 17:00&lt;br /&gt;
*Worked to fix f6s crawler with Peter&lt;br /&gt;
*Finished and compiled master list of accelerators&lt;br /&gt;
12/01/16 14:00 - 18:00&lt;br /&gt;
*Caught up on project with Ed and Carlin&lt;br /&gt;
*Took 20 accelerators (241-260) from the list and filled out text.html files for them; finished the 20&lt;br /&gt;
12/05/16 13:00 - 16:00&lt;br /&gt;
*After finishing first 20 accelerators, continued working down the list, beginning at 321&lt;br /&gt;
*Work noted in [[Accelerator Seed List (Data)]], but mostly stored on McNair RDP&lt;br /&gt;
12/06/16 14:00 - 18:00&lt;br /&gt;
*Continued &amp;quot;Accelerating&amp;quot; down the list in [[Accelerator Seed List (Data)]], finished up until 340&lt;br /&gt;
12/08/16 14:00 - 17:00&lt;br /&gt;
*Continued working on accelerator list on the same page.&lt;br /&gt;
01/17/17 14:00 - 16:00&lt;br /&gt;
*Finished up &amp;quot;accelerating&amp;quot; from [[Accelerator Seed List (Data)]], numbers 341-351&lt;br /&gt;
1/18/17 14:00 - 16:00&lt;br /&gt;
*Finished accelerating for sure, went back and began an overview of the work done for quality control.&lt;br /&gt;
01/20/17 14:00 - 16:00&lt;br /&gt;
*Mandatory meeting, then worked through 2 of Ed's unfinished accelerators&lt;br /&gt;
1/23/17 14:00 - 16:00&lt;br /&gt;
*Worked with Matthew to go over about 70 items in the accelerator list and ensure that they follow a uniform structure and show correct information&lt;br /&gt;
1/24/17 14:00 - 16:00&lt;br /&gt;
*Worked with Peter to fix the problem with results not coming through on the new spreadsheet by renaming the file and including more symbols in the searches. Spreadsheet should be up to date now.&lt;br /&gt;
*Got to number 144 on the list while going through files.&lt;br /&gt;
1/25/17 14:00 - 16;00&lt;br /&gt;
*Continued looking through the list and fixing wrong entries or reporting them&lt;br /&gt;
1/26/17 14:00 - 16:00&lt;br /&gt;
*Talked with Ed about project going forward and tried to access the Crunchbase API with Peter to crawl for start-up companies.&lt;br /&gt;
*Continued working through the accelerator list, stopped at number 186.&lt;br /&gt;
1/27/17 14:00 - 16:00&lt;br /&gt;
*Continued looking through accelerator list and fixing any entries with error. Got to number 261.&lt;br /&gt;
1/30/17 14:30 - 16:30&lt;br /&gt;
*Got through about 425&lt;br /&gt;
1/31/17 14:00 - 16:00&lt;br /&gt;
*Got to number 502&lt;br /&gt;
2/01/17 14:00 - 16:00&lt;br /&gt;
*Finished looking through the initial list of accelerators and writing down which ones needed to be modified or completed (through 551)&lt;br /&gt;
2/03/17 14:00 - 17:00&lt;br /&gt;
*Finished about 30 entries for the accelerator entries that still needed to be completed. Worked out of the &amp;quot;NOT DONE&amp;quot; file in the server (which is now blank because everything is finished)&lt;br /&gt;
2/06/17 14:00 - 16:00&lt;br /&gt;
*Developed a standardized format for the text files with Matthew. Instructions are under &amp;quot;standardized format&amp;quot; in the accelerator seed list portion. I started at number 226 and standardized formats up until 370.&lt;br /&gt;
2/07/17 14:00-16:00&lt;br /&gt;
*Continued work from yesterday, completed up to number 488 from the list. Will likely need one more day to finish.&lt;br /&gt;
2/08/17 14:00 - 16:00&lt;br /&gt;
*Finished standardizing the txt files for use on the excel spreadsheet, compiled the data and examined the resultant tables. Realized we needed to fix some categories in the cohort files.&lt;br /&gt;
2/09/17 14:00 - 17:00&lt;br /&gt;
*Worked with Ed on a side project trying to gather information on climate change thanks to Baker's article on the Wall Street Journal&lt;br /&gt;
*Gathered information on climate change in relation to high-growth, high-risk innovation and organizations that deal with things such as carbon credits&lt;br /&gt;
2/10/17 14:00 - 17:00&lt;br /&gt;
*Realized that blog post was ambitious because we could not really find a clear purpose from the information we gathered, nor could we find a unique angle. Held off on the idea&lt;br /&gt;
*Went back to organizing the new columns and headers on the text file by identifying areas of error in the excel spreadsheet&lt;br /&gt;
2/15/17 14:00 - 16:00&lt;br /&gt;
*Spoke with Ed about free enterprise while he lectured all of us. It took about an hour.&lt;br /&gt;
*Looked at plans for project going forward including using linkedin to search the founders&lt;br /&gt;
2/20/17 14:00 - 16:00&lt;br /&gt;
*Found our first source for expanding the project into incubators, from angel.co. Seems similar to f6s in that we can crawl it and obtain a list of incubators and their various counterparts. &lt;br /&gt;
2/21/17 14:00 - 16:00&lt;br /&gt;
*Found more sources for incubators by reading through quora discussions and masters theses. Bookmarked these pages so that I could put them into text files after.&lt;br /&gt;
2/23/17 14:00 - 18:00&lt;br /&gt;
*Converted incubator files to text-pad and saved them (4 total), then cleaned them up through regex&lt;br /&gt;
*Took the cohort text file, put it into excel, and proceeded to clean up all of the mistakes in the excel document, particularly bad data or mistakes with organizations. Got through Y-Combinator.&lt;br /&gt;
2/24/17 14:00 - 16:00&lt;br /&gt;
*Finished up cleaning the cohort data for the names and the descriptions, but there still needs to be work done on the other stuff like dates and programs&lt;br /&gt;
2/28/17 14:00 - 16:00&lt;br /&gt;
*Created page [[Hub-Based Venture Firms]] and proceeded to research VC in Hubs listed on under E:\McNair\Projects\Hubs\summer 2016\Hubs Variables - Ariel.xls&lt;br /&gt;
*Looked at details such as whether they have in-house funds, whether they co-invest, focuses, and amounts invested.&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14407</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14407"/>
		<updated>2017-02-28T21:39:09Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|RAs=Shrey Agarwal,&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=Benjamin's Desk=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Coworking space for mobile technology&lt;br /&gt;
*Showcases work to potential investors, does not invest on its own&lt;br /&gt;
*Perks with membership&lt;br /&gt;
=GSV Labs=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*invests with listed partners&lt;br /&gt;
*over $250m raised in VC 2015&lt;br /&gt;
*Founded 2012&lt;br /&gt;
*focuses: big-data, edtech, entertainment, sustainability, and mobile&lt;br /&gt;
*does not seem to co-invest&lt;br /&gt;
*partners with silicon valley investors&lt;br /&gt;
=Founders Floor=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We are a seed stage investor in technology companies through our private syndicate fund.&amp;quot;&lt;br /&gt;
*Co-invests? yes &amp;quot;For larger seed rounds we regularly introduce our portfolio companies to in-network seed stage VC’s for co-investment opportunities&amp;quot;&lt;br /&gt;
*Also has a pseudo-accelerator and coworking space&lt;br /&gt;
*Founded fund in 2014, classified as micro VC(&amp;lt;&amp;lt;$100m)&lt;br /&gt;
=CyberTech=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*global space for cybersecurity and IoT&lt;br /&gt;
*paid membership, plus an incubator program&lt;br /&gt;
*hosts events around the US&lt;br /&gt;
=Innovation Pavilion=&lt;br /&gt;
In-house operating fund?  NO&lt;br /&gt;
*Never mentions in-house VC&lt;br /&gt;
*place for connecting VC and startups&lt;br /&gt;
*&amp;quot;At the heart of Innovation Pavilion is the entrepreneur.  We have a network  consisting of investors, supply chain manufacturers,  service providers and professional services  like  financial, legal and marketing. We also encourage collaboration through events and workshops at our facilities.&amp;quot; No onsite VC&lt;br /&gt;
=BestHQ=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*connects startups and VC&lt;br /&gt;
*collaboration/coworking space&lt;br /&gt;
*paid membership&lt;br /&gt;
=Work Hard Pittsburgh=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Current model launched in 2016, serves as a business incubator and a co-working space&lt;br /&gt;
*No mention of VC&lt;br /&gt;
=Tampa Bay Wave=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*space connecting entrepreneurs and startups&lt;br /&gt;
*began a startup accelerator that launched in 2013&lt;br /&gt;
=Think Big Partners=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Think Big Partners helps companies grow faster, smarter and more efficiently through our network of in-house services, community members and national partners. We push the boundaries of the status quo and help entrepreneurs create game changing companies and technologies.&amp;quot;&lt;br /&gt;
*Not much more information on website than that, definitely no on-site VC&lt;br /&gt;
=Hacker Lab=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*coworking space with classes, a &amp;quot;hacker space&amp;quot;, and a &amp;quot;maker space&amp;quot;&lt;br /&gt;
*Does not invest&lt;br /&gt;
=Geekdom=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Has its own &amp;quot;Geekdom Fund&amp;quot; which invests in early stage IT startups&lt;br /&gt;
*focus: IT, tech&lt;br /&gt;
*Raised $6.68m in 2014, classifies as micro VC&lt;br /&gt;
*coinvests with partner network&lt;br /&gt;
=Epicenter=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*part of the Google for Entrepreneurs initiative, strictly a working space&lt;br /&gt;
*connects entrepreneurs to coworking space with hands-on mentoring&lt;br /&gt;
=Awesome Inc=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates funds as a Micro VC to help startups partner with other companies&lt;br /&gt;
*Raised about $4.5m from 2015-2016, founded in 2016&lt;br /&gt;
*Focuses on education, many programs to teach high schoolers and middle schoolers basic STEM&lt;br /&gt;
*Reasonable to expect co-investing with these circumstances&lt;br /&gt;
=Learn Launch=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates a small fund for seed investments into its Ed-tech accelerator&lt;br /&gt;
*Micro VC providing up to $120k to each startup and working with VC partners to provide more&lt;br /&gt;
*focus: education&lt;br /&gt;
*Fund began as late as 2016, 2017 marks the second Accelerator fund&lt;br /&gt;
=Catapult Chicago=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*community of solely tech startups&lt;br /&gt;
*Bunch of amenities for membership, including forums, programming, advisors, and &amp;quot;Everest&amp;quot; program&lt;br /&gt;
=Velocity=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Very small fund in the ecosystem&lt;br /&gt;
*invests from $10-25k to early stage and seed companies&lt;br /&gt;
*Primarily connects companies together in Southern Indiana&lt;br /&gt;
*Certainly co-invests&lt;br /&gt;
*Funding seems to have started 2015&lt;br /&gt;
*focus is tech&lt;br /&gt;
=Tech Ranch=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Education for entrepreneurs, thinking space for ventures since 2008&lt;br /&gt;
*Focus on networking&lt;br /&gt;
*no mention of VC or funding&lt;br /&gt;
=ReSET=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Paid membership&lt;br /&gt;
*nonprofit with no personal fund&lt;br /&gt;
*&amp;quot;Its strategic goals are threefold: to be the “go-to” place for impact entrepreneurs, to make Hartford the Impact City, and Connecticut the social enterprise state.&amp;quot;&lt;br /&gt;
*focus: social entrepreneurship, enterprise&lt;br /&gt;
=The Atlanta Tech Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Just seems to be a collaboration space&lt;br /&gt;
*Has private offices and communal places for networking and interactions&lt;br /&gt;
*Has &amp;quot;scholarship&amp;quot; to waive membership fees, but that does not count as a fund.&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14406</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14406"/>
		<updated>2017-02-28T21:38:47Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|Author=Shrey Agarwal,&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=Benjamin's Desk=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Coworking space for mobile technology&lt;br /&gt;
*Showcases work to potential investors, does not invest on its own&lt;br /&gt;
*Perks with membership&lt;br /&gt;
=GSV Labs=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*invests with listed partners&lt;br /&gt;
*over $250m raised in VC 2015&lt;br /&gt;
*Founded 2012&lt;br /&gt;
*focuses: big-data, edtech, entertainment, sustainability, and mobile&lt;br /&gt;
*does not seem to co-invest&lt;br /&gt;
*partners with silicon valley investors&lt;br /&gt;
=Founders Floor=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We are a seed stage investor in technology companies through our private syndicate fund.&amp;quot;&lt;br /&gt;
*Co-invests? yes &amp;quot;For larger seed rounds we regularly introduce our portfolio companies to in-network seed stage VC’s for co-investment opportunities&amp;quot;&lt;br /&gt;
*Also has a pseudo-accelerator and coworking space&lt;br /&gt;
*Founded fund in 2014, classified as micro VC(&amp;lt;&amp;lt;$100m)&lt;br /&gt;
=CyberTech=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*global space for cybersecurity and IoT&lt;br /&gt;
*paid membership, plus an incubator program&lt;br /&gt;
*hosts events around the US&lt;br /&gt;
=Innovation Pavilion=&lt;br /&gt;
In-house operating fund?  NO&lt;br /&gt;
*Never mentions in-house VC&lt;br /&gt;
*place for connecting VC and startups&lt;br /&gt;
*&amp;quot;At the heart of Innovation Pavilion is the entrepreneur.  We have a network  consisting of investors, supply chain manufacturers,  service providers and professional services  like  financial, legal and marketing. We also encourage collaboration through events and workshops at our facilities.&amp;quot; No onsite VC&lt;br /&gt;
=BestHQ=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*connects startups and VC&lt;br /&gt;
*collaboration/coworking space&lt;br /&gt;
*paid membership&lt;br /&gt;
=Work Hard Pittsburgh=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Current model launched in 2016, serves as a business incubator and a co-working space&lt;br /&gt;
*No mention of VC&lt;br /&gt;
=Tampa Bay Wave=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*space connecting entrepreneurs and startups&lt;br /&gt;
*began a startup accelerator that launched in 2013&lt;br /&gt;
=Think Big Partners=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Think Big Partners helps companies grow faster, smarter and more efficiently through our network of in-house services, community members and national partners. We push the boundaries of the status quo and help entrepreneurs create game changing companies and technologies.&amp;quot;&lt;br /&gt;
*Not much more information on website than that, definitely no on-site VC&lt;br /&gt;
=Hacker Lab=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*coworking space with classes, a &amp;quot;hacker space&amp;quot;, and a &amp;quot;maker space&amp;quot;&lt;br /&gt;
*Does not invest&lt;br /&gt;
=Geekdom=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Has its own &amp;quot;Geekdom Fund&amp;quot; which invests in early stage IT startups&lt;br /&gt;
*focus: IT, tech&lt;br /&gt;
*Raised $6.68m in 2014, classifies as micro VC&lt;br /&gt;
*coinvests with partner network&lt;br /&gt;
=Epicenter=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*part of the Google for Entrepreneurs initiative, strictly a working space&lt;br /&gt;
*connects entrepreneurs to coworking space with hands-on mentoring&lt;br /&gt;
=Awesome Inc=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates funds as a Micro VC to help startups partner with other companies&lt;br /&gt;
*Raised about $4.5m from 2015-2016, founded in 2016&lt;br /&gt;
*Focuses on education, many programs to teach high schoolers and middle schoolers basic STEM&lt;br /&gt;
*Reasonable to expect co-investing with these circumstances&lt;br /&gt;
=Learn Launch=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates a small fund for seed investments into its Ed-tech accelerator&lt;br /&gt;
*Micro VC providing up to $120k to each startup and working with VC partners to provide more&lt;br /&gt;
*focus: education&lt;br /&gt;
*Fund began as late as 2016, 2017 marks the second Accelerator fund&lt;br /&gt;
=Catapult Chicago=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*community of solely tech startups&lt;br /&gt;
*Bunch of amenities for membership, including forums, programming, advisors, and &amp;quot;Everest&amp;quot; program&lt;br /&gt;
=Velocity=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Very small fund in the ecosystem&lt;br /&gt;
*invests from $10-25k to early stage and seed companies&lt;br /&gt;
*Primarily connects companies together in Southern Indiana&lt;br /&gt;
*Certainly co-invests&lt;br /&gt;
*Funding seems to have started 2015&lt;br /&gt;
*focus is tech&lt;br /&gt;
=Tech Ranch=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Education for entrepreneurs, thinking space for ventures since 2008&lt;br /&gt;
*Focus on networking&lt;br /&gt;
*no mention of VC or funding&lt;br /&gt;
=ReSET=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Paid membership&lt;br /&gt;
*nonprofit with no personal fund&lt;br /&gt;
*&amp;quot;Its strategic goals are threefold: to be the “go-to” place for impact entrepreneurs, to make Hartford the Impact City, and Connecticut the social enterprise state.&amp;quot;&lt;br /&gt;
*focus: social entrepreneurship, enterprise&lt;br /&gt;
=The Atlanta Tech Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Just seems to be a collaboration space&lt;br /&gt;
*Has private offices and communal places for networking and interactions&lt;br /&gt;
*Has &amp;quot;scholarship&amp;quot; to waive membership fees, but that does not count as a fund.&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14405</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14405"/>
		<updated>2017-02-28T21:38:33Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=Benjamin's Desk=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Coworking space for mobile technology&lt;br /&gt;
*Showcases work to potential investors, does not invest on its own&lt;br /&gt;
*Perks with membership&lt;br /&gt;
=GSV Labs=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*invests with listed partners&lt;br /&gt;
*over $250m raised in VC 2015&lt;br /&gt;
*Founded 2012&lt;br /&gt;
*focuses: big-data, edtech, entertainment, sustainability, and mobile&lt;br /&gt;
*does not seem to co-invest&lt;br /&gt;
*partners with silicon valley investors&lt;br /&gt;
=Founders Floor=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We are a seed stage investor in technology companies through our private syndicate fund.&amp;quot;&lt;br /&gt;
*Co-invests? yes &amp;quot;For larger seed rounds we regularly introduce our portfolio companies to in-network seed stage VC’s for co-investment opportunities&amp;quot;&lt;br /&gt;
*Also has a pseudo-accelerator and coworking space&lt;br /&gt;
*Founded fund in 2014, classified as micro VC(&amp;lt;&amp;lt;$100m)&lt;br /&gt;
=CyberTech=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*global space for cybersecurity and IoT&lt;br /&gt;
*paid membership, plus an incubator program&lt;br /&gt;
*hosts events around the US&lt;br /&gt;
=Innovation Pavilion=&lt;br /&gt;
In-house operating fund?  NO&lt;br /&gt;
*Never mentions in-house VC&lt;br /&gt;
*place for connecting VC and startups&lt;br /&gt;
*&amp;quot;At the heart of Innovation Pavilion is the entrepreneur.  We have a network  consisting of investors, supply chain manufacturers,  service providers and professional services  like  financial, legal and marketing. We also encourage collaboration through events and workshops at our facilities.&amp;quot; No onsite VC&lt;br /&gt;
=BestHQ=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*connects startups and VC&lt;br /&gt;
*collaboration/coworking space&lt;br /&gt;
*paid membership&lt;br /&gt;
=Work Hard Pittsburgh=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Current model launched in 2016, serves as a business incubator and a co-working space&lt;br /&gt;
*No mention of VC&lt;br /&gt;
=Tampa Bay Wave=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*space connecting entrepreneurs and startups&lt;br /&gt;
*began a startup accelerator that launched in 2013&lt;br /&gt;
=Think Big Partners=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Think Big Partners helps companies grow faster, smarter and more efficiently through our network of in-house services, community members and national partners. We push the boundaries of the status quo and help entrepreneurs create game changing companies and technologies.&amp;quot;&lt;br /&gt;
*Not much more information on website than that, definitely no on-site VC&lt;br /&gt;
=Hacker Lab=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*coworking space with classes, a &amp;quot;hacker space&amp;quot;, and a &amp;quot;maker space&amp;quot;&lt;br /&gt;
*Does not invest&lt;br /&gt;
=Geekdom=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Has its own &amp;quot;Geekdom Fund&amp;quot; which invests in early stage IT startups&lt;br /&gt;
*focus: IT, tech&lt;br /&gt;
*Raised $6.68m in 2014, classifies as micro VC&lt;br /&gt;
*coinvests with partner network&lt;br /&gt;
=Epicenter=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*part of the Google for Entrepreneurs initiative, strictly a working space&lt;br /&gt;
*connects entrepreneurs to coworking space with hands-on mentoring&lt;br /&gt;
=Awesome Inc=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates funds as a Micro VC to help startups partner with other companies&lt;br /&gt;
*Raised about $4.5m from 2015-2016, founded in 2016&lt;br /&gt;
*Focuses on education, many programs to teach high schoolers and middle schoolers basic STEM&lt;br /&gt;
*Reasonable to expect co-investing with these circumstances&lt;br /&gt;
=Learn Launch=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates a small fund for seed investments into its Ed-tech accelerator&lt;br /&gt;
*Micro VC providing up to $120k to each startup and working with VC partners to provide more&lt;br /&gt;
*focus: education&lt;br /&gt;
*Fund began as late as 2016, 2017 marks the second Accelerator fund&lt;br /&gt;
=Catapult Chicago=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*community of solely tech startups&lt;br /&gt;
*Bunch of amenities for membership, including forums, programming, advisors, and &amp;quot;Everest&amp;quot; program&lt;br /&gt;
=Velocity=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Very small fund in the ecosystem&lt;br /&gt;
*invests from $10-25k to early stage and seed companies&lt;br /&gt;
*Primarily connects companies together in Southern Indiana&lt;br /&gt;
*Certainly co-invests&lt;br /&gt;
*Funding seems to have started 2015&lt;br /&gt;
*focus is tech&lt;br /&gt;
=Tech Ranch=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Education for entrepreneurs, thinking space for ventures since 2008&lt;br /&gt;
*Focus on networking&lt;br /&gt;
*no mention of VC or funding&lt;br /&gt;
=ReSET=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Paid membership&lt;br /&gt;
*nonprofit with no personal fund&lt;br /&gt;
*&amp;quot;Its strategic goals are threefold: to be the “go-to” place for impact entrepreneurs, to make Hartford the Impact City, and Connecticut the social enterprise state.&amp;quot;&lt;br /&gt;
*focus: social entrepreneurship, enterprise&lt;br /&gt;
=The Atlanta Tech Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Just seems to be a collaboration space&lt;br /&gt;
*Has private offices and communal places for networking and interactions&lt;br /&gt;
*Has &amp;quot;scholarship&amp;quot; to waive membership fees, but that does not count as a fund.&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14404</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14404"/>
		<updated>2017-02-28T21:27:59Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=Benjamin's Desk=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Coworking space for mobile technology&lt;br /&gt;
*Showcases work to potential investors, does not invest on its own&lt;br /&gt;
*Perks with membership&lt;br /&gt;
=GSV Labs=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*invests with listed partners&lt;br /&gt;
*over $250m raised in VC 2015&lt;br /&gt;
*Founded 2012&lt;br /&gt;
*focuses: big-data, edtech, entertainment, sustainability, and mobile&lt;br /&gt;
*does not seem to co-invest&lt;br /&gt;
*partners with silicon valley investors&lt;br /&gt;
=Founders Floor=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We are a seed stage investor in technology companies through our private syndicate fund.&amp;quot;&lt;br /&gt;
*Co-invests? yes &amp;quot;For larger seed rounds we regularly introduce our portfolio companies to in-network seed stage VC’s for co-investment opportunities&amp;quot;&lt;br /&gt;
*Also has a pseudo-accelerator and coworking space&lt;br /&gt;
*Founded fund in 2014, classified as micro VC(&amp;lt;&amp;lt;$100m)&lt;br /&gt;
=CyberTech=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*global space for cybersecurity and IoT&lt;br /&gt;
*paid membership, plus an incubator program&lt;br /&gt;
*hosts events around the US&lt;br /&gt;
=Innovation Pavilion=&lt;br /&gt;
In-house operating fund?  NO&lt;br /&gt;
*Never mentions in-house VC&lt;br /&gt;
*place for connecting VC and startups&lt;br /&gt;
*&amp;quot;At the heart of Innovation Pavilion is the entrepreneur.  We have a network  consisting of investors, supply chain manufacturers,  service providers and professional services  like  financial, legal and marketing. We also encourage collaboration through events and workshops at our facilities.&amp;quot; No onsite VC&lt;br /&gt;
=BestHQ=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*connects startups and VC&lt;br /&gt;
*collaboration/coworking space&lt;br /&gt;
*paid membership&lt;br /&gt;
=Work Hard Pittsburgh=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Current model launched in 2016, serves as a business incubator and a co-working space&lt;br /&gt;
*No mention of VC&lt;br /&gt;
=Tampa Bay Wave=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*space connecting entrepreneurs and startups&lt;br /&gt;
*began a startup accelerator that launched in 2013&lt;br /&gt;
=Think Big Partners=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Think Big Partners helps companies grow faster, smarter and more efficiently through our network of in-house services, community members and national partners. We push the boundaries of the status quo and help entrepreneurs create game changing companies and technologies.&amp;quot;&lt;br /&gt;
*Not much more information on website than that, definitely no on-site VC&lt;br /&gt;
=Hacker Lab=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*coworking space with classes, a &amp;quot;hacker space&amp;quot;, and a &amp;quot;maker space&amp;quot;&lt;br /&gt;
*Does not invest&lt;br /&gt;
=Geekdom=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Has its own &amp;quot;Geekdom Fund&amp;quot; which invests in early stage IT startups&lt;br /&gt;
*focus: IT, tech&lt;br /&gt;
*Raised $6.68m in 2014, classifies as micro VC&lt;br /&gt;
*coinvests with partner network&lt;br /&gt;
=Epicenter=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*part of the Google for Entrepreneurs initiative, strictly a working space&lt;br /&gt;
*connects entrepreneurs to coworking space with hands-on mentoring&lt;br /&gt;
=Awesome Inc=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates funds as a Micro VC to help startups partner with other companies&lt;br /&gt;
*Raised about $4.5m from 2015-2016, founded in 2016&lt;br /&gt;
*Focuses on education, many programs to teach high schoolers and middle schoolers basic STEM&lt;br /&gt;
*Reasonable to expect co-investing with these circumstances&lt;br /&gt;
=Learn Launch=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates a small fund for seed investments into its Ed-tech accelerator&lt;br /&gt;
*Micro VC providing up to $120k to each startup and working with VC partners to provide more&lt;br /&gt;
*focus: education&lt;br /&gt;
*Fund began as late as 2016, 2017 marks the second Accelerator fund&lt;br /&gt;
=Catapult Chicago=&lt;br /&gt;
In-house operating fund?&lt;br /&gt;
=Velocity=&lt;br /&gt;
=Tech Ranch=&lt;br /&gt;
=ReSET=&lt;br /&gt;
=The Atlanta Tech Village=&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14403</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14403"/>
		<updated>2017-02-28T21:24:49Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=Benjamin's Desk=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Coworking space for mobile technology&lt;br /&gt;
*Showcases work to potential investors, does not invest on its own&lt;br /&gt;
*Perks with membership&lt;br /&gt;
=GSV Labs=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*invests with listed partners&lt;br /&gt;
*over $250m raised in VC 2015&lt;br /&gt;
*Founded 2012&lt;br /&gt;
*focuses: big-data, edtech, entertainment, sustainability, and mobile&lt;br /&gt;
*does not seem to co-invest&lt;br /&gt;
*partners with silicon valley investors&lt;br /&gt;
=Founders Floor=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We are a seed stage investor in technology companies through our private syndicate fund.&amp;quot;&lt;br /&gt;
*Co-invests? yes &amp;quot;For larger seed rounds we regularly introduce our portfolio companies to in-network seed stage VC’s for co-investment opportunities&amp;quot;&lt;br /&gt;
*Also has a pseudo-accelerator and coworking space&lt;br /&gt;
*Founded fund in 2014, classified as micro VC(&amp;lt;&amp;lt;$100m)&lt;br /&gt;
=CyberTech=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*global space for cybersecurity and IoT&lt;br /&gt;
*paid membership, plus an incubator program&lt;br /&gt;
*hosts events around the US&lt;br /&gt;
=Innovation Pavilion=&lt;br /&gt;
In-house operating fund?  NO&lt;br /&gt;
*Never mentions in-house VC&lt;br /&gt;
*place for connecting VC and startups&lt;br /&gt;
*&amp;quot;At the heart of Innovation Pavilion is the entrepreneur.  We have a network  consisting of investors, supply chain manufacturers,  service providers and professional services  like  financial, legal and marketing. We also encourage collaboration through events and workshops at our facilities.&amp;quot; No onsite VC&lt;br /&gt;
=BestHQ=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*connects startups and VC&lt;br /&gt;
*collaboration/coworking space&lt;br /&gt;
*paid membership&lt;br /&gt;
=Work Hard Pittsburgh=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Current model launched in 2016, serves as a business incubator and a co-working space&lt;br /&gt;
*No mention of VC&lt;br /&gt;
=Tampa Bay Wave=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*space connecting entrepreneurs and startups&lt;br /&gt;
*began a startup accelerator that launched in 2013&lt;br /&gt;
=Think Big Partners=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Think Big Partners helps companies grow faster, smarter and more efficiently through our network of in-house services, community members and national partners. We push the boundaries of the status quo and help entrepreneurs create game changing companies and technologies.&amp;quot;&lt;br /&gt;
*Not much more information on website than that, definitely no on-site VC&lt;br /&gt;
=Hacker Lab=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*coworking space with classes, a &amp;quot;hacker space&amp;quot;, and a &amp;quot;maker space&amp;quot;&lt;br /&gt;
*Does not invest&lt;br /&gt;
=Geekdom=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Has its own &amp;quot;Geekdom Fund&amp;quot; which invests in early stage IT startups&lt;br /&gt;
*focus: IT, tech&lt;br /&gt;
*Raised $6.68m in 2014, classifies as micro VC&lt;br /&gt;
*coinvests with partner network&lt;br /&gt;
=Epicenter=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*part of the Google for Entrepreneurs initiative, strictly a working space&lt;br /&gt;
*connects entrepreneurs to coworking space with hands-on mentoring&lt;br /&gt;
=Awesome Inc=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*operates funds as a Micro VC to help startups partner with other companies&lt;br /&gt;
*Raised about $4.5m from 2015-2016, founded in 2016&lt;br /&gt;
*Focuses on education, many programs to teach high schoolers and middle schoolers basic STEM&lt;br /&gt;
*Reasonable to expect co-investing with these circumstances&lt;br /&gt;
=Learn Launch=&lt;br /&gt;
=Catapult Chicago=&lt;br /&gt;
=Velocity=&lt;br /&gt;
=Tech Ranch=&lt;br /&gt;
=ReSET=&lt;br /&gt;
=The Atlanta Tech Village=&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14402</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14402"/>
		<updated>2017-02-28T21:07:08Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=Benjamin's Desk=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Coworking space for mobile technology&lt;br /&gt;
*Showcases work to potential investors, does not invest on its own&lt;br /&gt;
*Perks with membership&lt;br /&gt;
=GSV Labs=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*invests with listed partners&lt;br /&gt;
*over $250m raised in VC 2015&lt;br /&gt;
*Founded 2012&lt;br /&gt;
*focuses: big-data, edtech, entertainment, sustainability, and mobile&lt;br /&gt;
*does not seem to co-invest&lt;br /&gt;
*partners with silicon valley investors&lt;br /&gt;
=Founders Floor=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We are a seed stage investor in technology companies through our private syndicate fund.&amp;quot;&lt;br /&gt;
*Co-invests? yes &amp;quot;For larger seed rounds we regularly introduce our portfolio companies to in-network seed stage VC’s for co-investment opportunities&amp;quot;&lt;br /&gt;
*Also has a pseudo-accelerator and coworking space&lt;br /&gt;
*Founded fund in 2014, classified as micro VC(&amp;lt;&amp;lt;$100m)&lt;br /&gt;
=CyberTech=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*global space for cybersecurity and IoT&lt;br /&gt;
*paid membership, plus an incubator program&lt;br /&gt;
*hosts events around the US&lt;br /&gt;
=Innovation Pavilion=&lt;br /&gt;
In-house operating fund?  NO&lt;br /&gt;
*Never mentions in-house VC&lt;br /&gt;
*place for connecting VC and startups&lt;br /&gt;
*&amp;quot;At the heart of Innovation Pavilion is the entrepreneur.  We have a network  consisting of investors, supply chain manufacturers,  service providers and professional services  like  financial, legal and marketing. We also encourage collaboration through events and workshops at our facilities.&amp;quot; No onsite VC&lt;br /&gt;
=BestHQ=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*connects startups and VC&lt;br /&gt;
*collaboration/coworking space&lt;br /&gt;
*paid membership&lt;br /&gt;
=Work Hard Pittsburgh=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Current model launched in 2016, serves as a business incubator and a co-working space&lt;br /&gt;
*No mention of VC&lt;br /&gt;
=Tampa Bay Wave=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*space connecting entrepreneurs and startups&lt;br /&gt;
*began a startup accelerator that launched in 2013&lt;br /&gt;
=Think Big Partners=&lt;br /&gt;
In-house operating fund?&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14401</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14401"/>
		<updated>2017-02-28T20:57:12Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=Benjamin's Desk=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Coworking space for mobile technology&lt;br /&gt;
*Showcases work to potential investors, does not invest on its own&lt;br /&gt;
*Perks with membership&lt;br /&gt;
=GSV Labs=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*invests with listed partners&lt;br /&gt;
*over $250m raised in VC 2015&lt;br /&gt;
*Founded 2012&lt;br /&gt;
*focuses: big-data, edtech, entertainment, sustainability, and mobile&lt;br /&gt;
*does not seem to co-invest&lt;br /&gt;
*partners with silicon valley investors&lt;br /&gt;
=Founders Floor=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We are a seed stage investor in technology companies through our private syndicate fund.&amp;quot;&lt;br /&gt;
*Co-invests? yes &amp;quot;For larger seed rounds we regularly introduce our portfolio companies to in-network seed stage VC’s for co-investment opportunities&amp;quot;&lt;br /&gt;
*Also has a pseudo-accelerator and coworking space&lt;br /&gt;
*Founded fund in 2014, classified as micro VC(&amp;lt;&amp;lt;$100m)&lt;br /&gt;
=CyberTech=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*global space for cybersecurity and IoT&lt;br /&gt;
*paid membership, plus an incubator program&lt;br /&gt;
*hosts events around the US&lt;br /&gt;
=Innovation Pavilion=&lt;br /&gt;
In-house operating fund?  NO&lt;br /&gt;
*Never mentions in-house VC&lt;br /&gt;
*place for connecting VC and startups&lt;br /&gt;
*&amp;quot;At the heart of Innovation Pavilion is the entrepreneur.  We have a network  consisting of investors, supply chain manufacturers,  service providers and professional services  like  financial, legal and marketing. We also encourage collaboration through events and workshops at our facilities.&amp;quot; No onsite VC&lt;br /&gt;
=BestHQ=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*connects startups and VC&lt;br /&gt;
*collaboration/coworking space&lt;br /&gt;
*paid membership&lt;br /&gt;
=Work Hard Pittsburgh=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Current model launched in 2016, serves as a business incubator and a co-working space&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14400</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14400"/>
		<updated>2017-02-28T20:50:38Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=Benjamin's Desk=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Coworking space for mobile technology&lt;br /&gt;
*Showcases work to potential investors, does not invest on its own&lt;br /&gt;
*Perks with membership&lt;br /&gt;
=GSV Labs=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*invests with listed partners&lt;br /&gt;
*over $250m raised in VC 2015&lt;br /&gt;
*Founded 2012&lt;br /&gt;
*focuses: big-data, edtech, entertainment, sustainability, and mobile&lt;br /&gt;
*does not seem to co-invest&lt;br /&gt;
*partners with silicon valley investors&lt;br /&gt;
=Founders Floor=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We are a seed stage investor in technology companies through our private syndicate fund.&amp;quot;&lt;br /&gt;
*Co-invests? yes &amp;quot;For larger seed rounds we regularly introduce our portfolio companies to in-network seed stage VC’s for co-investment opportunities&amp;quot;&lt;br /&gt;
*Also has a pseudo-accelerator and coworking space&lt;br /&gt;
*Founded fund in 2014, classified as micro VC&lt;br /&gt;
=CyberTech=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*global space for cybersecurity and IoT&lt;br /&gt;
*paid membership, plus an incubator program&lt;br /&gt;
*hosts events around the US&lt;br /&gt;
=Innovation Pavillion=&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14399</id>
		<title>Hub-Based Venture Firms</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hub-Based_Venture_Firms&amp;diff=14399"/>
		<updated>2017-02-28T20:38:56Z</updated>

		<summary type="html">&lt;p&gt;Shrey: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{AcademicPaper&lt;br /&gt;
|Title=Hub-Based Venture Firms&lt;br /&gt;
|Status=In development&lt;br /&gt;
}}&lt;br /&gt;
[[Hubs]]&lt;br /&gt;
&lt;br /&gt;
=Capital Factory=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*https://angel.co/capital-factory-fund-4&lt;br /&gt;
*&amp;quot;The Capital Factory Fund comes from Austin's most prominent entrepreneurs and investors and only invests in Austin-based tech startups. Investors from around the world are participating who want to dip their toe in the booming Austin market and identify future individual investments. If you are bullish on Austin, then you want to be in this fund.&amp;quot;&lt;br /&gt;
*Takes in new investors each round&lt;br /&gt;
*co-invests based on partners&lt;br /&gt;
*Founding date unclear, but Capital Factory founded 2009&lt;br /&gt;
=1871=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Space where companies, ideas, startups come together to share experience&lt;br /&gt;
*Has partnerships with VC, but does not seem to have own fund&lt;br /&gt;
=1776=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*&amp;quot;We connect our startups to the latest wisdom on how to build highly scalable businesses through our curriculum. To expert mentors who can help startups quickly solve problems. To markets through our institutional and corporate partners. To capital through our investor network and the 1776 Seed Fund.&amp;quot;&lt;br /&gt;
*Incubator with own seed fund&lt;br /&gt;
*http://www.bizjournals.com/washington/blog/techflash/2015/09/calling-all-startups-1776-closes-first-seed-fund.html&lt;br /&gt;
*Closed first seed fund at $12.5m late 2015&lt;br /&gt;
*Focuses on government in energy, health, education, sustainability, transportation, and smart_tech&lt;br /&gt;
=American Underground=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*Invested in Groundfloor for $1m in 2014, classifies as a &amp;quot;Micro VC&amp;quot; so reasonably, has a small personal fund&lt;br /&gt;
*Looks like it co-invests&lt;br /&gt;
*Mainly a campus space for startups and entrepreneurs to congregate&lt;br /&gt;
*&amp;quot;Google for Entrepreneurs enables tech hubs by providing them with technical content, business tools, and infrastructure upgrades so that they can support increasing demand from developers and startups.&amp;quot;&lt;br /&gt;
*Focus: tech&lt;br /&gt;
=Galvanize=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*Lessons for programming, data science, etc.&lt;br /&gt;
*University of New Haven&lt;br /&gt;
*Locations across US, online classes&lt;br /&gt;
=Rocket Space=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*&amp;quot;Corporate membership&amp;quot; for various perks, connections to network of startups&lt;br /&gt;
*pseudo-accelerator program of some sort, has various program, so might make some small investments in the program&lt;br /&gt;
*Focus: tech &lt;br /&gt;
=Betamore=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*has bootcamps for rising VC analysts, but no fund of its own to invest&lt;br /&gt;
*programming classes for SQL, etc.&lt;br /&gt;
*Became nonprofit 2015, partners with VC in Baltimore and other locations&lt;br /&gt;
=Packard Place=&lt;br /&gt;
In-house operating fund? UNLIKELY&lt;br /&gt;
*Uses old space of Packard Place Motors, converted into an innovation and acceleration space&lt;br /&gt;
*Claims to have multiple accelerator programs in Charlotte&lt;br /&gt;
*Mentions nothing about on-site VC&lt;br /&gt;
=The Venture Center=&lt;br /&gt;
In-house operating fund? YES&lt;br /&gt;
*VC fund based in Arkansas&lt;br /&gt;
*Founded in 2013, raised $19m in revenue the past year&lt;br /&gt;
*focuses on acceleration and mentorship as a VC&lt;br /&gt;
*focus: technology commercialization&lt;br /&gt;
=The Idea Village=&lt;br /&gt;
In-house operating fund? NO&lt;br /&gt;
*No information on VC on-site&lt;br /&gt;
*Just an ecosystem with mentorship and opportunities&lt;br /&gt;
=&lt;/div&gt;</summary>
		<author><name>Shrey</name></author>
		
	</entry>
</feed>