Seed DB Crawler
Revision as of 16:04, 31 July 2018 by Leminh.ams (talk | contribs)
Seed DB Crawler | |
---|---|
Project Information | |
Project Title | Seed DB Parser |
Owner | Minh Le, Maxine Tao, Connor Rothschild |
Start Date | |
Deadline | |
Primary Billing | |
Notes | |
Has project status | |
Copyright © 2016 edegan.com. All Rights Reserved. |
Location
E:\McNair\Projects\Seed DB\parser.py
ListOfAccs.txt - input file containing a list of accelerators that we want to pull information on
Functionality
Uses Selenium Webdriver to pull cohort companies, timing info from SeedDB website
SeedDB is structured so that there is a page containing a list of accelerators. If you click on an accelerator name, you are then taken to another page of all their cohorts. This second page of all cohorts for each accelerator is stored in a folder called seedDBhtml.