Changes

Jump to navigation Jump to search
We fixed up and ran E:\projects\accelerators\Google\DemoDayCrawler.py
This script was based on E:\mcnair\Software\Accelerators\DemoDayCrawler.py, rather than the more recent E:\mcnair\Projects\Accelerator Demo Day\Test Run\STEP1_crawl.py
 
The output is:
*E:\projects\accelerators\Google\Results.txt 2515
*E:\projects\accelerators\Google\Results folder containing html
Previously run Google search results are in:
*5 results per accelerator -- E:\mcnair\Software\Accelerators\demoday_crawl_full.txt2777*10 results per accelerator -- E:\mcnair\Projects\Accelerator Demo Day\Test Run\demoday_crawl_full_from_testrun.txt4351*10 results per select accelerator year -- E:\mcnair\Projects\Accelerator Demo Day\Test Run\demoday_crawl_full.txt1230 These were all copied to Z:\accelerators and cleaned up, and loaded along with the new Results.txt into '''accelerators'''. The SQL is in E:\projects\accelerators\LoadAcceleratorTables.sql However, it seems that we have found EVERY page before. There is therefore nothing to Turk. ====Other info====
Found the following list of accelerators by accident: https://www.s-b-z.com/FORMING%20THE%20BUSINESS/db/accelerators.aspx

Navigation menu