The goal of this project is to leverage data mining with Selenium and Machine Learning to get good candidate web pages for Demo Days for accelerators. Relevant information on the project can be found on the [http://mcnair.bakerinstitute.org/wiki/Accelerator_Data Accelerator Data] page.
ListOfAccs.txt
The full list of potential keywords (used for throwing out irrelevant results)search terms to match with the text versions of news articles: KeywordsCohortAndAcceleratorsFullList.txt
A list of accelerators, queries, and urls:
A file with the name of the results that passed keyword matching:
DemoDayHitsFull.txt
A file with an analysis of the most frequent matched words in each text file: