Changes

162 bytes added, 13:47, 21 September 2020
{{Project|Has project output=Tool|Has sponsor=McNair Center
|Has title=Demo Day Page Parser
|Has owner=Peter Jalbert,
|Has project status=Subsume
}}
 
==Project Specs==
The goal of this project is to use data mining with Selenium and machine learning to identify good candidate web pages for accelerator Demo Days. Relevant information on the project can be found on the [http://mcnair.bakerinstitute.org/wiki/Accelerator_Data Accelerator Data] page.
A list of accelerators, queries, and URLs:
ListOfAccs.txt

The full list of potential keywords, used as search terms to match against the text versions of news articles and to throw out irrelevant results:
KeywordsCohortAndAcceleratorsFullList.txt

A file with the names of the results that passed keyword matching:
DemoDayHitsFull.txt
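
The keyword-matching step can be illustrated with a minimal sketch. This is not the project's actual code: the function names (<code>load_keywords</code>, <code>matched_keywords</code>, <code>passes_keyword_filter</code>), the one-keyword-per-line file layout, and the <code>min_hits</code> threshold are all assumptions made for illustration.

```python
import re


def load_keywords(lines):
    """Return lowercase keywords from an iterable of lines, skipping blank lines.

    Assumes one keyword or phrase per line (hypothetical layout for the keyword file).
    """
    return [line.strip().lower() for line in lines if line.strip()]


def matched_keywords(text, keywords):
    """Return the keywords that appear in text as whole words or phrases, ignoring case."""
    lowered = text.lower()
    hits = []
    for kw in keywords:
        # The lookarounds require a non-word character (or the text boundary)
        # on both sides, so "cohort" does not match inside "cohorts".
        pattern = r"(?<!\w)" + re.escape(kw) + r"(?!\w)"
        if re.search(pattern, lowered):
            hits.append(kw)
    return hits


def passes_keyword_filter(text, keywords, min_hits=1):
    """Return True if at least min_hits distinct keywords occur in text (assumed rule)."""
    return len(matched_keywords(text, keywords)) >= min_hits
```

Under these assumptions, the names of the text files for which <code>passes_keyword_filter</code> returns True would be the ones written to the hits file.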
 
A file with an analysis of the most frequent matched words in each text file:
topWordsFull.txt
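
The frequency analysis can be sketched in the same spirit. The function name <code>top_matched_words</code>, the tokenization rule, and the restriction to single-word keywords are assumptions for illustration; the actual format of topWordsFull.txt is not specified here.

```python
import re
from collections import Counter


def top_matched_words(text, keywords, n=5):
    """Return the n most frequent single-word keywords in text as (word, count) pairs."""
    wanted = {kw.lower() for kw in keywords}
    # Split the lowercased text into runs of letters, digits, and apostrophes.
    words = re.findall(r"[a-z0-9']+", text.lower())
    # Count only the tokens that are in the keyword set.
    counts = Counter(w for w in words if w in wanted)
    return counts.most_common(n)
```

Running this once per text file and writing out the resulting pairs would give a per-file summary of the most frequently matched words.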
==Faulty Results==
