Changes

Jump to navigation Jump to search
2,400 bytes added ,  12:16, 2 November 2017
no edit summary
<onlyinclude>[[Veeral Shah]] [[Work Logs]] [[Veeral Shah (Work Log)|(log page)]]</onlyinclude>
 [[Veeral Shah]] [[Work Logs]] [[Veeral Shah (Work Log)|(log page)]] =Summer 2016=
06/01/2016 - Set Up Work Log Page
06/17/2016 - Completing the collection of Accelerator Data
=Summer 2017=
Working on [[Accelerator Seed List (Data)]]- Under "Discussion"
06/05/2017 - First day of work of the summer. Began reading and deciphering the Seed Accelerator Project notes and files. Started writing my plan for the project this summer under Discussion on the Seed Accelerator Project page.
06/14/2017 - Discovered Global Accelerator Network site (Goldmine!) which has several Accelerators we do not have. Going through html to find a method for crawling through the site.
06/15/2017 - Gave the GAN parser spec to Abhi. He is working on creating the a parser for me to obtain all the data from the site. Meanwhile, I am continuing to manually trek through the potential accelerators from the 2013 crunchbase match.  06/16/2017 - Obtained the GAN information! It is in a textpad file on the Accelerator Seed List data discussion page. Yet to do anything with it. 06/20/2017 - 06/23/2017 - Continued and finished going manually through all the rest of the potential accelerators from Crunchbase. Location of all matches is on the Accelerator Match Excel page on the RDP Accelerator page. 06/26/2017 - Identified all the different sources from which the Current Accelerator list was extracted. Preparing to cross check across sources make final check for Accelerators. Also, analyzed the process by which cohorts could be extracted and organized for new accelerators. 06/27/2017 - 06/29/2017 - Manually went through the websites of new accelerators with Joe and extracted cohorts. 07/10/2017 - 7/11/2017 - Completed the acquisition of all the cohorts for each new accelerator company obtained from the Crunchbase 2013 Snapshot and organized and standardized the structure of each cohort txt file so it was compatible with Peter's cohort parser code. 7/12/2017 - Ran Peter's cohort parser code (E:\McNair\Projects\Accelerators\Code+Final_Data) on the New Crunchbase Accelerator Cohorts Folder in E:\McNair\Projects\Accelerators\Data. Combined the new cohort list with the already existing list in Cleaned Cohort Data.xls and formed a new and updated list called Updated Cleaned Cohort Data 7-12-2017 in the Accelerator folder. 7/13/2017 - 7/14/2017 - Read up documentation on Matcher and Whois Parser and worked with Adrian to run the Matcher and put together data sets.
067/1617/2017 - Used the Normalizer perl tool to normalize the VC Data obtained from SDC Platinum and then ran the Matcher on the VC Data and the Cleaned Cohort Data to match cohort companies with investment information.
7/18/2017 - 7/19/2017 - Tried to clean up the VC Data because the SQL code is not acquiring all of the results. Read up on the documentation of the Whois Parser to prepare data to use it.
7/24/2017 - Found duplicates/inconsistencies in the cohort data and fixed them. Trying to acquire site domains from Organizations data to use for Whois Parser.
7/25/2017 -
[[Category:Work Log]]

Navigation menu