<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://www.edegan.com/mediawiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Mringheanu</id>
	<title>edegan.com - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="http://www.edegan.com/mediawiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Mringheanu"/>
	<link rel="alternate" type="text/html" href="http://www.edegan.com/wiki/Special:Contributions/Mringheanu"/>
	<updated>2026-05-17T21:13:34Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.34.2</generator>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22330</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22330"/>
		<updated>2017-12-07T22:24:22Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions.&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files.&lt;br /&gt;
&lt;br /&gt;
11/16/2017 3:00-5:00 pm&lt;br /&gt;
*Continued to input tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/20/2017 2:00-5:00 pm&lt;br /&gt;
*Cleaned text files in order to import tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/27/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Peter to find and exclude irrelevant keywords on HTML pages. Began categorizing relevant demo day pages.&lt;br /&gt;
&lt;br /&gt;
11/28/2017 3:00-5:00 pm&lt;br /&gt;
*Finished inputting tables of relevant files into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/29/2017 2:00-5:00 pm&lt;br /&gt;
*Went through accelerator HTML URLs. Spoke with Ed about going through HTMLs and classifying based on overall and specific relevance.&lt;br /&gt;
&lt;br /&gt;
12/1/2017 3:00-5:00 pm&lt;br /&gt;
*Worked through accelerator links and classified pages based on whether or not they provided relevant information about startup timing.&lt;br /&gt;
&lt;br /&gt;
12/4/2017 10:00-12:00 pm&lt;br /&gt;
*Continued running through demo day crawl URLs and scoring them based on relevance.&lt;br /&gt;
&lt;br /&gt;
12/7/2017 1:00-4:30 pm&lt;br /&gt;
*Finalized scoring of demo day URLs for the original crawl. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22273</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22273"/>
		<updated>2017-12-04T17:53:54Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions.&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files.&lt;br /&gt;
&lt;br /&gt;
11/16/2017 3:00-5:00 pm&lt;br /&gt;
*Continued to input tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/20/2017 2:00-5:00 pm&lt;br /&gt;
*Cleaned text files in order to import tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/27/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Peter to find and exclude irrelevant keywords on HTML pages. Began categorizing relevant demo day pages.&lt;br /&gt;
&lt;br /&gt;
11/28/2017 3:00-5:00 pm&lt;br /&gt;
*Finished inputting tables of relevant files into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/29/2017 2:00-5:00 pm&lt;br /&gt;
*Went through accelerator HTML URLs. Spoke with Ed about going through HTMLs and classifying based on overall and specific relevance.&lt;br /&gt;
&lt;br /&gt;
12/1/2017 3:00-5:00 pm&lt;br /&gt;
*Worked through accelerator links and classified pages based on whether or not they provided relevant information about startup timing.&lt;br /&gt;
&lt;br /&gt;
12/4/2017 10:00-12:00 pm&lt;br /&gt;
*Continued running through demo day crawl URLs and scoring them based on relevance.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=22270</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=22270"/>
		<updated>2017-12-04T17:13:24Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
&lt;br /&gt;
'''Original Search'''&lt;br /&gt;
&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
&lt;br /&gt;
'''Cohort Directory &amp;quot;Big Push&amp;quot;'''&lt;br /&gt;
&lt;br /&gt;
*'''Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\&lt;br /&gt;
**Description: This folder contains files for each of the accelerators that we searched through from the &amp;quot;List of Preliminary Accelerators&amp;quot;. There are three files per accelerator: 1) The &amp;quot;accelerator name.txt&amp;quot; file which contains each of the variables recorded by all of the McNair Center workers during our big push on the project winter 2016, 2) The .html file for the cohort page if the entry was indeed an accelerator and if the worker could find the cohort page on that accelerator, and 3) a &amp;quot;accelerator name.cohort.txt&amp;quot; file which contains a list of the cohort companies as well as all variables which were easily found alongside the cohort.&lt;br /&gt;
&lt;br /&gt;
*'''List of Python files'''&lt;br /&gt;
**'''parse_accelerator_data'''&lt;br /&gt;
**'''parse_cohort_data'''&lt;br /&gt;
**'''process_locations'''&lt;br /&gt;
**'''wayback_machine'''&lt;br /&gt;
**Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: These files contain the code which Peter used to categorize the data from the &amp;quot;Data Copy&amp;quot; folder in Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\, which is just a copy of our cleaned data file. From this code, Peter returned for us a list of accelerators categorized by their flag and a compiled list of all the cohort companies as well as the variables recorded by McNair workers.&lt;br /&gt;
**'''Note''': We manually altered the cohort data which came out of Peter's code so that we could homogenize the formatting. This resulted in a unique cohort file which will not be replicated when running the code again. On the other hand, we manually altered the individual txt files for the accelerators to fix format so running Peter's code again should result in a similar file.&lt;br /&gt;
&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
&lt;br /&gt;
*'''First_Incomplete_PercentVC_Table'''&lt;br /&gt;
**Original Location: Bulk(Z:)\Accelerators&lt;br /&gt;
**Description: The VC percentage raise rate for 198 accelerators. At this point we realized we were missing almost 100 accelerators, so we decided to expand our list and gather more data.&lt;br /&gt;
**Variables: Accelerator Name, Number of Cohort Companies, Number of VC Backed Cohort Companies, Raise rate percentage&lt;br /&gt;
&lt;br /&gt;
'''Refining the List'''&lt;br /&gt;
&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
**Potential Crunchbase Variables&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name, Whois parser code&lt;br /&gt;
&lt;br /&gt;
'''Additional Variables'''&lt;br /&gt;
&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22266</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22266"/>
		<updated>2017-12-01T22:45:39Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions.&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files.&lt;br /&gt;
&lt;br /&gt;
11/16/2017 3:00-5:00 pm&lt;br /&gt;
*Continued to input tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/20/2017 2:00-5:00 pm&lt;br /&gt;
*Cleaned text files in order to import tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/27/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Peter to find and exclude irrelevant keywords on HTML pages. Began categorizing relevant demo day pages.&lt;br /&gt;
&lt;br /&gt;
11/28/2017 3:00-5:00 pm&lt;br /&gt;
*Finished inputting tables of relevant files into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/29/2017 2:00-5:00 pm&lt;br /&gt;
*Went through accelerator HTML URLs. Spoke with Ed about going through HTMLs and classifying based on overall and specific relevance.&lt;br /&gt;
&lt;br /&gt;
12/1/2017 3:00-5:00 pm&lt;br /&gt;
*Worked through accelerator links and classified pages based on whether or not they provided relevant information about startup timing.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22213</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22213"/>
		<updated>2017-11-29T22:59:34Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions.&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files.&lt;br /&gt;
&lt;br /&gt;
11/16/2017 3:00-5:00 pm&lt;br /&gt;
*Continued to input tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/20/2017 2:00-5:00 pm&lt;br /&gt;
*Cleaned text files in order to import tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/27/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Peter to find and exclude irrelevant keywords on HTML pages. Began categorizing relevant demo day pages.&lt;br /&gt;
&lt;br /&gt;
11/28/2017 3:00-5:00 pm&lt;br /&gt;
*Finished inputting tables of relevant files into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/29/2017 2:00-5:00 pm&lt;br /&gt;
*Went through accelerator HTML URLs. Spoke with Ed about going through HTMLs and classifying based on overall and specific relevance.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22152</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22152"/>
		<updated>2017-11-28T22:49:19Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions.&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files.&lt;br /&gt;
&lt;br /&gt;
11/16/2017 3:00-5:00 pm&lt;br /&gt;
*Continued to input tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/20/2017 2:00-5:00 pm&lt;br /&gt;
*Cleaned text files in order to import tables into SQL database.&lt;br /&gt;
&lt;br /&gt;
11/27/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Peter to find and exclude irrelevant keywords on HTML pages. Began categorizing relevant demo day pages.&lt;br /&gt;
&lt;br /&gt;
11/28/2017 3:00-5:00 pm&lt;br /&gt;
*Finished inputting tables of relevant files into SQL database.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22082</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=22082"/>
		<updated>2017-11-27T22:51:21Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions.&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files.&lt;br /&gt;
&lt;br /&gt;
11/16/2017 3:00-5:00 pm&lt;br /&gt;
*Continued to input tables into SQL database&lt;br /&gt;
&lt;br /&gt;
11/20/2017 2:00-5:00 pm&lt;br /&gt;
*Cleaned text files in order to import tables into SQL database&lt;br /&gt;
&lt;br /&gt;
11/27/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Peter to find and exclude irrelevant keywords on HTML pages. Began categorizing relevant demo day pages.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21995</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21995"/>
		<updated>2017-11-20T22:54:26Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions.&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files.&lt;br /&gt;
&lt;br /&gt;
11/16/2017 3:00-5:00 pm&lt;br /&gt;
*Continued to input tables into SQL database&lt;br /&gt;
&lt;br /&gt;
11/20/2017 2:00-5:00 pm&lt;br /&gt;
*Cleaned text files in order to import tables into SQL database&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21958</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21958"/>
		<updated>2017-11-16T22:52:52Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions.&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files.&lt;br /&gt;
&lt;br /&gt;
11/16/2017 3:00-5:00 pm&lt;br /&gt;
*Continued to input tables into SQL database&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21921</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21921"/>
		<updated>2017-11-15T23:00:17Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
11/15/2017 2:00-5:00 pm&lt;br /&gt;
*Created SQL database entitled &amp;quot;acceleratordata&amp;quot; and began creating tables from folder of All Relevant Files&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21881</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21881"/>
		<updated>2017-11-14T22:58:48Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions&lt;br /&gt;
&lt;br /&gt;
11/14/2017 3:00-5:00 pm&lt;br /&gt;
*Looked up URLs and decided whether or not the webiste was relevant.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21747</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21747"/>
		<updated>2017-11-09T22:59:07Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
11/9/2017 3:00-5:00 pm&lt;br /&gt;
*Created a new project page called Accelerator Data and listed all relevant files as well as descriptions&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21746</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21746"/>
		<updated>2017-11-09T22:36:47Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;br /&gt;
*'''founders_linkedin'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains founder data for each accelerator found by Peter when crawling LinkedIn.&lt;br /&gt;
**Variables: Accelerator name, Founder name, LinkedIn URL&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21745</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21745"/>
		<updated>2017-11-09T22:34:05Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;br /&gt;
**Variables: VCCompanies, Accelerator, Earliest Round Date&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21744</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21744"/>
		<updated>2017-11-09T22:33:13Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;br /&gt;
*'''Current Matched Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: Sheet 1 contains our matched data from matching our SDC pull with our cohort companies list found in '''Accelerator_Cohort_Companies'''. Sheet 2 removes the duplicates from the previous match. Sheet 3 contains the list of VCCompanies, which accelerator they went through, the date of their first investment. Sheet 4 contains our cohort list matched with the crunchbase organizations, but it contains too many duplicates to use.&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21743</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21743"/>
		<updated>2017-11-09T22:25:59Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name&lt;br /&gt;
*'''Accelerator_Cohort_Companies'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all cohort companies of all accelerators.&lt;br /&gt;
**Variables: Cohort Companies, Accelerator name&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21742</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21742"/>
		<updated>2017-11-09T22:21:20Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator Name, Company Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21741</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21741"/>
		<updated>2017-11-09T22:20:49Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator, Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This text file contains a master list of all current accelerators we have been working with.&lt;br /&gt;
**Variables: Accelerator name&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21740</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21740"/>
		<updated>2017-11-09T22:19:05Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Fall 2017&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables: Accelerator, Name, Description, Website, Industry, Location, Acquisition, Notes, Inverstors, Perks, Status, Funding Stage, Founder, Executive, Program, Cohort, Year&lt;br /&gt;
*'''Accelerator_Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This text file contains cleaned data on all of our current accelerators. This file was compiled by Veeral over Summer 2017. Some of these accelerators are not based in the United States.&lt;br /&gt;
**Variables: Accelerator, homepage_url, city, region, country_code, Creation date&lt;br /&gt;
*'''ListofAccs'''&lt;br /&gt;
**Original Location&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21739</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21739"/>
		<updated>2017-11-09T22:13:15Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*'''New Crunchbase Accelerators'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017&lt;br /&gt;
**Description: After conducting some SDC matches with cohort data from the cohort companies of the accelerators in the '''accelerator_data_noflag''' text file, we realized many potential accelerators were missing. We then got an Excel file from Crunchbase containing all of its organizations, which we then sorted to identify potential missing accelerators. The accelerators we were actually missing are in this Excel file.&lt;br /&gt;
**Variables: Names of Missing Accelerators&lt;br /&gt;
*'''Cleaned Cohort Data'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Summer 2017\Veeral&lt;br /&gt;
**Description: This Excel file contains all data on all cohort companies for our entire list of current accelerators. All missing accelerators were updated by Veeral and we have used this as our final list of cohort companies for all accelerators.&lt;br /&gt;
**Variables:&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21738</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21738"/>
		<updated>2017-11-09T22:02:21Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*'''List of Preliminary Accelerators'''&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*'''accelerator_data_noflag'''&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;br /&gt;
*&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21737</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21737"/>
		<updated>2017-11-09T22:01:43Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*List of Preliminary Accelerators&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
**Variables: Names of potential accelerators&lt;br /&gt;
*accelator_data_noflag&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21736</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21736"/>
		<updated>2017-11-09T22:01:01Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*List of Preliminary Accelerators&lt;br /&gt;
**Original Location: [[Accelerator Seed List (Data)]]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [[Accelerator Seed List (Data)]] for process.&lt;br /&gt;
*accelator_data_noflag&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21734</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21734"/>
		<updated>2017-11-09T22:00:35Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*List of Preliminary Accelerators&lt;br /&gt;
**Original Location: [Accelerator Seed List (Data)]&lt;br /&gt;
**Description: This is the very first master list we compiled of potential accelerators. Look to [Accelerator Seed List (Data)] for process.&lt;br /&gt;
*accelator_data_noflag&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;br /&gt;
**Variables: Name, Score, Flag, CohortURL, Address, Duration, Vintage, Industry, Description, Equity, Nonprofit, Notes&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21733</id>
		<title>Accelerator Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Accelerator_Data&amp;diff=21733"/>
		<updated>2017-11-09T21:55:40Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Has title=Composite Accelerator Data&lt;br /&gt;
|Has owner=Matthew Ringheanu, Shrey Agarwal,&lt;br /&gt;
|Has start date=Fall 2016&lt;br /&gt;
|Has deadline=Fall 2017&lt;br /&gt;
|Has keywords=Accelerator, Data&lt;br /&gt;
|Has notes=Continuation of [Accelerator Seed List (Data)]&lt;br /&gt;
|Has project status=Active&lt;br /&gt;
|Is dependent on=Accelerator Seed List (Data),&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Relevant Files=&lt;br /&gt;
==Location for All Relevant Files==&lt;br /&gt;
*All relevant files are located in Bulk(E:)\McNair\Projects\Accelerators\All Relevant Files&lt;br /&gt;
==List of All Relevant Files==&lt;br /&gt;
*accelator_data_noflag&lt;br /&gt;
**Original Location: Bulk(E:)\McNair\Projects\Accelerators\Spring 2017\Code+Final_Data&lt;br /&gt;
**Description: This text file contains the data on all accelerators that we found from our first round of research that were not flagged. It consolidates the data collected by all McNair Center interns, filtering out the organizations which are not accelerators.&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21678</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21678"/>
		<updated>2017-11-08T21:41:23Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
11/8/2017 2:00-3:30 pm&lt;br /&gt;
*Spoke to Ed and organized all of our current data.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21667</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21667"/>
		<updated>2017-11-07T22:51:56Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
11/7/2017 3:00-5:00 pm&lt;br /&gt;
*Finished coming up with keywords for demo day crawler. Sent the final list to Peter.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21664</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21664"/>
		<updated>2017-11-07T21:55:54Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm&lt;br /&gt;
*Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm&lt;br /&gt;
*Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm&lt;br /&gt;
*Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm&lt;br /&gt;
*Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm&lt;br /&gt;
*Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm&lt;br /&gt;
*Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm&lt;br /&gt;
*Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm&lt;br /&gt;
*Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm&lt;br /&gt;
*Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm&lt;br /&gt;
*Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm&lt;br /&gt;
*Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm&lt;br /&gt;
*Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm&lt;br /&gt;
*Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm&lt;br /&gt;
*Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm&lt;br /&gt;
*Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm&lt;br /&gt;
*Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm&lt;br /&gt;
*Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm&lt;br /&gt;
*Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm&lt;br /&gt;
*Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm&lt;br /&gt;
*Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm&lt;br /&gt;
*Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm&lt;br /&gt;
*Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm&lt;br /&gt;
*Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm&lt;br /&gt;
*Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm&lt;br /&gt;
*Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm&lt;br /&gt;
*Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm&lt;br /&gt;
*Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm&lt;br /&gt;
*Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm&lt;br /&gt;
*Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm&lt;br /&gt;
*Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm&lt;br /&gt;
*Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm&lt;br /&gt;
*Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm&lt;br /&gt;
*Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm&lt;br /&gt;
*Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm&lt;br /&gt;
*Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm&lt;br /&gt;
*Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm&lt;br /&gt;
*Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm&lt;br /&gt;
*Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm&lt;br /&gt;
*Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm&lt;br /&gt;
*Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm&lt;br /&gt;
*Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm&lt;br /&gt;
*Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm&lt;br /&gt;
*Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm&lt;br /&gt;
*Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm&lt;br /&gt;
*Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm&lt;br /&gt;
*Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm&lt;br /&gt;
*Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm&lt;br /&gt;
*Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm&lt;br /&gt;
*Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm&lt;br /&gt;
*Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm&lt;br /&gt;
*Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm&lt;br /&gt;
*Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm&lt;br /&gt;
*Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm&lt;br /&gt;
*Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm&lt;br /&gt;
*Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm&lt;br /&gt;
*Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm&lt;br /&gt;
*Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm&lt;br /&gt;
*Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm&lt;br /&gt;
*Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21663</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21663"/>
		<updated>2017-11-07T21:51:44Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm&lt;br /&gt;
*Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm&lt;br /&gt;
*Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm&lt;br /&gt;
*Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm&lt;br /&gt;
*Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm&lt;br /&gt;
*Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm&lt;br /&gt;
*Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm&lt;br /&gt;
*Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm&lt;br /&gt;
*Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm&lt;br /&gt;
*Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm&lt;br /&gt;
*Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm&lt;br /&gt;
*Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm&lt;br /&gt;
*Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm&lt;br /&gt;
*Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm&lt;br /&gt;
*Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm&lt;br /&gt;
*Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm:&lt;br /&gt;
*Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm&lt;br /&gt;
*Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm&lt;br /&gt;
*Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm&lt;br /&gt;
*Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm&lt;br /&gt;
*Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm&lt;br /&gt;
*Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm&lt;br /&gt;
*Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm&lt;br /&gt;
*Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm&lt;br /&gt;
*Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm&lt;br /&gt;
*Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm&lt;br /&gt;
*Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm&lt;br /&gt;
*Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm&lt;br /&gt;
*Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm&lt;br /&gt;
*Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21631</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21631"/>
		<updated>2017-11-06T21:58:04Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm: Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm: Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm: Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm: Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm: Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm: Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm: Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
11/6/2017 2:00-4:00 pm: Continued entering cohort company dates into Excel file. Began compiling a list of keywords for demo day press releases.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21539</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21539"/>
		<updated>2017-11-02T22:27:25Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Fall 2017===&lt;br /&gt;
&amp;lt;onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm: Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm: Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm: Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm: Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm: Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm: Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
11/2/2017 4:00-5:30 pm: Continued entering cohort company dates into Excel file.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/onlyinclude&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Spring 2017===&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
===Fall 2016===&lt;br /&gt;
&lt;br /&gt;
10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21457</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21457"/>
		<updated>2017-11-01T21:01:45Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm: Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm: Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm: Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm: Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm: Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
11/1/2017 2:00-4:00 pm: Finalized entering dates for Y Combinator cohort companies.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21416</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21416"/>
		<updated>2017-10-31T21:56:52Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm: Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm: Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm: Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm: Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
10/31/2017 3:00-5:00 pm: Began compiling data in the column for Date Company went through Accelerator.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21322</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21322"/>
		<updated>2017-10-30T20:24:15Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm: Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm: Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm: Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
10/30/2017 2:00-3:30 pm: Finished adding all of the accelerators to the list of VCCompanies. Added a column indicating whether or not the company went through two or more accelerators.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21189</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21189"/>
		<updated>2017-10-26T22:26:07Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm: Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm: Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
10/26/2017 3:30-5:30 pm: Continued going through list of VCCompanies and adding accelerators. Will have this completed on Monday.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21121</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21121"/>
		<updated>2017-10-25T21:52:12Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm: Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
10/25/2017 2:00-5:00 pm: Continued going through list of VCCompanies and adding accelerators.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21056</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21056"/>
		<updated>2017-10-24T21:56:16Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
10/24/2017 3:00-5:00 pm: Went through list of VCCompanies and began adding respective accelerators in order to proceed with VCPercentage table.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Catherine_Kirby&amp;diff=21053</id>
		<title>Catherine Kirby</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Catherine_Kirby&amp;diff=21053"/>
		<updated>2017-10-24T21:34:36Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Staff&lt;br /&gt;
|position=Research Team&lt;br /&gt;
|name=Catherine Kirby&lt;br /&gt;
|degree=BA&lt;br /&gt;
|major=Political Science&lt;br /&gt;
|class=2019&lt;br /&gt;
|join_date=09/08/2016&lt;br /&gt;
|skills=Korean, Writing, Spanish, Research&lt;br /&gt;
|interests=Public Policy, Cooking, Languages, Desserts&lt;br /&gt;
|fun_fact=I am a blackbelt in Taekwondo (supposedly)&lt;br /&gt;
|email=cak6@rice.edu&lt;br /&gt;
|status=Active&lt;br /&gt;
}}&lt;br /&gt;
Catherine Kirby is a 19 year old Rice student working as a Research Assistant for the James A. Baker III Institute for Public Policy's McNair Center for Entrepreneurship and Innovation. Catherine currently lives in the Houston area and attends Rice University. &lt;br /&gt;
&lt;br /&gt;
==Early Life==&lt;br /&gt;
&lt;br /&gt;
Catherine was born and raised in Dallas by parents Judy and Rangy Kirby. She has three younger siblings, Parker (18), Christine (15), and Andrew (12). Her father, Rangall, is a general and vascular surgeon and her mother, Judy, is an ophthalmologist. She attended the Hockaday School for eight years, graduating in 2015. Catherine began teaching herself Korean in ninth grade and later attended the National Security Language Initiative for Youth Korean Summer and studied at Sogang University in 2014.&lt;br /&gt;
&lt;br /&gt;
==Education==&lt;br /&gt;
&lt;br /&gt;
Catherine is a rising Sophomore majoring in Political Science and minoring in Business at Rice University. She resides at Baker College, making her the third Kirby to make Baker her home. &lt;br /&gt;
&lt;br /&gt;
==Organizational Involvement==&lt;br /&gt;
&lt;br /&gt;
Catherine serves as the Head of Formal Events for the Baker Institute Student Forum. In addition, she has also volunteered for the Leaders Program for the Partnership for the Advancement and Immersion of Refugees and for the Rice International Language Exchange. &lt;br /&gt;
&lt;br /&gt;
==Work Experience==&lt;br /&gt;
&lt;br /&gt;
Catherine worked as a policy intern at Glasshouse Policy, an Austin non-profit focusing on Texas state policy and city issues in 2015 with a stipend from the Rice University Center for Civic Leadership. In high school, Catherine volunteered teaching ESL, Korean to young Korean children, and Korean to foreign students. She has also subtitled a movie for the Busan International Film Festival and worked at Nothing Bundt Cakes. &lt;br /&gt;
&lt;br /&gt;
==Time at McNair==&lt;br /&gt;
&lt;br /&gt;
[[Catherine Kirby (Work Log)]]&lt;br /&gt;
[[Catherine Kirby (Research Plan)]]&lt;br /&gt;
[[Big Problems for Small Practices (Blog Post)]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21011</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=21011"/>
		<updated>2017-10-23T20:27:38Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
10/23/2017 2:00-3:30 pm: Worked on sorting out the discrepancies in our matched data.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20974</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20974"/>
		<updated>2017-10-20T20:27:46Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
10/20/2017 2:00-3:30 pm: Generated the new list of VCCompanies as well as their earliest round dates.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20953</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20953"/>
		<updated>2017-10-19T21:50:46Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
10/19/2017 3:00-5:00 pm: Organized all of our matched data and updated it in order to reflect the most recent SDC pull with Ed. Matched Crunchbase data with our cohort companies.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20871</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20871"/>
		<updated>2017-10-18T21:52:04Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
10/18/2017 2:00-5:00 pm: Updated our VC data with Ed's help in order to increase the accuracy and completion of our data.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20842</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20842"/>
		<updated>2017-10-17T21:46:03Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
10/17/2017 3:00-5:00 pm: Worked with Ben to find a solution to our problem of data acquisition. Finalized earliest round date for VCCompanies.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20826</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20826"/>
		<updated>2017-10-16T21:47:43Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
10/16/2017 2:00-3:30 pm: Continued working on sorting VCCompanies by their earliest round date.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Egg_Egan&amp;diff=20809</id>
		<title>Egg Egan</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Egg_Egan&amp;diff=20809"/>
		<updated>2017-10-16T18:51:04Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Staff&lt;br /&gt;
|position=Omelet&lt;br /&gt;
|name=Egg Eggan&lt;br /&gt;
|user_image=Egg.jpg&lt;br /&gt;
|degree=Medium Well&lt;br /&gt;
|class=Grade A&lt;br /&gt;
|skills=Skillet Maneuvering, cracking under pressure&lt;br /&gt;
|status=Cooked&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Ingredients=&lt;br /&gt;
# 3 eggs&lt;br /&gt;
# 1/2 cup deadweight loss&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20789</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20789"/>
		<updated>2017-10-12T21:42:54Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
10/12/2017 3:00-5:00 pm: Discovered that the Wayback Machine will not be a good option for finding when companies went through their accelerators. Created a list of VCCompanies and their earliest round date. Included a column for the date they went through their accelerators and will fill it in when we find a good method of finding this date.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20737</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20737"/>
		<updated>2017-10-11T20:16:19Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
10/11/2017 2:00-3:30 pm: Looked into using the WhoIs Parser in order to find when the companies went through their accelerators.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20677</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20677"/>
		<updated>2017-10-06T21:42:28Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
10/6/2017 3:00-5:00 pm: Went through the matched data. Brainstormed ways to get the dates for cohort companies going through accelerators.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Joe_Reilly&amp;diff=20675</id>
		<title>Joe Reilly</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Joe_Reilly&amp;diff=20675"/>
		<updated>2017-10-06T20:44:59Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Staff&lt;br /&gt;
|position=Research Team&lt;br /&gt;
|name=Joe Reilly&lt;br /&gt;
|degree=BA&lt;br /&gt;
|major=Economics; Managerial Studies&lt;br /&gt;
|class=2019&lt;br /&gt;
|join_date=6/20/17&lt;br /&gt;
|email=jpr4@rice.edu&lt;br /&gt;
|status=Active&lt;br /&gt;
|user_image=FullSizeRenderjoe.jpg&lt;br /&gt;
|degree=BA&lt;br /&gt;
|major=Economics; Managerial Studies&lt;br /&gt;
|class=2019&lt;br /&gt;
|join_date=20 June 2017&lt;br /&gt;
|skills=Excel, Writing, Research&lt;br /&gt;
|fun_fact=triplet&lt;br /&gt;
|email=jpr4@rice.edu&lt;br /&gt;
}}&lt;br /&gt;
Joe is a Research Assistant for the James A. Baker III Institute for Public Policy's McNair Center for Entrepreneurship and Innovation.  &lt;br /&gt;
&lt;br /&gt;
==Early Life==&lt;br /&gt;
Joe was born to Robert and Judy Reilly in Los Angeles, CA, on January 1, 1997.  He is a triplet, and has two brothers, Mike and Jack, who both attend Saint John's University in Minnesota.&lt;br /&gt;
&lt;br /&gt;
==Education==&lt;br /&gt;
Joe is a Junior at Duncan College, pursuing a degree in Economics, and in Managerial studies.&lt;br /&gt;
&lt;br /&gt;
==Work Experience==&lt;br /&gt;
During spring 2017 semester, Joe worked as a research intern at The Dietrich Law Firm in Houston, Texas.&lt;br /&gt;
==Time at McNair==&lt;br /&gt;
&lt;br /&gt;
[[Joe Reilly]] [[Work Logs]] [[Joe Reilly (Work Log)|(log page)]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20606</id>
		<title>Matthew Ringheanu (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matthew_Ringheanu_(Work_Log)&amp;diff=20606"/>
		<updated>2017-10-04T21:41:45Z</updated>

		<summary type="html">&lt;p&gt;Mringheanu: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;10/17/2016 2:00-5:00 pm: Created personal wiki page as well as work log; Read about the research project to which I have been assigned; Wrote a short summary of what I believe it is and included some helpful links&lt;br /&gt;
&lt;br /&gt;
10/18/2016 4:00-6:00 pm: Met with research partner Shrey who filled me in on where we are with the project; Began looking on websites of certain accelerators for how to determine their cohorts and listed these steps on the wiki&lt;br /&gt;
&lt;br /&gt;
10/19/2016 2:00-5:00 pm: Finished looking on the remaining accelerator websites and wrote the steps on determining how to manually locate the cohorts.&lt;br /&gt;
&lt;br /&gt;
10/20/2016 4:00-6:00 pm: Met with Peter and Christy to discuss the possibility of creating a web crawler that will pull data from individual accelerator sites.&lt;br /&gt;
&lt;br /&gt;
10/24/2016 2:00-5:00 pm: Brainstormed with Albert and Julia about changes to the category name for SBDE. Spoke to Ed about full scope of accelerator project.&lt;br /&gt;
&lt;br /&gt;
10/25/2016 4:00-6:00 pm: Brainstormed with Shrey about different potential industry focuses within accelerators, as well as different variables to search for in terms of accelerators, startups, cohorts, etc.&lt;br /&gt;
&lt;br /&gt;
10/26/2016 2:00-5:00 pm: Began searching for more databases including lists of accelerators as well as some characteristics of those accelerators; Began searching for characteristics that identify accelerators on their websites&lt;br /&gt;
&lt;br /&gt;
10/27/2016 4:00-6:00 pm: Continued searching for relevant lists of accelerators to include on our page. Added some links that have high potential under the tab (Obtained from List of Accelerators or various Google searches).&lt;br /&gt;
&lt;br /&gt;
10/31/2016 2:00-5:00 pm: Began constructing a list of variables that clearly distinguish an accelerator on its website. This is in an effort to allow a crawler to crawl through many Google searches and identify accelerators.&lt;br /&gt;
&lt;br /&gt;
11/1/2016 4:00-6:00 pm: Continued looking for variables that could identify accelerators from their websites. Searched through numerous different websites of accelerators obtained from our current databases.&lt;br /&gt;
&lt;br /&gt;
11/2/2016 2:00-4:00 pm: Continued combing through websites of numerous accelerators, well-known and other, in the hopes of finding identifying variables.&lt;br /&gt;
&lt;br /&gt;
11/3/2016 4:00-6:00 pm: Finalized my list of variables that could be used to distinguish the websites of accelerators. Slightly re-arranged our list of accelerator databases in order of relevance.&lt;br /&gt;
&lt;br /&gt;
11/7/2016 2:00-5:00 pm: Began compiling the list of all accelerators. Created a new TextPad document with information from a new database.&lt;br /&gt;
&lt;br /&gt;
11/8/2016 4:00-6:00 pm: Worked with Shrey and Ben in order to compile all of our accelerator databases into one long list on Textpad.&lt;br /&gt;
&lt;br /&gt;
11/9/2016 2:00-5:00 pm: Continued formulating a database for all accelerators and all of the available info given.&lt;br /&gt;
&lt;br /&gt;
11/10/2016 4:00-6:00 pm: Worked with Shrey and Peter in order to develop a crawler for f6s.&lt;br /&gt;
&lt;br /&gt;
11/14/2016 2:00-5:00 pm: Began sorting the Seed-DB database in an Excel document.&lt;br /&gt;
&lt;br /&gt;
11/15/2016 4:00-6:00 pm: Conducted some Google searches in an attempt to find more accelerator databases. Began looking through Executive Orders searching for keywords.&lt;br /&gt;
&lt;br /&gt;
11/16/2016 2:00-5:00 pm: Completed searching through Executive Orders.&lt;br /&gt;
&lt;br /&gt;
11/17/2016 4:00-6:00 pm: Continued working on Google searches for state accelerator list. Looked through f6s for common words that can be used to distinguish accelerators once we have finalized the crawler.&lt;br /&gt;
&lt;br /&gt;
11/21/2016 2:00-5:00 pm: Randomly chose 10 accelerators from Excel list of accelerators on the RDP. Went through each website and listed the steps that I took in order to determine whether or not the website belonged to an accelerator. Will continue extracting cohort information tomorrow.&lt;br /&gt;
&lt;br /&gt;
11/22/2016 4:00-6:00 pm: Listed out all steps for extracting cohort information from the ten randomly chosen accelerators. Worked with Peter in order to build a tool that will search all of the HTMLs and attempt to identify each one as an accelerator as well as extract some basic information.&lt;br /&gt;
&lt;br /&gt;
11/28/2016 2:00-5:00 pm: Merged the F6S accelerator list with our other list, then posted it on the project page. Learned process for accelerator data extraction from Ed.&lt;br /&gt;
&lt;br /&gt;
11/29/2016 4:00-6:00 pm: Began process of collecting data from the 20 accelerators that I am responsible for.&lt;br /&gt;
&lt;br /&gt;
11/30/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished 15/20.&lt;br /&gt;
&lt;br /&gt;
12/1/2016 4:00-6:00 pm: Continued collecting data from accelerators. Finished original 20, picked up a new set of 20.&lt;br /&gt;
&lt;br /&gt;
12/2/2016 2:00-5:00 pm: Continued collecting data from accelerators. Finished next 20.&lt;br /&gt;
&lt;br /&gt;
12/8/2016 1:00-3:00 pm: Completed collecting data from accelerators for the semester.&lt;br /&gt;
&lt;br /&gt;
1/18/2017 1:00-5:00 pm: Continued collecting data for accelerator project. Helped Catherine draft tweets for the McNair Center twitter account.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-3:00 pm: Continued collecting data on accelerators. Attended McNair Center team meeting.&lt;br /&gt;
&lt;br /&gt;
1/23/2017 1:00-5:00 pm: Began combing through accelerator list, determining which accelerators are still missing data and documenting these in a TextPad file. Finished through #115.&lt;br /&gt;
&lt;br /&gt;
1/25/2017 1:00-5:00 pm: Continued looking through accelerator list.&lt;br /&gt;
&lt;br /&gt;
1/27/2017 1:00-3:00 pm: Continued going through accelerator list. Left off on #226 with Shrey.&lt;br /&gt;
&lt;br /&gt;
1/20/2017 1:00-5:00 pm: Continued going through accelerator list. Finished through #440.&lt;br /&gt;
&lt;br /&gt;
2/1/2017 1:00-5:00 pm: Finished going through the list of accelerators looking for incomplete files. Began completing the files that were not done.&lt;br /&gt;
&lt;br /&gt;
2/3/2017 1:00-3:00 pm: Continued working on completing accelerator files.&lt;br /&gt;
&lt;br /&gt;
2/6/2017 1:00-4:30 pm: Finished data set of accelerators. Began going through and making sure that all text files and cohort files are of the same format so Peter can easily pull the information. Left for 30 minutes for an interview from 2:30-3:00 pm.&lt;br /&gt;
&lt;br /&gt;
2/8/2017 1:00-5:00 pm: Finished formatting through #137. Spoke with Ed about project.&lt;br /&gt;
&lt;br /&gt;
2/13/2017 1:00-5:00 pm: Completed formatting for all accelerator text files.&lt;br /&gt;
&lt;br /&gt;
2/15/2017 3:00-5:00 pm: Made copy of the completed data set. Spoke to Ed about future steps to take for project including gathering founder data and obtaining the crunchbase api.&lt;br /&gt;
&lt;br /&gt;
2/17/2017 1:00-3:00 pm: Went through final Excel spreadsheet for cohort information. Still need to run the crawler one more time after the completion of the editing process. Found the application for the crunchbase api which will hopefully allow us to gain access.&lt;br /&gt;
&lt;br /&gt;
2/20/2017 1:00-5:00 pm: Filled out another application for Crunchbase research access; Found the first source for the incubator project on angel.co, will hopefully work with Peter to make a crawler similar to f6s&lt;br /&gt;
&lt;br /&gt;
2/22/2017 1:00-5:00 pm: Pulled data from SDC for Ed and normalized it. Learned how to use SDC and the normalizer.&lt;br /&gt;
&lt;br /&gt;
2/24/2017 1:00-3:00 pm: Finished cleaning up the cohort data for Y-combinator on the Final Cohort Excel Spreadsheet.&lt;br /&gt;
&lt;br /&gt;
2/27/2017 1:00-5:00 pm: Continued cleaning up the cohort data in the Excel file. Finished Cohort Number and Year.&lt;br /&gt;
&lt;br /&gt;
3/1/2017 2:00-5:00 pm: Worked with Ben and Shrey to pull data from SDC for all VC funded companies and normalized it to put it in an Excel document.&lt;br /&gt;
&lt;br /&gt;
3/3/2017 1:00-2:30 pm: Worked with Ben to try and repeat down the VC data without it going too far.&lt;br /&gt;
&lt;br /&gt;
3/6/2017 1:00-4:00 pm: Worked with Shrey to finish cleaning the cohort data. It is ready to be run through the matcher with Ben.&lt;br /&gt;
&lt;br /&gt;
3/8/2017 1:00-5:00 pm: Matched the VC Data with the list of Cohort Companies and got one list of all cohort companies that have received VC funding.&lt;br /&gt;
&lt;br /&gt;
3/10/2017 12:00-2:00 pm: Put a write-up on the top of the Accelerator wiki page detailing where we are in the project currently as well as what data we have accumulated on the RDP.&lt;br /&gt;
&lt;br /&gt;
3/20/2017 1:00-5:00 pm: Began gathering the URLs of all accelerators in a TextPad file called Accelerator URLs. Participated in the SQL training session.&lt;br /&gt;
&lt;br /&gt;
3/22/2017 1:00-5:00 pm: Made tables in Terminal for Accelerator companies matched with VC companies and for Cohort Data.&lt;br /&gt;
&lt;br /&gt;
3/27/2017 1:00-4:00 pm: Compiled all URLs of accelerator into a TextPad file.&lt;br /&gt;
&lt;br /&gt;
3/29/2017 1:00-5:00 pm: Worked on the matched data with Ben. Next time I will run the RegEx code that will filter the URLs, and I will look through the duplicates where two different VC backed company names matched to one cohort company name.&lt;br /&gt;
&lt;br /&gt;
3/31/2017 1:00-2:00 pm: Ran the code for accelerator urls which are ready to be run through the wayback machine in order to get the start dates. Also began looking through vc backed company names.&lt;br /&gt;
&lt;br /&gt;
4/3/2017 1:00-5:00 pm: Continued looking through double matched VC companies. Learned more SQL from Ed.&lt;br /&gt;
&lt;br /&gt;
4/5/2017 1:00-5:00 pm: Made the final vc percentage table on terminal and for next time I will collect missing accelerator data.&lt;br /&gt;
&lt;br /&gt;
4/7/2017 1:00-3:00 pm: Began collecting cohort data for big accelerators that were missing from our list in order to add it to our final list of cohort companies.&lt;br /&gt;
&lt;br /&gt;
4/10/2017 1:00-5:00 pm: Finished gathering cohort company names for big accelerators that we were missing and put them into the Cleaned Cohort Companies Excel file. Ben is looking through Crunchbase data in order to possibly find more missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/14/2017 1:00-4:00 pm: Began working through &amp;quot;Crunchbase Potential Accelerators&amp;quot; textpad that may contain missing accelerators and wrote notes on the ones that I was able to go through. Need to finish this textpad before moving forward.&lt;br /&gt;
&lt;br /&gt;
4/17/2017 1:00-4:00 pm: Continued going through potential Crunchbase accelerators that we may have missed. Talked to Ed about getting a more comprehensive list from Excel file and by the end of the semester have the tables and data collected and done.&lt;br /&gt;
&lt;br /&gt;
4/19/2017 1:00-4:00 pm: Worked with Jeemin to generate an entire list of potential US accelerators from crunchbase. Worked to find a way to classify accelerators just based on their descriptions.&lt;br /&gt;
&lt;br /&gt;
4/21/2017: 1:00-4:00 pm: Continued working through the list identifying accelerators that we do not have. Ramee and Juliette are now helping us gather cohort data for those missing accelerators.&lt;br /&gt;
&lt;br /&gt;
4/24/2017 9:00-1:00 pm: Updated Veeral on current state of project. Typed up a to-do list on the discussion wiki for Veeral. Got new cohort data on an accelerator and added it to Excel file.&lt;br /&gt;
&lt;br /&gt;
5/3/2017 11:00-1:00 pm: Talked to Ed and Anne about future report. Continued working through list of crunchbase potential accelerators. Last day of work for this semester.&lt;br /&gt;
&lt;br /&gt;
9/11/2017 2:00-5:00 pm: Spoke to Ed about the project going forward. Organized the current updated data for our project.&lt;br /&gt;
&lt;br /&gt;
9/12/2017 3:00-5:00 pm: Began going through the Cleaned Cohort Data Excel file and found a few problems with it. Will continue the cleaning process for the rest of the week.&lt;br /&gt;
&lt;br /&gt;
9/13/2017 2:00-5:00 pm: Sorted through Cleaned Cohort Data and finalized our List of Accelerators. We can begin the process of creating our PercentVC table.&lt;br /&gt;
&lt;br /&gt;
9/14/2017 3:00-5:00 pm: Completely finalized our dataset of accelerators and startups. Met with Michelle Passo to discuss objectives of the research for credit course.&lt;br /&gt;
&lt;br /&gt;
9/18/2017 2:00-4:00 pm: Talked with Peter about the LinkedIn crawler data. Went through VC page that Meghana sent me.&lt;br /&gt;
&lt;br /&gt;
9/19/2017 3:00-5:00 pm: Completed SDC pull of updated VC Data.&lt;br /&gt;
&lt;br /&gt;
9/20/2017 2:00-5:00 pm: Attempted several times to run the Matcher. Cleaned our pulled data.&lt;br /&gt;
&lt;br /&gt;
9/21/2017 3:00-5:00 pm: Came extremely close to running the Matcher the correctly. Reviewed the final LinkedIn data from Peter.&lt;br /&gt;
&lt;br /&gt;
9/25/2017 2:00-5:00 pm: Finalized the matched file of accelerator companies with VC portfolio companies. Gave Ben the data on Georgia accelerators.&lt;br /&gt;
&lt;br /&gt;
9/26/2017 3:00-5:00 pm: Worked on finding the duplicates in our Matched file in order to have the most accurate data.&lt;br /&gt;
&lt;br /&gt;
9/27/2017 2:00-5:00 pm: Attempted to find a way to organize the duplicate matches.&lt;br /&gt;
&lt;br /&gt;
9/28/2017 4:00-5:00 pm: Continued running through matched data in order to organize it effectively.&lt;br /&gt;
&lt;br /&gt;
10/2/2017 2:00-5:00 pm: Talked to Ed about next steps for the project. Practiced accessing the crunchbase database on SQL. Brushed up on SQL code.&lt;br /&gt;
&lt;br /&gt;
10/3/2017 3:00-5:00 pm: Searched the database for crunchbase investment information.&lt;br /&gt;
&lt;br /&gt;
10/4/2017 2:00-5:00 pm: Pulled the funding rounds table from SQL and matched it with the companies that have received VC funding in order to gather round dates.&lt;br /&gt;
&lt;br /&gt;
[[Matthew Ringheanu]] [[Work Logs]] [[Matthew Ringheanu (Work Log)|(log page)]]&lt;br /&gt;
[[Category:Work Log]]&lt;/div&gt;</summary>
		<author><name>Mringheanu</name></author>
		
	</entry>
</feed>