8,489 bytes added, 12:51, 26 March 2018
===Spring 2017===
 
<onlyinclude>
 
[[Joe Reilly]] [[Work Logs]] [[Joe Reilly (Work Log)|(log page)]]
2017-3-26: Added Score column to Accelerator Master Variable List Google Doc; began filling in necessary info.
2017-3-19: Filled in part of 'duration' column on Accelerator Master List Doc.
2017-3-9: Created a Google Doc of the Accelerator master list.
2017-3-5: Created "Variables full list" in E:\McNair\Projects\Accelerators\Spring 2018\Grouping project of ListOfAccs. Began comprehensive list of all possible variables that could be included using current info in Accelerator Variable Master List Project Excel file.
2017-3-2: Fixed errors on Accelerator Variable Master List Project in E:\McNair\Projects\Accelerators\Spring 2018; created "Potential Other Variables" in E:\McNair\Projects\Accelerators\Spring 2018\Grouping project of ListOfAccs.
2017-2-28: Organized and delegated tasks for completion of Accelerator Variable Master List Project among Michelle, Cindy, Yunnie, and me.
[fill in days]
2017-2-23: Accelerator Type Project: researched whether foreign-based accelerators had a significant US presence.
2017-2-16: Accelerator Type Project
2017-2-15: Accelerator type project: wrote instructions, saved as "Instructions for Accelerator type project" in E:\McNair\Projects\Accelerators\Spring 2018\Grouping project of ListOfAccs.
2017-2-14: Accelerator type project
2017-2-12: Accelerator type project
2017-2-7: Accelerator type project
2017-2-6: Accelerator Data meeting.
2017-2-1: Accelerator type project
2017-1-31: Accelerator type project
2017-1-29: Accelerator type project
2017-1-24: Continued on Accelerator Type Project. Began process of collecting files through Zotero for future "Gender and MGMT style" lit review page.
2017-1-22: Worked on Accelerator Type Project. See http://mcnair.bakerinstitute.org/wiki/Accelerator_Seed_List_(Data)#Accelerator_Type_project. Also see E:\McNair\Projects\Accelerators\Spring 2017\Grouping project of ListOfAccs.
</onlyinclude>
===Fall 2017===
2017-11-28: Accelerator seed list grouping.
2017-11-21: Accelerator seed list grouping.
2017-11-16: Continued on Accelerator seed list grouping.
2017-11-14: Updated http://mcnair.bakerinstitute.org/wiki/Patent_Design_Main_Page.
2017-11-10: Continued on Accelerator seed list grouping.
2017-11-07: Started on Accelerator seed list grouping. See http://mcnair.bakerinstitute.org/wiki/Accelerator_Seed_List_(Data)#Current_Project_Write-Up and http://mcnair.bakerinstitute.org/wiki/Accelerator_Seed_List_(Data)#Current_Work.
2017-11-02: Moved "Cleaned lease data with prices" into E:\McNair\Projects\Innovation Districts\Houston Startup Ecosystem\Files for Geocoding. Did the same for "cleaned sales data with prices". Added a section to the Google Doc for the Houston Innovation District called Office Space for Lease. To measure cost of living (median income in Houston is lower than the US median), collected CPI data from the Bureau of Labor Statistics. See "Median Income" section on https://docs.google.com/document/d/17iorMk9KeQV5UJmzGFvBiwKk20sybvAJDeFvbkNzpfw/edit. Updated work log to new requirements, fixed typos.
2017-10-31: Checked applicants entries for XML project. Corrected Google Docs for Houston Innovation District. Cleaned lease/sales info in E:\McNair\Projects\Innovation Districts\Houston Startup Ecosystem\Location data. Created cleaned lease data with prices, cleaned sales data with prices. Copied data into "Adresses with more info" in E:\McNair\Projects\Innovation Districts\Houston Startup Ecosystem. Copied "addresses of all Houston crimes 2016" into E:\McNair\Projects\Innovation Districts\Houston Startup Ecosystem\Files for Geocoding.
2017-10-27: Cleaned "addresses of all Houston crimes 2016" completely. Made changes on the Google Docs summarizing our work.
2017-10-26: Continued working on [[Patent Schema Reconciliation]]. Added XMLs under citations header. Added 7 XMLs; checked for different types and versions. See http://mcnair.bakerinstitute.org/wiki/Equivalent_XPath_and_APS_Queries#Citations. See [[Talk:Houston Innovation District]].
2017-10-20: See http://mcnair.bakerinstitute.org/wiki/Talk:Houston_Innovation_District.
2017-10-19: See http://mcnair.bakerinstitute.org/wiki/Talk:Houston_Innovation_District.
2017-10-15: See http://mcnair.bakerinstitute.org/wiki/Talk:Augusta_Startup_Ecosystem
2017-10-03: On http://mcnair.bakerinstitute.org/wiki/Augusta_Startup_Ecosystem:
*Added sections:
**Augusta Incubators section: added Clubhou.se, with list of corporate sponsors. Added info, where I could find any, on how they supported Clubhou.se.
**Augusta Innovation Zone
**UNISYS presence
**Booz Allen Hamilton presence
*On powerpoint, added slide on Clubhou.se & Augusta Innovation Zone. Saved as Augusta_Innovation_District_10.03 in E:\McNair\Projects\Innovation Districts\Augusta Startup Ecosystem.
2017-09-28: Created slides listing info on Augusta from the wiki page. Saved in E:\McNair\Projects\Innovation Districts\Augusta Startup Ecosystem, called "Joe Augusta". Merged with Dylan's slides. Still need to add more info, make cleaner. Edited Augusta Ecosystem page. Added chart from augmaster excel file in E:\McNair\Projects\Innovation Districts\Augusta Startup Ecosystem\NIH NSF Clinical Trials for Georgia by Zip to Augusta Powerpoint.
2017-09-26: Created a text file with XML of every unique piece of information in the assignment file in E:\McNair\Projects\SimplerPatentData\data\examples. Saved file in the same folder. On [[Augusta Startup Ecosystem]], completed numbers 2 and 3 under "Tasks."
To-do notes: For the assignment file in E:\McNair\Projects\SimplerPatentData\data\examples, list the XML and an example for 1 instance of every node of information. Don't need to list repeats, just every unique xpath with its corresponding example. Pages 27-28 of https://www.uspto.gov/sites/default/files/documents/USPTO_Patents_Assignment_Dataset_WP.pdf have good definitions.
2017-09-22: Checked xpaths for granted, made corrections for mistakes in the text file and McNair.
2017-09-21: Added xpaths to Patent Schema Reconciliation text.txt from [[Equivalent XPath and APS Queries]]; completed all up to assignment name.
2017-09-20: Finished listing the xpaths under assignments for the text file, and with it, all the granted xpaths are now listed in the text file.
2017-09-19: Added xpaths on [[Equivalent XPath and APS Queries]] to Patent Schema Reconciliation text.txt, which hasn't been updated in a while. The xpaths for [[Equivalent XPath and APS Queries]] are complete for granted v4.0-v4.5. Added city and state under the citations header from the wiki page to the text file.
2017-09-15: Edited mistaken xpaths. Discovered that inventors and citations are indeed two different sections, with inventors only existing for v4.3-v4.5. They are not different headers referencing the same information. The ''inventors'' section appears only to exist under v4.3-v4.5, and the xpath for ''citations'' is parties/applicants/applicant (for v4.0-v4.2), or (for v4.3-v4.5) us-parties/applicants/applicant. Checked xpaths and listed under all sections. Xpaths still need to be added to the text file Patent Schema Reconciliation text.txt in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation.
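The version split noted in the 2017-09-15 entry (one path for v4.0-v4.2, another for v4.3-v4.5) can be handled by trying each path in turn. The following is a minimal Python sketch, not the project's actual code. The two paths are copied from the entry as written and have not been checked against the USPTO schema; the sample documents, the <code>doc</code> wrapper element, and the function name <code>find_applicants</code> are hypothetical.

```python
import xml.etree.ElementTree as ET

# Paths as quoted in the log entry (assumption: not verified against the schema).
APPLICANT_PATHS = [
    ".//us-parties/applicants/applicant",  # stated for v4.3-v4.5
    ".//parties/applicants/applicant",     # stated for v4.0-v4.2
]


def find_applicants(root):
    """Return (matching path, elements) for the first path that yields results."""
    for path in APPLICANT_PATHS:
        found = root.findall(path)
        if found:
            return path, found
    return None, []


# Hypothetical miniature documents, one per schema family.
OLD_STYLE = "<doc><parties><applicants><applicant sequence='001'/></applicants></parties></doc>"
NEW_STYLE = (
    "<doc><us-parties><applicants>"
    "<applicant sequence='001'/><applicant sequence='002'/>"
    "</applicants></us-parties></doc>"
)

old_path, old_hits = find_applicants(ET.fromstring(OLD_STYLE))
new_path, new_hits = find_applicants(ET.fromstring(NEW_STYLE))
print(old_path, len(old_hits))
print(new_path, len(new_hits))
```

Because element names are matched exactly, the <code>parties</code> path does not match <code>us-parties</code>, so each document is resolved by exactly one of the two paths.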
2017-09-14: Finished listing xpaths for "lawyer" section (appears as "agent" in text files in E:\McNair\Projects\SimplerPatentData\data\examples\granted) in [[Equivalent XPath and APS Queries]]. With it, every xpath we're interested in in E:\McNair\Projects\SimplerPatentData\data\examples\granted is now listed in [[Equivalent XPath and APS Queries]], for each type and version, with relevant examples. Still need to add some of these xpaths to the file Patent Schema Reconciliation text.txt in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation.
2017-09-12: Continued on xpath project. Finished citations section; still need to check which types and versions had entries for city, country, and state, under assignments. Still need to complete lawyers section as well.
2017-09-08: For xpath project, added notes on [[Equivalent XPath and APS Queries]] titled "Notes for Joe to remember". Listed orgname xpaths for citations, discovered that the citations and inventors sections were effectively identical. Orgname still needs to be added to Patent Schema Reconciliation text.txt.
2017-09-07: For xpath project, corrected example types that mistakenly contained quotation marks in [[Equivalent XPath and APS Queries]]. Only kept quotation marks when they appeared in the patent text itself. Added, in the citations category, sequence, last_name, and first_name to Patent Schema Reconciliation text.txt in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation. I also listed an exhaustive set of examples for sequence, last_name, and first_name in [[Equivalent XPath and APS Queries]]. Also listed orgname for v4.5 on [[Equivalent XPath and APS Queries]].
*Note: [[Equivalent XPath and APS Queries]] indicates that all entries under the citations header are strings. Assuming strings are indicated by quotation marks, only sequence appears to be a string.
 
2017-09-05: For xpath project, added primary_examiner_first_name xpaths to Patent Schema Reconciliation text.txt in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation. Listed an exhaustive set of example types for primary_examiner_first_name in [[Equivalent XPath and APS Queries]]. Did the same for primary_examiner_last_name, primary_examiner_department, and number_of_claims.
 
2017-08-25: For xpath project, added classification_national_country and classification_national_class xpaths to Patent Schema Reconciliation text.txt in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation. Continued listing an exhaustive list of example types for the different types and versions of classification_national_class in [[Equivalent XPath and APS Queries]].
 
2017-08-24: For xpath project, added cpc_subclass, cpc_main-group, and cpc_subgroup to Patent Schema Reconciliation text.txt in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation. Began listing an exhaustive set of example types for different types and versions of classification_national_class in [[Equivalent XPath and APS Queries]].
 
 
===Summer 2017===
 
 
 
2017-08-04: Continued on xpath project. See "Task Notes" in [[Patent Schema Reconciliation]].
 
2017-08-03: For the cities ranked 1-50 in Top50_Table in E:\McNair\Projects\Ecosystem\Ranking, found 1) City 2) State 3) Dollars invested 4) First-round deals 5) Active Startups 6) Density (Active Startups per Capita). Found percent of 3, 4, and 5 among the totals for all of the US, and for each of 8 metro areas as a proportion of the US' totals. Saved file as "Cleaned ranking table Aug 3" in E:\McNair\Projects\Ecosystem\Ranking. Continued on xpath project.
 
2017-08-02: Edited excel charts from 08-01. Continued on xpath project, completed IPCR sections. For Copy of Rankingv3_Diana's_workingfile in E:\McNair\Projects\Ecosystem\Ranking, added data on population, political activity in the 2016 presidential election, whether it had a university (using a filter of organizations that gave out doctorate degrees in Carnegie Classifications 2015_cleaned in E:\McNair\Projects\University Patents).
 
2017-08-01: For the xpath project: Found which patent versions for text files in E:\McNair\Projects\SimplerPatentData\data\examples\granted had PRIORITY_CLAIMS_DATE, PRIORITY_CLAIMS_COUNTRY, and PRIORITY_CLAIMS_PATENT_NUMBER; noted which ones did, and added their xpaths to the file of xpaths, Patent Schema Reconciliation.txt in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation. Checked which types and versions had pct document numbers, updated xpaths in http://mcnair.bakerinstitute.org/wiki/Equivalent_XPath_and_APS_Queries#Query_Equivalences and Patent Schema Reconciliation.txt. Began the same process with IPCR_Subclass and the following xpaths on http://mcnair.bakerinstitute.org/wiki/Equivalent_XPath_and_APS_Queries#Query_Equivalences. Began listing examples for each xpath. Used data from roundplus.txt in Z:\VentureCapitalData\SDCVCData\vcdb to create charts, saved as New2017Report(Aug) in E:\McNair\Projects\Houston\2017Report.
 
2017-07-28: Continued working on xpaths. For Top50_Table in E:\McNair\Projects\Ecosystem\Ranking, found and entered necessary data. Source for city population and area is in the same folder, titled "City area chart".
 
2017-07-27: Double-checked that the xpaths in http://mcnair.bakerinstitute.org/wiki/Equivalent_XPath_and_APS_Queries#Query_Equivalences were accurate for v4.0, v4.1, v4.2, and added to the page xpaths for the nodes listed on that wiki page for v<4.3. Began adding xpaths for other nodes Oliver noted would be helpful, like Invention Title. Went over new hubs definition with Hira; ensured no hubs on "Joe hub list 2017" (see above) were actually just incubators, and that they all had coding/tech events/programs with substance. Took 17 hubs off total. Saved new list in Z:\Hubs\2017\hubs_data, called Joe hub list 2017 w comments.
 
2017-07-26: Began [[Patent Schema Reconciliation]], creating a text document of xpaths for the following nodes: patent number, filing number, grant date, kind, type, application number, and filing date. Saved file in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation.

Notes: I am assuming "application number" in the patent code means "filing number", because the word "filing" appears nowhere in the code, and there is already a different number, under the "publication reference" header, that seems to be referring to the patent number. It's likely that the number under which the patent is internally filed is called "application number", and appears under the header "application reference", and that the (publicized) patent number appears under the header "publication reference".

An example xpath for a certain block of code from granted, v4.5, plant:

<us-bibliographic-data-grant>
<publication-reference>
<document-id>
<country>US</country>
<doc-number>PP027502</doc-number>
<kind>P3</kind>
<date>20161227</date>
</document-id>

For the above code, I identified (what I think are accurate) xpaths for the nodes of patent number (//us-bibliographic-data-grant/publication-reference/document-id/doc-number), kind (//us-bibliographic-data-grant/publication-reference/document-id/kind), and grant date (//us-bibliographic-data-grant/publication-reference/document-id/date). I am adding the xpaths for these nodes, as well as the others mentioned above, for the 4 types of patents, for each version, for both granted and applications. Still have to do xpaths for granted version 2.5 for all types, and all applications. Waiting on Oliver about whether we need xpaths for more nodes other than the 6 example nodes.
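The three xpaths identified for the granted v4.5 plant fragment (patent number, kind, grant date) can be checked with a short script. This is a minimal Python sketch under stated assumptions, not the project's actual tooling. The closing tags and the <code>us-patent-grant</code> wrapper element are added here so that the fragment parses. The leading <code>//</code> is written as <code>.//</code> because Python's ElementTree only supports relative paths. The names <code>XPATHS</code> and <code>extract_fields</code> are hypothetical.

```python
import xml.etree.ElementTree as ET

# The logged fragment, closed and wrapped so that it is well-formed XML
# (the closing tags and the wrapper element are assumptions of this sketch).
SAMPLE = """<us-patent-grant>
  <us-bibliographic-data-grant>
    <publication-reference>
      <document-id>
        <country>US</country>
        <doc-number>PP027502</doc-number>
        <kind>P3</kind>
        <date>20161227</date>
      </document-id>
    </publication-reference>
  </us-bibliographic-data-grant>
</us-patent-grant>"""

# Relative form of the xpaths noted in the log.
BASE = ".//us-bibliographic-data-grant/publication-reference/document-id/"
XPATHS = {
    "patent_number": BASE + "doc-number",
    "kind": BASE + "kind",
    "grant_date": BASE + "date",
}


def extract_fields(xml_text):
    """Return the text of each node, or None where an xpath matches nothing."""
    root = ET.fromstring(xml_text)
    return {name: root.findtext(path) for name, path in XPATHS.items()}


fields = extract_fields(SAMPLE)
print(fields)
```

Running the same function over one example file per patent type and version would show which xpaths return None and therefore need a version-specific variant.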
 
2017-07-25: Noted hubs that met the new definition but were not considered hubs in hubs_list in the same file. Copied all hubs data from "hubs list" in Z:\Hubs\2017\hubs_data to "Joe hub list 2017" in the same folder. Searched hubs.txt and "Potential Hubs" in Z:\Hubs\2017\hubs_data for new hubs; added new ones to "Joe hubs list 2017".
 
2017-07-21: Confirmed whether each hub in last year's hub list (in "Hubs Data v2_16" in Z:\Hubs\2017\hubs_data) is still operating.
 
2017-07-20: Searched through the firms in "Raw Program list" in E:\McNair\Projects\Hubs\summer 2016 to determine if they could be considered hubs based on the definition listed above. If they were, they were added to the list of new hubs in "hubs list" in Z:\Hubs\2017\hubs_data.
 
2017-07-19: Continued editing "hubs list" in Z:\Hubs\2017\hubs_data, researching organizations marked as questionable hubs. Used websites like Alexa and similarsites.com to find hubs with websites similar to hubs in the "hubs list" file. Only found 1 new hub. Began searching possible hubs in the file "Raw Program list" in E:\McNair\Projects\Hubs\summer 2016.
 
2017-07-18: For the file "hubs list" in Z:\Hubs\2017\hubs_data, researched whether organizations not listed as hubs (aka shaded red) in "Hubs Data v2_16" (located in the same folder) should be considered hubs, under the definition that a hub 1) has a coworking space, 2) provides mentorship, and 3) offers coding classes/tech events for cohort companies. Whether the hub had an accelerator or was tech focused was also noted.
 
2017-07-14: Used data from links in "Industry breakdown by GDP..." in E:\McNair\Projects\Houston\Industries to create Excel charts of Houston employment & Gross Area Product broken down by industry. Saved charts in the same folder. Energy, health, and (for employment data) IT sectors were emphasized, in line with the goal of communicating the idea that, as substantial parts of the Houston economy, those industries will benefit from supporting local startups. Looked up example charts in the file 2017ReportV1 in E:\McNair\Projects\Houston\Houston Ecosystem Recommendations, both for ideas for future charts, and to get an idea of quantitative VC data in Houston. Fixed typos, improved incomplete keys. Cleaned the file Venture Funds in E:\McNair\Projects\Houston\VCData: added research on funds declared dead to make sure they actually are. Double-checked that all firms in the master list were accounted for in the grouped list of VC firms.
 
2017-07-13: Helped Diana with researching proportion of Houston's city budget allocated towards startup funding/entrepreneurship (apparently none...). Gathered info on IT and Procurement sections of Budget. Researched proportion of Houston budget allocated towards IT and Procurement. Excel and text files saved in E:\McNair\Projects\Houston\Budget. Continued researching relative sizes of industries in Houston, gathered relevant info and links in "Industry breakdown..." in E:\McNair\Projects\Houston\Industries.
 
Note: The Houston VC data write up is on [[Houston_Entrepreneurship_Ecosystem_Project#VC_Funds_in_Houston]]
 
2017-07-12: Added addresses/PO box locations to Venture Funds in E:\McNair\Projects\Houston\VCData to remaining VC firms. Organized list. Began organizing cohort data collected on 6/30 w/ regex to streamline searching and gathering of cohort names themselves. Compiled addresses, along with other categorized info, in an excel file called "VC firms with address, basic sector info", saved in E:\McNair\Projects\Houston\VCData. For each cohort in E:\McNair\Projects\Accelerators\Accelerator Match, added headers to the tabbed name, founder, description, etc. Continued searching for addresses of the firms with no addresses listed in "VC firms with address, basic sector info". Double-checked addresses.
 
2017-07-11: Continued adding addresses to companies in Venture Funds in E:\McNair\Projects\Houston\VCData. Note: The file "VC Data" is now called "Venture Funds".
 
2017-06-30: Continued adding cohorts to each new accelerator in E:\McNair\Projects\Accelerators, saving each accelerator's cohort in E:\McNair\Projects\Accelerators\Data as (acceleratorname).cohort, either as an excel file or a text file. Added addresses to companies in E:\McNair\Projects\Houston\VCData.
 
2017-06-29: Continued adding cohorts to each new accelerator in E:\McNair\Projects\Accelerators, saving each accelerator's cohort in E:\McNair\Projects\Accelerators\Data as (acceleratorname).cohort, as a text file. Searched through documents in E:\McNair\Projects\SimplerPatentData\data\extracts\applications, in the modern and vintage folders, for examples of patents of the following type: utility, plant, reissue, and design, in versions 1.5, 1.6, 4.0, 4.1, 4.2, 4.3, 4.4, and 4.5. Placed examples in the folder E:\McNair\Projects\SimplerPatentData\data\examples. As mentioned in the wiki page (and all but confirmed with regex searches of hundreds of the patent documents), we appear only to have data on utility patents, except for a few plant patents.
 
2017-06-28: Began adding cohorts to each new accelerator in E:\McNair\Projects\Accelerators, saving each accelerator's cohort in E:\McNair\Projects\Accelerators\Data as (acceleratorname).cohort, as a text file.
 
2017-06-27: Sorted VC funds in E:\McNair\Projects\Houston\VCData; deleted non-operating ones; finalized groups. Began researching the relative size of different sectors in Houston's economy. Work saved in E:\McNair\Projects\Houston\Industries.
 
2017-06-23: Finished grouping VC funds in the file Venture Funds in E:\McNair\Projects\Houston\VCData. Researched whether based in Houston, and whether they should be considered alive.
 
2017-06-22: Finished researching VC funds in the file Venture Funds in E:\McNair\Projects\Houston\VCData. Began grouping.
 
2017-06-21: Continued researching VC funds in the file Venture Funds in E:\McNair\Projects\Houston\VCData.
 
2017-06-20: Joined the center! Wrote my page. Started on [[Collecting SBIR Data]]. Finished collecting SBIR Data; saved in bulk(E:)--> McNair-->Projects-->SBIR. Began researching VC funds in the file Venture Funds in E:\McNair\Projects\Houston\VCData.
 
 
 
[[Category:Work Log]]