Difference between revisions of "Matching VentureOne (Data)"
Jump to navigation
Jump to search
| Line 1: | Line 1: | ||
{{McNair Projects | {{McNair Projects | ||
| − | |Project Title=Matching VentureOne (Data) | + | |Project Title=Matching VentureOne (Data) |
| − | |Topic Area=Patents and Innovation | + | |Topic Area=Patents and Innovation |
| − | |Owner=Ariel Sun, | + | |Owner=Ariel Sun, Rosemarie Ziedonis |
| − | |Start Term=Summer 2016 | + | |Start Term=Summer 2016 |
| − | |Status=Active | + | |Status=Active |
| − | |Deliverable=Other | + | |Deliverable=Other |
|Primary Billing= AccMcNair01 | |Primary Billing= AccMcNair01 | ||
}} | }} | ||
Revision as of 16:39, 22 June 2016
| Matching VentureOne (Data) | |
|---|---|
| Project Information | |
| Project Title | |
| Start Date | |
| Deadline | |
| Primary Billing | |
| Notes | |
| Has project status | |
| Copyright © 2016 edegan.com. All Rights Reserved. | |
Data Processing
- Get the VentureOne data ready
- Source file for VentureOne data
E:\McNair\Projects\Venture One Data\Venture Data 1.xlsxOriginal data source - Clean it up
E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txtextraneous symbols and words removed - Match it against itself to get standardized entity names
E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx
- Get the patent data ready
- Draw the distinct assignees
Z:\allpatentsprocessed\DistinctAssignees2.txt - Match them against themselves to get standardized org names for patent data
Z:\allpatentsprocessed\DistinctAssignees2matched.txt
- Match standardized org names of patent data to standardized entity names of venture data
Z:\allpatentsprocessed\Venture Patent Matched.txt
- Join patent data to venture data to get patent information of each venture-backed company
- Join
patentdata toassigneedata, creatingfirstjoin_cleanedwhich matches assignees to patent numbers. - Join
firstjoin_cleaneddata tomatchassigneedata, creatingsecondjoin_cleanedwhich matches standard org names to patent numbers - Join
secondjoin_cleaneddata toventurepatentmatcheddata, creatingfourthjoin_cleanedwhich matches standard venture company names to patent numbers
- Final summary tables
- Summary table displaying number of patents owned, minimum grant year, maximum grant year and average grant year for each company
E:\McNair\Projects\Venture One Data\venturepatentreallyfinal.txt - A table of all patent information for each company that has patent
E:\McNair\Projects\Venture One Data\venturepatentfullyjoined.txt
- Notes
- All data in
allpatentsprocessed database. Access it by logging on toresearcher@McNair DBServ:/bulk/allpatentsprocessed - A script of detailed processing procedure can be found at
E:\McNair\Projects\Venture One Data\patent data script.txt