Difference between pages "Form:McNair Faculty" and "Matching VentureOne (Data)"
Line 1: | Line 1: | ||
− | + | {{McNair Projects | |
− | + | |Project Title=Matching VentureOne (Other), | |
− | + | |Topic Area=Patents and Innovation, | |
− | + | |Owner=Ariel Sun, | |
+ | |Start Term=Summer 2016, | ||
+ | |Status=Active, | ||
+ | |Deliverable=Other, | ||
+ | }} | ||
+ | ==Data== | ||
+ | *Get the source file for the VentureOne data | ||
+ | :<code>E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx</code> Original data source | ||
− | + | *Clean it up | |
+ | :<code>E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt</code> extraneous symbols and words removed | ||
− | < | + | *Match it against itself |
− | + | :<code>E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx</code> | |
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | + | *Get the patent data | |
+ | **Draw the distinct assignees | ||
+ | :<code>Z:\allpatentsprocessed\DistinctAssignees2.txt </code><br> | ||
+ | **Match them against themselves | ||
+ | :<code>Z:\allpatentsprocessed\DistinctAssignees2matched.txt </code> | ||
− | |||
+ | *Match venture data to patent data | ||
+ | :<code>Z:\allpatentsprocessed\Venture Patent Matched.txt</code> | ||
− | + | *Join <code>patent</code> data to <code>assignee</code> data, creating <code>firstjoin_cleaned</code> | |
− | + | *Join <code>firstjoin_cleaned</code> data to <code>matchassignee</code> data, creating <code>secondjoin_cleaned</code> | |
− | + | *Join <code>secondjoin_cleaned</code> data to <code>venturepatentmatched</code> data, creating <code>fourthjoin_cleaned</code> | |
− | |||
− | |||
− | </ |
Revision as of 15:33, 15 June 2016
Matching VentureOne (Data) | |
---|---|
Project Information | |
Project Title | |
Start Date | |
Deadline | |
Primary Billing | |
Notes | |
Has project status | |
Copyright © 2016 edegan.com. All Rights Reserved. |
Data
- Get the source file for the VentureOne data
E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx
Original data source
- Clean it up
E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt
extraneous symbols and words removed
- Match it against itself
E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx
- Get the patent data
  - Draw the distinct assignees
  Z:\allpatentsprocessed\DistinctAssignees2.txt
  - Match them against themselves
  Z:\allpatentsprocessed\DistinctAssignees2matched.txt
- Match venture data to patent data
Z:\allpatentsprocessed\Venture Patent Matched.txt
- Join patent data to assignee data, creating firstjoin_cleaned
- Join firstjoin_cleaned data to matchassignee data, creating secondjoin_cleaned
- Join secondjoin_cleaned data to venturepatentmatched data, creating fourthjoin_cleaned
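
The three joins above can be sketched as a chain of SQL statements. This is only an illustration, run here through Python's built-in sqlite3 module on an in-memory database. The table names come from the steps above, but every column name (patent_id, title, assignee_name, std_assignee, venture_name), the join keys, the sample rows, and the choice of inner joins are assumptions; the page does not say how the project's tables are actually keyed or which database they live in.

```python
import sqlite3

# Table names follow the steps above. All columns, keys, and rows are
# hypothetical and exist only to make the sketch runnable.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE patent (patent_id INTEGER, title TEXT);
CREATE TABLE assignee (patent_id INTEGER, assignee_name TEXT);
CREATE TABLE matchassignee (assignee_name TEXT, std_assignee TEXT);
CREATE TABLE venturepatentmatched (std_assignee TEXT, venture_name TEXT);

INSERT INTO patent VALUES (1, 'Widget'), (2, 'Gadget');
INSERT INTO assignee VALUES (1, 'Acme Inc.'), (2, 'Beta LLC');
INSERT INTO matchassignee VALUES ('Acme Inc.', 'ACME'), ('Beta LLC', 'BETA');
INSERT INTO venturepatentmatched VALUES ('ACME', 'Acme');

-- Join patent data to assignee data (assumed key: patent_id).
CREATE TABLE firstjoin_cleaned AS
SELECT p.patent_id, p.title, a.assignee_name
FROM patent AS p
JOIN assignee AS a ON a.patent_id = p.patent_id;

-- Attach the standardized assignee name from the self-match output.
CREATE TABLE secondjoin_cleaned AS
SELECT f.patent_id, f.title, f.assignee_name, m.std_assignee
FROM firstjoin_cleaned AS f
JOIN matchassignee AS m ON m.assignee_name = f.assignee_name;

-- Attach the matched venture record (assumed key: std_assignee).
CREATE TABLE fourthjoin_cleaned AS
SELECT s.patent_id, s.title, s.assignee_name, s.std_assignee, v.venture_name
FROM secondjoin_cleaned AS s
JOIN venturepatentmatched AS v ON v.std_assignee = s.std_assignee;
""")

rows = conn.execute(
    "SELECT patent_id, assignee_name, venture_name "
    "FROM fourthjoin_cleaned ORDER BY patent_id"
).fetchall()
print(rows)
```

With inner joins, a patent whose assignee has no venture match (patent 2 in the sample rows) drops out of fourthjoin_cleaned. If unmatched patents should be kept, a LEFT JOIN in the last step would be the usual alternative.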