Difference between pages "Category:McNair Staff" and "Matching VentureOne (Data)"

From edegan.com
(Difference between pages)
Jump to navigation Jump to search
 
 
Line 1: Line 1:
{{#default_form:McNair Faculty}}
+
{{McNair Projects
 +
|Project Title=Matching VentureOne (Other),
 +
|Topic Area=Patents and Innovation,
 +
|Owner=Ariel Sun,
 +
|Start Term=Summer 2016,
 +
|Status=Active,
 +
|Deliverable=Other,
 +
}}
 +
 
 +
==Data==
 +
*Get the source file for the VentureOne data
 +
:<code>E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx</code> Original data source
 +
 
 +
*Clean it up
 +
:<code>E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt</code> extraneous symbols and words removed
 +
 
 +
*Match it against itself
 +
:<code>E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx</code>
 +
 
 +
*Get the patent data
 +
**Draw the distinct assignees         
 +
:<code>Z:\allpatentsprocessed\DistinctAssignees2.txt </code><br>
 +
**Match them against themselves
 +
:<code>Z:\allpatentsprocessed\DistinctAssignees2matched.txt </code>
 +
 
 +
 
 +
*Match venture data to patent data
 +
:<code>Z:\allpatentsprocessed\Venture Patent Matched.txt</code>
 +
 
 +
*Join <code>patent</code> data to <code>assignee</code> data, creating <code>firstjoin_cleaned</code>
 +
*Join <code>firstjoin_cleaned</code> data to <code>matchassignee</code> data, creating secondjoin_cleaned</code>
 +
*Join <code>secondjoin_cleaned</code> data to <code>venturepatentmatched</code> data, creating fourthjoin_cleaned</code>

Revision as of 15:33, 15 June 2016


McNair Project
Matching VentureOne (Data)
Project logo 02.png
Project Information
Project Title
Start Date
Deadline
Primary Billing
Notes
Has project status
Copyright © 2016 edegan.com. All Rights Reserved.


Data

  • Get the source file for the VentureOne data
E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx Original data source
  • Clean it up
E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt extraneous symbols and words removed
  • Match it against itself
E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx
  • Get the patent data
    • Draw the distinct assignees
Z:\allpatentsprocessed\DistinctAssignees2.txt
    • Match them against themselves
Z:\allpatentsprocessed\DistinctAssignees2matched.txt


  • Match venture data to patent data
Z:\allpatentsprocessed\Venture Patent Matched.txt
  • Join patent data to assignee data, creating firstjoin_cleaned
  • Join firstjoin_cleaned data to matchassignee data, creating secondjoin_cleaned
  • Join secondjoin_cleaned data to venturepatentmatched data, creating fourthjoin_cleaned