Difference between revisions of "Matching VentureOne (Data)"

From edegan.com
Jump to navigation Jump to search
Line 24: Line 24:
 
:Z:\allpatentsprocessed\DistinctAssignees2matched.txt  
 
:Z:\allpatentsprocessed\DistinctAssignees2matched.txt  
 
**Load the VentureOne data into a dbase
 
**Load the VentureOne data into a dbase
 +
 +
*Match venture data to patent data
 +
 +
*Join patent data to assignee data, creating firstjoin_cleaned
 +
*Join firstjoin_cleaned data to matchassignee data, creating secondjoin_cleaned
 +
*Join secondjoin_cleaned data to venturepatentmatched data, creating fourthjoin_cleaned

Revision as of 15:25, 15 June 2016


McNair Project
Matching VentureOne (Data)
Project logo 02.png
Project Information
Project Title
Start Date
Deadline
Primary Billing
Notes
Has project status
Copyright © 2016 edegan.com. All Rights Reserved.


Data

  • Get the source file for the VentureOne data
E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx Original data source
  • Clean it up
E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt extraneous symbols and words removed
  • Match it against itself
E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx
  • Get the patent data
    • Draw the distinct assignees
Z:\allpatentsprocessed\DistinctAssignees2.txt
    • Match them against themselves
Z:\allpatentsprocessed\DistinctAssignees2matched.txt
    • Load the VentureOne data into a dbase
  • Match venture data to patent data
  • Join patent data to assignee data, creating firstjoin_cleaned
  • Join firstjoin_cleaned data to matchassignee data, creating secondjoin_cleaned
  • Join secondjoin_cleaned data to venturepatentmatched data, creating fourthjoin_cleaned