Difference between revisions of "Ravali Kruthiventi (Research Plan)"

From edegan.com
Jump to navigation Jump to search
(Created page with " ===Project - USPTO Assignees, Patent and Citation Data=== ==== Assignees Data ==== *Data source: ** Issues: ** Solution: ** Time to implement: ** Priority: ** Teams waiting f...")
 
Line 2: Line 2:
 
===Project - USPTO Assignees, Patent and Citation Data===
 
===Project - USPTO Assignees, Patent and Citation Data===
 
==== Assignees Data ====
 
==== Assignees Data ====
*Data source:
+
*Data source: patent database (merged data from patent_2015 and patentdata databases)
** Issues:
+
** Issues: citations data contains non numeric patent numbers (likely application numbers, etc)
** Solution:
+
 
** Time to implement:
+
** Solution:  
 +
*** Segregate into smaller tables so that Amir and Marcela can identify patterns
 +
*** link back to appropriate patent numbers from the patent table
 +
 
 +
** Time to implement: 1 day
 +
 
 
** Priority:
 
** Priority:
 +
 
** Teams waiting for it:
 
** Teams waiting for it:
 +
*** Marcela and Amair
 +
****Project : Patent data analysis (?)
 +
*** Jake and James, potentially could need this down the line
 +
**** Project : LBO data
 +
 
** Deadline:
 
** Deadline:
  
*Data Source:
+
*Data Source: USPTO Bulk Data repository
 
** Issues:
 
** Issues:
 +
*** The script inserts copies of data into the tables.
 +
*** Analysis required on the data to make sure the data was inserted correctly from the XML files.
 +
*** Analysis is also required to determine whether this data is better than the data we have in the patent database right now.
 +
**** Action owners : Amir and Marcela
 +
 
** Solution:
 
** Solution:
 +
*** Amir and Marcela and/or I need to look at the data to determine quality
 +
*** Amir and Marcela and/or I will need to delete the copies
 +
 
** Time to implement:
 
** Time to implement:
 
** Priority:
 
** Priority:

Revision as of 13:14, 15 July 2016

Project - USPTO Assignees, Patent and Citation Data

Assignees Data

  • Data source: patent database (merged data from patent_2015 and patentdata databases)
    • Issues: citations data contains non numeric patent numbers (likely application numbers, etc)
    • Solution:
      • Segregate into smaller tables so that Amir and Marcela can identify patterns
      • link back to appropriate patent numbers from the patent table
    • Time to implement: 1 day
    • Priority:
    • Teams waiting for it:
      • Marcela and Amair
        • Project : Patent data analysis (?)
      • Jake and James, potentially could need this down the line
        • Project : LBO data
    • Deadline:
  • Data Source: USPTO Bulk Data repository
    • Issues:
      • The script inserts copies of data into the tables.
      • Analysis required on the data to make sure the data was inserted correctly from the XML files.
      • Analysis is also required to determine whether this data is better than the data we have in the patent database right now.
        • Action owners : Amir and Marcela
    • Solution:
      • Amir and Marcela and/or I need to look at the data to determine quality
      • Amir and Marcela and/or I will need to delete the copies
    • Time to implement:
    • Priority:
    • Teams waiting for it:
    • Deadline:

Project - Lex Machina Data

  • Data Source:
    • Issues:
    • Solution:
    • Time to implement:
    • Priority:
    • Teams waiting for it:
    • Deadline:


Project - Pattern Recognition on Patent Data through Machine Learning

  • Data Source:
    • Plan:
      • Technique
    • Known Issues:
    • Solution:
    • Time to implement:
    • Priority:
    • Teams waiting for it:
    • Deadline: