{{Project|Has project output=Data,Tool,How-to|Has image= |Has title=USPTO Bulk Data Processing|Has owner=|Has start date=|Has deadline=|Has keywords=Data|Has sponsor=McNair Center|Has notes=|Has project status=Subsume|Is dependent on=|Does subsume=}} Return to [[Patent Data (Wiki Page)]]. 
<section begin=bulk />
The USPTO provides bulk data to the general public as XML files that record patent transactions, applications, properties, reassignments, and history. These files have been downloaded and the data has been compiled into tables using PostgreSQL. The objective of processing the bulk data is to enhance the McNair Center's historical datasets ([[Patent Data Processing - SQL Steps|patent_2015 and patentdata]]) and to track the entirety of US patent activity, specifically concerning utility patents.
<section end=bulk />
== USPTO Assignees Data ==
 
We would like to download data from this location on the USPTO website and load it into our tables. The objective is to determine whether this dataset is better than the current version of our patent data (a combination of the data in the patent_2015 and patentdata databases).
== Steps Followed to Extract the USPTO Assignees Data ==
=== Extracting Data from XML Files ===
*Add more command line options to improve usability.
*Improve portability to allow Unix/Linux pathnames. This is straightforward to do with Perl modules File::Basename and File::Spec.
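The portability item above could be approached along the following lines. This is only a minimal sketch, not the project's actual script: the helper name <code>output_path_for</code>, the sample file name, and the <code>.txt</code> output extension are all hypothetical and chosen for illustration. It uses <code>fileparse</code> from File::Basename to split an input path and <code>File::Spec->catfile</code> to rebuild an output path with whatever separator the current operating system expects.

```perl
use strict;
use warnings;
use File::Basename qw(fileparse);
use File::Spec;

# Hypothetical helper (not part of the existing script): given the path of
# an input XML file and an output directory, return the path of the output
# file, built with the current platform's path conventions.
sub output_path_for {
    my ($xml_path, $out_dir) = @_;

    # fileparse splits the path by the rules of the OS Perl is running on;
    # the regex strips the final extension (for example ".xml").
    my ($name, $dir, $suffix) = fileparse($xml_path, qr/\.[^.]*/);

    # catfile joins the pieces with the correct separator for this OS.
    return File::Spec->catfile($out_dir, "$name.txt");
}

# Example call with a made-up file name.
my $in = File::Spec->catfile('data', 'sample.xml');
print output_path_for($in, 'out'), "\n";
```

Because no separator is hard-coded, the same code should run unchanged on Windows and on Unix/Linux, which is the point of the to-do item.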
[[Category: Internal]]
[[Internal Classification: Legacy| ]]
