Changes

Jump to navigation Jump to search
====Downloading the data====
 
This is correct:
 
Run USPTO_Assignee_Download.pl, which is located in McNair/usptoAssigneeData, to download the data from uspto bulk website.
 
Run USPTO_Assignee_Splitter.pl, which is located in McNair/usptoAssigneeData, to split the data.
 
As of 4/20/2017 is we think the correct parser is xml_parser2.plx in McNair/usptoAssigneeData.
 
Look in the "patent" database and look for the most recent frameno and reelno so we don't copy over data that we already have in the database.
 
current max reelno: 39068 (4/20/2017)
current max frameno 9680 (4/20/2017)
 
 
Disregard what is below, this is now believed to be incorrect:
For USPTO Assignment Data, there is a script, under McNair/usptoAssignment, called USPTO_Assignee_Download, which lets a user pass it a text file (file ending in.txt) which contains the url(s) of the assignment data that needs to be downloaded. The script then downloads all the zip files available at that URL. An example called BaseUrls.txt (containing the url that you will probably be using to download the assignment data, unless you're downloading the data from this currrent year, which is in a different link) can be found in McNair/usptoAssignment It then places the downloaded zip files in "E:/McNair/usptoAssigneeData/name", where "name" is the name of the file. If you want to check which files have already been processed, check "McNair/usptoAssigneeData/Finished" to see the finished zip files. (In the future, this should be updated, if possible to specify which years to download, since all assignment data that is not from this current year is under one url, and we've already downloaded most of it.)

Navigation menu