Changes

Jump to navigation Jump to search
which files have already been processed, check "McNair/usptoAssigneeData/Finished" to see the finished zip files.
====For the Main Patent Data=== The equivalent for patent data is called "USPTO_Parser" and can be found under E:/McNair/Software/Scripts/Patent Notes:*You may need to fix the base url in the script. The USPTO has changed it before.It is currently: https://bulkdata.uspto.gov/data2/patent/grant/redbook*Instead of taking a url, as the USPTO_Assignee Download does, ir it takes two arguments, year 1 and year2, which are supposed to represent the range of data that you wish to download (for example, 2015 to 2016).*The perl script places the downloaded zip files into "E:/McNair/PatentData/name" where "name" is the name of the zip file. This location is not quite accurate - the files are actually stored in what appears to be McNair/PatentData.*The folder "Processed" under McNair/PatentData appears to hold all the unzipped zip files that have been downloaded and processedalready. So if you are curious if some files have already been processed, you could look there. They are organized by year.
Now to actually run the scripts:
*Insert how one would do this once I figure it out, I searched online to try to troubleshoot Open a command line or powershellwhy I could not run e: cd .\McNair\Software\Scripts\Patent perl USPTO_Parser.pl 2016 2017 Note the scripts, but I couldn't figure it out 4zip files should appear briefly (sequentially) in E:/McNair/11Software/2017*Scripts/Patent before disappearing and reappearing unzipped in E:/McNair/PatentData
4) The next step would be to parse the actual files.
For the patent data files, based on the existing documentation, it looks like PatentParser, found in McNair/Software/Scripts/Patent,has to be run on each xml file that was downloaded and unzipped during the previous step. It then stores the parsed xml files all in a text file called "Results.txt" (which I assume will have to be deleted afterward). Thi This script utilizes the Claim.pm, Inventor.pm, PatentApplication.pm, and Loader.pm modules. It no
longer uses the AddressBook.pm module.

Navigation menu