Difference between revisions of "USPTO Patent Assignment Dataset"

From edegan.com
Jump to navigation Jump to search
(Created page with "{{McNair Projects |Has title=USPTO Patent Assignment Dataset |Has owner=Ed Egan, |Has keywords=Data |Has project status=Active }}")
 
Line 5: Line 5:
 
|Has project status=Active
 
|Has project status=Active
 
}}
 
}}
 +
 +
This project describes the build out and basic use of the USPTO Assignment Dataset.
 +
 +
The data is described in a USPTO Economic Working Paper by Marco, Myers, Graham and others: https://www.uspto.gov/sites/default/files/documents/USPTO_Patents_Assignment_Dataset_WP.pdf
 +
 +
 +
==Pre-load checks==
 +
 +
The data is large. We don't have space on the main dbase server for it.
 +
 +
df -h
 +
/dev/nvme1n1p2  235G  208G  15G  94% /var/postgresql
 +
 +
 +
Note: To check dbase space usage on the dbase server see [[Posgres_Server_Configuration#Size.2C_Backup_.26_Restore]].
 +
 +
The postgres dbase on the RDP, however, currently has more than 300Gb free and is on a solid state drive, so its performance should be acceptable.
 +
 +
==Getting the data==

Revision as of 13:30, 15 November 2017


McNair Project
USPTO Patent Assignment Dataset
Project logo 02.png
Project Information
Project Title USPTO Patent Assignment Dataset
Owner Ed Egan
Start Date
Deadline
Keywords Data
Primary Billing
Notes
Has project status Active
Copyright © 2016 edegan.com. All Rights Reserved.


This project describes the build out and basic use of the USPTO Assignment Dataset.

The data is described in a USPTO Economic Working Paper by Marco, Myers, Graham and others: https://www.uspto.gov/sites/default/files/documents/USPTO_Patents_Assignment_Dataset_WP.pdf


Pre-load checks

The data is large. We don't have space on the main dbase server for it.

df -h
/dev/nvme1n1p2  235G  208G   15G  94% /var/postgresql

Note: To check dbase space usage on the dbase server see Posgres_Server_Configuration#Size.2C_Backup_.26_Restore.

The postgres dbase on the RDP, however, currently has more than 300Gb free and is on a solid state drive, so its performance should be acceptable.

Getting the data