97 bytes added ,  16:01, 19 January 2017
Return to [[Patent Data (Wiki Page)]].
<section begin=dataverse /> The Harvard Dataverse provides datasets to which author disambiguation has already been applied. The data provided is the cleaned version of the U.S. utility patent database spanning 1975-2010. <section end=dataverse /> For details, see the paper at [https://dataverse.harvard.edu/dataset.xhtml?persistentId=hdl:1902.1/15705 Harvard Dataverse].
This page records how to load and use the Harvard Dataverse data. The patents from 1975-2010, provided as .sqlite3 and .csv files, can be found at [https://dataverse.harvard.edu/dataset.xhtml?persistentId=hdl:1902.1/15705 Harvard Dataverse]. Unlike the raw USPTO data from 1975-2010, this data is cleaned. In particular, author disambiguation has already been applied to the Harvard Dataverse datasets. For details, see the paper at [https://dataverse.harvard.edu/dataset.xhtml?persistentId=hdl:1902.1/15705 Harvard Dataverse]. All of the files have been downloaded to the database server and can be found under /bulk/patent (<code>cd /bulk/patent</code>).
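Since the data ships as .sqlite3 files, a quick way to see what a file contains is to list its tables and peek at a few rows. The following is a minimal sketch using Python's standard <code>sqlite3</code> module. The helper names <code>list_tables</code> and <code>preview_table</code>, and the file and table names in the commented usage lines, are assumptions made for illustration; this page does not state the actual file or table names.

```python
import sqlite3


def list_tables(db_path):
    """Return the names of all tables in a SQLite database file, sorted."""
    conn = sqlite3.connect(db_path)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        conn.close()
    return [row[0] for row in rows]


def preview_table(db_path, table, limit=5):
    """Return (column_names, rows) for the first `limit` rows of `table`."""
    conn = sqlite3.connect(db_path)
    try:
        # A table name cannot be bound as a query parameter, so quote it
        # as an SQL identifier instead (doubling any embedded quotes).
        quoted = '"' + table.replace('"', '""') + '"'
        cur = conn.execute(f"SELECT * FROM {quoted} LIMIT ?", (limit,))
        columns = [desc[0] for desc in cur.description]
        return columns, cur.fetchall()
    finally:
        conn.close()


# Hypothetical usage; the file name and table name below are placeholders,
# not names confirmed by this page:
#   tables = list_tables("/bulk/patent/example.sqlite3")
#   columns, rows = preview_table("/bulk/patent/example.sqlite3", tables[0])
```

Note that <code>sqlite3.connect</code> creates an empty database if the path does not exist, so it is worth checking the path first when an unexpectedly empty table list comes back.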
==Getting the data==
