Difference between revisions of "Patent Data"

From edegan.com
(MarcelaInteriano moved page Patent Data (Wiki Page) to Patent Data Wiki Page)
Line 1: Line 1:
#REDIRECT [[Patent Data Wiki Page]]
+
{{McNair Projects
 +
|Project Title=Patent Data (Wiki Page)
 +
|Topic Area=Resources and Tools
 +
|Start Term=Spring 2016
 +
|Status=Tabled
 +
|Priority=3
 +
|Due Date=NA
 +
|Stage=DRAFTED
 +
|Deliverable=Wiki Page
 +
|Audience=McNair staff
 +
|Keywords=Patent, Database,
 +
|Primary Billing=AccMcNair01
 +
}}
 +
==Data Background==
 +
 
 +
This project is concerned with maintaining and updating patent data so that McNair Center staff can extract meaningful data for academic papers and reports. Currently, there are two primary sources for this data: the US Patent and Trademark Office (USPTO) and the Harvard Dataverse. Data from the Lex Machina online database may be added to cover [[Guide to Patent Litigation (Wiki Page) | patent litigation]]. All acquired data is stored in normalized tables that are accessed and modified using SQL.
 +
 
 +
The patent data has been separated into multiple databases based on data source or subject matter. Each database consists of several tables for which the known issues have been recorded. The [[Patent|Patent Database]] contains the USPTO bulk data and Harvard Dataverse datasets, merged using SQL. Specifics on how the datasets were merged are given in [[Patent Data Processing - SQL Steps]]. The Patent Database focuses on patents, patent litigation, patent maintenance, patent assignment, and other details on patent owners. The [[USPTOAssigneesData|USPTO Assignees Database (version 2)]] focuses on patent assignments: transactions between one or more patent owners and one or more other parties in which ownership of, or interest in, one or more patents is assigned or shared. The database consists of historical assignment data provided by the USPTO in XML files. Specifics on this database are given on the [[USPTO Assignees Data Processing]] page.
 +
 
 +
==Sources of Data==
 +
 
 +
===[[USPTO Bulk Data]]===
 +
 
 +
 
 +
===[[Harvard Dataverse]]===
 +
 
 +
===[[Venture Xpert]]===
 +
===[[Lex Machina]]===
 +
===[[Data Model]]===
 +
 
 +
==Academic Projects==
 +
 
 +
===[[Little Guy Academic Paper|'Little Guy' Academic Paper]]===
 +
The first application of the refined database will be the [[Little Guy Academic Paper]]. {{#section:Little_Guy_Academic_Paper|Little Guy}}
 +
===Patent Trolls===
 +
 
 +
Academic Paper: The patent database will also be used to explore the existence of patent trolls and their characteristic litigation activity. An academic paper may be developed that defines patent trolls and distinguishes them from other entities often confused with patent trolls. The data from Lex Machina will be used to track troll behavior and associated outcomes, as well as the impact of other patent intermediary and assertion bodies.
 +
 
 +
Issue Brief: Based on an analysis of the litigation data from Lex Machina, an issue brief on patent troll activity, tentatively titled [[The Truth Behind Patent Trolls Issue Brief| The Truth Behind Patent Trolls]], may be written to report on how best to curb abuses through [[Innovation Policy| innovation policy]] and reform.
 +
 
 +
<includeonly>
 +
[[Category: McNair Projects]]
 +
[[Category: Patents]]
 +
</includeonly><!-- flush -->
 +
[[Category:Internal]]

Revision as of 17:28, 28 October 2016


McNair Project
Patent Data


Data Background

This project is concerned with maintaining and updating patent data so that McNair Center staff can extract meaningful data for academic papers and reports. Currently, there are two primary sources for this data: the US Patent and Trademark Office (USPTO) and the Harvard Dataverse. Data from the Lex Machina online database may be added to cover patent litigation. All acquired data is stored in normalized tables that are accessed and modified using SQL.

The patent data has been separated into multiple databases based on data source or subject matter. Each database consists of several tables for which the known issues have been recorded. The Patent Database contains the USPTO bulk data and Harvard Dataverse datasets, merged using SQL. Specifics on how the datasets were merged are given in Patent Data Processing - SQL Steps. The Patent Database focuses on patents, patent litigation, patent maintenance, patent assignment, and other details on patent owners. The USPTO Assignees Database (version 2) focuses on patent assignments: transactions between one or more patent owners and one or more other parties in which ownership of, or interest in, one or more patents is assigned or shared. The database consists of historical assignment data provided by the USPTO in XML files. Specifics on this database are given on the USPTO Assignees Data Processing page.
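
As a rough illustration of how two normalized sources might be combined with SQL, the sketch below uses Python's built-in sqlite3 module with an in-memory database. The table names, column names, and sample rows are hypothetical and are not taken from the actual Patent Database; the real merge steps are the ones documented in Patent Data Processing - SQL Steps.

```python
import sqlite3

# Hypothetical schema and sample rows, for illustration only.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
CREATE TABLE uspto_patents (patent_number TEXT PRIMARY KEY, title TEXT);
CREATE TABLE dataverse_patents (patent_number TEXT PRIMARY KEY, assignee TEXT);
""")
cur.executemany(
    "INSERT INTO uspto_patents VALUES (?, ?)",
    [("7000001", "Widget"), ("7000002", "Gadget")],
)
cur.executemany(
    "INSERT INTO dataverse_patents VALUES (?, ?)",
    [("7000001", "Acme Corp"), ("7000003", "Globex")],
)


def merge_sources(cursor):
    """Join the two hypothetical tables on patent number.

    A LEFT JOIN keeps every row from uspto_patents; the assignee is
    None when the second source has no matching patent number.
    """
    cursor.execute("""
        SELECT u.patent_number, u.title, d.assignee
        FROM uspto_patents AS u
        LEFT JOIN dataverse_patents AS d
            ON d.patent_number = u.patent_number
        ORDER BY u.patent_number
    """)
    return cursor.fetchall()


merged = merge_sources(cur)
for row in merged:
    print(row)
```

A left join is used here only so that unmatched records stay visible; whether the real merge keeps or drops unmatched patents is not stated on this page.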

Sources of Data

USPTO Bulk Data

Harvard Dataverse

Venture Xpert

Lex Machina

Data Model

Academic Projects

'Little Guy' Academic Paper

The first application of the refined database will be the Little Guy Academic Paper.

Patent Trolls

Academic Paper: The patent database will also be used to explore the existence of patent trolls and their characteristic litigation activity. An academic paper may be developed that defines patent trolls and distinguishes them from other entities often confused with patent trolls. The data from Lex Machina will be used to track troll behavior and associated outcomes, as well as the impact of other patent intermediary and assertion bodies.

Issue Brief: Based on an analysis of the litigation data from Lex Machina, an issue brief on patent troll activity, tentatively titled The Truth Behind Patent Trolls, may be written to report on how best to curb abuses through innovation policy and reform.