Patent Design Main Page

From edegan.com
Jump to navigation Jump to search


This is the main page for all the research, work, and design that has been put into working with the patent date. Currently, as of Fall 2017, Oliver Chang, Joe Reilly, and Shelby Bice are working on designing new patent and assignment databases (Shelby), creating a new parser and scripts to pull and parse the data (Oliver), and identifying all the paths within the XML files that lead to data that should be included in the databases(all three).

Project
Patent Design Main Page
Project logo 02.png
Project Information
Has title Patent Design Main Page
Has owner Shelby Bice, Joe Reilly, Oliver Chang
Has start date
Has deadline date
Has keywords patent
Has project status Active
Is dependent on Reproducible Patent Data, USPTO Patent Litigation Data, Small Inventors Project
Has sponsor McNair Center
Copyright © 2019 edegan.com. All Rights Reserved.

There has been lots of work on storing information about the patents in databases, including methods of cleaning the data, what data should be included, etc. Some of it is obsolete and some of it is incorrect. Generally, the newer pages are going to be the most relevant, but it can be helpful to see what is done in the past, especially since some methodology (like the cleaning the data) hasn't changed that much.

Joe's Work

Work (likely finished): Identified paths within the XML examples for utility, reissue, plant, and design patents, for versions 4.0-4.5, from E:\McNair\Projects\SimplerPatentData\data\examples\granted. Only the granted folder was done. Initially, some xpaths were saved in E:\McNair\Projects\SimplerPatentData\data\examples\Patent Schema Reconciliation as a text file also. Paths identified for the following nodes, from http://mcnair.bakerinstitute.org/wiki/Equivalent_XPath_and_APS_Queries, for all types and versions:


strings section:


PATENT_TYPE

TITLE

PCT_DOCUMENT_NUMBER

PATENT_COUNTRY

PATENT_NUMBER

PATENT_KIND

PATENT_GRANT_DATE

APPLICATION_NUMBER

APPLICATION_FILING_DATE

PRIORITY_CLAIMS_DATE

PRIORITY_CLAIMS_COUNTRY

PRIORITY_CLAIMS_PATENT_NUMBER

IPCR_SUBCLASS

IPCR_MAIN_GROUP

IPCR_SUB_GROUP

CPC_SUBCLASS

CPC_MAIN_GROUP

CPC_SUB_GROUP

CLASSIFICATION_NATIONAL_COUNTRY

CLASSIFICATION_NATIONAL_CLASS

PRIMARY_EXAMINER_FIRST_NAME

PRIMARY_EXAMINER_LAST_NAME

PRIMARY_EXAMINER_DEPARTMENT


numbers section:


NUMBER_OF_CLAIMS


applicants section


SEQUENCE

LAST_NAME

FIRST_NAME

ORG_NAME

CITY

COUNTRY

STATE

ADDRESS

POSTCODE


citations section


CITATION_DESCRIPTION

CITATION NUMBER

NPL CITATION NUMBER

COUNTRY

CITATIONS DOC NUMBER

CITATIONS KIND

CITATIONS NAME

CITATIONS DATE

SEQUENCE

LAST_NAME

FIRST_NAME

CITY

COUNTRY

STATE

ADDRESS

LAST_NAME

FIRST_NAME

ORG_NAME

CITY

COUNTRY

STATE

ADDRESS


lawyers section


SEQUENCE

FIRST_NAME

ORG_NAME

Shelby's Work

Most recent work (Redesigning the whole patent database and assignment database): This project page details my design for a new patent database and a new assignment database. http://mcnair.bakerinstitute.org/wiki/Redesign_Assignment_and_Patent_Database

Older work when we were going to modify the database: http://mcnair.bakerinstitute.org/wiki/Redesigning_Patent_Database It has a lot of information on the methodology we tried to use before to update the database, including our frustration with the perl scripts that pulled and parsed, the data, which eventually led to Oliver Chang writing new scripts for us.

This was done by a former McNair Center intern and subsumed by my current project. It includes information on how the assignment database should be formatted: http://mcnair.bakerinstitute.org/wiki/Patent_Assignment_Data_Restructure

Here are some somewhat outdated pages describing aspects of the patent data: