Changes

Jump to navigation Jump to search
==Patents ==
'''Patentdata Schema:'''
 '''Patentdata Schema:''' Column | Type | Modifiers- --------+-------------------+-----------
patent | integer |
kind | character varying |
appyear | integer |
  '''Patent_2015 Schema:'''
Column | Type | Modifiers
filename | varchar |
''' Overlapping Columns '''
patent_data patent_2015
--------------+-------------
patent | patentnumber
kind | kind
claims | numberofclaims
apptype | type
appnum | applicationnumber
gdate | grantdate
appdate | filingdate
'''Combined Schema:'''
Column NamesThe final schema of the patents table is : patent int, kind varchar, claims int, apptype int, appnum int, gdate date, gyear int, appdate date, appyear int
Column | Type | Modifiers
----------------------+-------------------+-----------
patent | integer | not null
grantdate | date |
prioritydate | date |
prioritycountry | character varying |
prioritypatentnumber | character varying |
cpcsubgroup | character varying |
pctpatentnumber | character varying |
claims | integer |
appnum | integer |
gyear | integer |
appdate | date |
appyear | integer |
nber | integer |
uspc | character varying |
uspc_sub | character varying |
From the total list of columns belonging to both the tables (patentdata and patent_2015), a few columns, most of them related to classification of patents, have been dropped since the data in the tables was not clean.
patentnumber intAdditionally, three columns -- patent kind varchar, -- kind grantdate datenber, --gdate type varcharuspc, applicationnumber varcharuspc_sub have been added from the historicalpatentdata, filingdate date, prioritydate date, prioritycountry varchar, prioritypatentnumber varchar, ussubclass varchar, maingroup varchar, subgroup varchar, cpcsubclass varchar, cpcmaingroup varchar, cpcsubgroup varchar, classificationnationalcountry varchar, classificationnationalclass varchar, title varchar, numberofclaims int, primaryexaminerfirstname varchar, primaryexaminerlastname varchar, primaryexaminerdepartment varchar, pctpatentnumber varchar, filename varchar claims int, apptype int, appnum int, gyear int, appdate date, appyear int Output Schema: patents CREATE TABLE patents_merged( patentnumber int, kind varchar, grantdate date, type varchar, applicationnumber varchar, filingdate date, prioritydate date, prioritycountry varchar, prioritypatentnumber varchar, ussubclass varchar, maingroup varchar, subgroup varchar, cpcsubclass varchar, cpcmaingroup varchar, cpcsubgroup varchar, classificationnationalcountry varchar, classificationnationalclass varchar, title varchar, numberofclaims int, primaryexaminerfirstname varchar, primaryexaminerlastname varchar, primaryexaminerdepartment varchar, pctpatentnumber varchar, filename varchar, claims int, apptype int, appnum int, gyear int, appdate date, appyear int );a table built from data downloaded from the USPTO Bulk Data Storage. The join was executed on the patent number.
==== Index and Key Creation ====

Navigation menu