Changes

Jump to navigation Jump to search
2,869 bytes added ,  17:53, 1 March 2017
===Citations Table===
<section begin=citation />
The table has two columns, 'citingpatentnumber' and 'citedpatentnumber'. There are rows with 'citedpatentnumber' greater than 10000000. For instance:
The foreign cited patents will be moved to a separate table.
 
<section end=citation />
===Assignees Assigneeinfo Table===<section begin=assignee/>
'Country' is missing.
allpatent_clone=# SELECT COUNT(*) FROM assignees assigneeinfo WHERE country=' '; count ---------
2361543
(1 row) 
allpatent_clone=# SELECT COUNT(*) FROM assignees assigneeinfo WHERE country='unknown';
count
-------
(1 row)
allpatent_clone=# SELECT COUNT(*) FROM assignees assigneeinfo WHERE country IN ('unknown',' ') AND state=' ';
count
---------
UPDATE: The source of the problem seems to be the Harvard Dataverse.
 <section end=assignee />
===Assignees & assigneesUSU Tables===
<section begin=assignee2 />
No information provided about the assignee. No entries for orgname or first and last names.
344793
(1 row)
<section end=assignee2 />
 
===HistPatent Table===
<section begin="histpatent" />
The USPTO bulk data contains negative patent numbers.
 
applicationid | pubno | patent | nber | uspc | uspc_sub | applicationdate | prioritydate | pubdate | displaydate | disptype | exp_dt | exp_dt_max | pta
--------------+-------+----------+------+------+----------+-----------------+--------------+---------+-------------+----------+------------+------------+------
8466602 | | -2037052 | 23 | 347 | 259 | 1995-06-06 | 1995-06-06 | | 1997-06-19 | ABN | | | 0
8605804 | | -2962913 | 70 | 395 | 500 | 1996-02-23 | 1996-02-23 | | 1998-08-22 | ABN | | | 0
| | 1332054 | 63 | 112 | 153 | | | | 1920-02-24 | ISS | 1937-02-23 | | 0
 
Application numbers are 8 digits long (99/999999). The first two digits are a series code and the last six represent a serial code assigned by the USPTO. The histpatent table has over 3 million application numbers with 7 digits, since leading zeros were dropped. Leading zeros were also dropped for patent numbers.
 
patent=# SELECT COUNT(*) FROM histpatent WHERE applicationid > 9999999;
count
---------
4048050
(1 row)
 
patent=# SELECT COUNT(*) FROM histpatent WHERE applicationid <9999999;
count
---------
3028846
(1 row)
 
<section end="histpatent />
 
===LexJudge===
<section begin="lexjudge" />
There are duplicate entries for certain judges that may be dropped once data on patent litigation has been added.
 
name | court | count
-----------------------------+-------+-------
Malcolm Jones Howard | EDNC | 2
Richard Leroy Williams | EDVa | 2
Peter Jo Messitte | DMd | 2
Andre Maurice Davis | DMd | 2
Paula Xinis | DMd | 2
Mary Hannah Lauck | EDVa | 2
Julie E. Carnes | NDGa | 2
Claude M. Hilton | EDVa | 2
William D. Quarles Jr. | DMd | 2
James C. Dever III | EDNC | 2
Catherine C. Blake | DMd | 2
Gerald Bruce Lee | EDVa | 2
Leonie M. Brinkema | EDVa | 2
<section end="lexjudge" />
 
===PatentId===
<section begin="patentid" />
The patentid table has many entries that do not consist of only integers.
 
SELECT COUNT(*) FROM patentid WHERE patentid ~ '[A-Z]';
count
-------
775982
 
SELECT COUNT(*) FROM patentid WHERE patentid ~ '[^A-Z]';
count
--------
27294418
<section end="patentid" />
[[Category: Internal]]
[[Internal Classification: Data Tables| ]]

Navigation menu