Changes
Jump to navigation
Jump to search
← Older edit
Patent Data Extraction Scripts (Tool)
(view source)
Revision as of 13:47, 21 September 2020
2,214 bytes added
,
13:47, 21 September 2020
no edit summary
{{
Project
|Has project output=Tool
|Has sponsor=
McNair
Projects
Center
|Has title=Patent Data Extraction Scripts (Tool)
|Has owner=Marcela Interiano,
|Has project status=Subsume
|Has keywords=Tool
}}
===Patent applications===
Note that our application data appears to be ONLY utility patents, except for a few plant patents.
At the top level, in spec 4.0 (and presumably others) there are:
<us-patent-application lang="EN" dtd-version="v4.0 2004-12-02" file="US20050000001A1-20050106.XML"
status="PARALLEL-RUN" id="us-patent-application" country="US" date-produced="20041222" date-publ="20050106">
<us-bibliographic-data-application lang="EN" country="US">
...
</us-bibliographic-data-application>
<abstract id="abstract">
</abstract>
<drawings id="DRAWINGS">
</drawings>
<description id="description">
<?summary-of-invention description="Summary of Invention" end="lead"?>
<?summary-of-invention description="Summary of Invention" end="tail"?>
<?brief-description-of-drawings description="Brief Description of Drawings" end="lead"?>
<?brief-description-of-drawings description="Brief Description of Drawings" end="tail"?>
<?detailed-description description="Detailed Description" end="lead"?>
<?detailed-description description="Detailed Description" end="tail"?>
</description>
<claims id="claims">
</claims>
</us-patent-application>
We are currently processing only:
<us-bibliographic-data-application lang="EN" country="US">
...
</us-bibliographic-data-application>
===Utility patent grants fields===
The XML files for patent data are available at
Each XML file contains, in order, sorted by document ID:
#Design patents
(we will discard)
#Plant patents
(we will discard)
#Reissues
(we probably want them)
#Utility patents
(we want them)
====Overview====
DESIGN Patents:
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE us-patent-grant SYSTEM "us-patent-grant-v45-2014-04-03.dtd" [ ]>
<us-patent-grant lang="EN" dtd-version="v4.5 2014-04-03" file="USD0774273-20161220.XML"
status="PRODUCTION" id="us-patent-grant" country="US" date-produced="20161205" date-publ="20161220">
<us-bibliographic-data-grant>
</us-bibliographic-data-grant>
<drawings id="DRAWINGS">
</drawings>
<description id="description">
<?brief-description-of-drawings description="Brief Description of Drawings" end="lead"?>
<description-of-drawings>
</description-of-drawings>
<?brief-description-of-drawings description="Brief Description of Drawings" end="tail"?>
</description>
<us-claim-statement>CLAIM</us-claim-statement>
<claims id="claims">
</claims>
</us-patent-grant>
====Patent====
<onlyinclude>
I have also downloaded all of them on to the database server and can be found by
cd /bulk/patent
[[Category:Patent]]
Ed
Bureaucrats
,
Interface administrators
,
Administrators (Semantic MediaWiki)
,
Administrators
7,613
edits
Navigation menu
Personal tools
Log in
Request account
Namespaces
Page
Discussion
Variants
Views
Read
View source
View history
More
Search
Navigation
Sites
Wiki
Articles
Sections
Projects
Papers in Development
Paper Reviews
Team Members
Legislation
Research Computing
Organizations
Incubator Project
McNair Center
Berkeley's BPP Group
NBER Patent Data
Help
General help
Team help
Administration
Access RDP Server
Batch Upload Files
Tools
Special pages
Printable version