Difference between revisions of "Equivalent XPath and APS Queries"

From edegan.com
Jump to navigation Jump to search
Line 4: Line 4:
 
|Has start date=June 2017
 
|Has start date=June 2017
 
|Has deadline=June 2017
 
|Has deadline=June 2017
|Has project status=Active
+
|Has project status=Tabled
 
|Depends upon it=Reproducible Patent Data
 
|Depends upon it=Reproducible Patent Data
 
}}
 
}}

Revision as of 13:16, 24 June 2017


McNair Project
Equivalent XPath and APS Queries
Project logo 02.png
Project Information
Project Title Equivalent XPath and APS Queries
Owner Oliver Chang, Ed Egan
Start Date June 2017
Deadline June 2017
Primary Billing
Notes
Has project status Tabled
Copyright © 2016 edegan.com. All Rights Reserved.



Current Status for Granted Patents implementation


How it works currently

Only has the intelligence from E:\McNair\PatentData\Processed\xmlparser_4.5_4.4_4.3.pl. That is to say, it only explicitly covers versions 4.3, 4.4, and 4.5 utility patents. Plant, reissue, and design patents contain a nonempty intersection with these attributes but have their own quirks. The actual code to do this XML parsing is at E:\McNair\Projects\SimplerPatentData\src\main\java\org\bakerinstitute\mcnair\uspto_granted\XmlParser.java with the end goal to create an in-memory representation of a granted patent as a E:\McNair\Projects\SimplerPatentData\src\main\java\org\bakerinstitute\mcnair\models\GrantedPatent.java data structure.

To learn about the fields that a model contains, look at the class Model.Metadata which should implement interface TableMetadata. In particular, this bunch of constant data describes the mapping of enum fields to table column names as well as the types of columns.

For an example, we can tell that GrantedPatent is a struct that contains the following data:

private final Map<GrantedPatent.Fields, String> strings;
private final Map<GrantedPatent.Fields, Double> numbers;
private final List<Citation> citations;
private final List<Sciref> scirefs;
private final List<Inventor> inventors;
private final List<AssignmentSummary> assignments;
private final List<Lawyer> lawyers;

Query Equivalences

strings

  • PATENT_TYPE
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/application-reference/@appl-type
    • APS
      • PATN:APT
  • TITLE
    • XML 4.3, 4.4, 4.5
      • //invention-title
    • APS
      • PATN:TTL
  • PCT_DOCUMENT_NUMBER
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/pct-or-regional-filing-data/document-id/doc-number
    • APS
      • PCTA:PCN
  • PATENT_COUNTRY
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/publication-reference/document-id/country
    • APS
      •  ?
  • PATENT_NUMBER
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/publication-reference/document-id/doc-number
    • APS
      • PATN:WKU
  • PATENT_KIND
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/publication-reference/document-id/doc-number/kind
    • APS
      •  ?
  • PATENT_GRANT_DATE
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/publication-reference/document-id/date
    • APS
      • PATN:ISD
  • APPLICATION_NUMBER
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/application-reference/document-id/doc-number
    • APS
      • PATN:APN
  • APPLICATION_FILING_DATE
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/application-reference/document-id/date
    • APS
      • PATN:APD
  • PRIORITY_CLAIMS_DATE
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/priority-claims/priority-claim/date
    • APS
      • PRIR:APD
  • PRIORITY_CLAIMS_COUNTRY
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/priority-claims/priority-claim/country
    • APS
      • PRIR:CNT
  • PRIORITY_CLAIMS_PATENT_NUMBER
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/priority-claims/priority-claim/doc-number
    • APS
      • PRIR:APN
  • IPCR_SUBCLASS
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/classifications-ipcr/classification-ipcr/section + //us-bibliographic-data-grant/classifications-ipcr/classification-ipcr/class + //us-bibliographic-data-grant/classifications-ipcr/classification-ipcr/subclass
    • APS
      •  ?
  • IPCR_MAIN_GROUP
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/classifications-ipcr/classification-ipcr/main-group
    • APS
      •  ?
  • IPCR_SUB_GROUP
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/classifications-ipcr/classification-ipcr/subgroup
    • APS
      •  ?
  • CPC_SUBCLASS
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/classifications-cpc/main-cpc/classification-cpc/section + //us-bibliographic-data-grant/classifications-cpc/main-cpc/classification-cpc/class + //us-bibliographic-data-grant/classifications-cpc/main-cpc/classification-cpc/subclass
    • APS
      •  ?
  • CPC_MAIN_GROUP
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/classifications-cpc/main-cpc/classification-cpc/main-group
    • APS
      •  ?
  • CPC_SUB_GROUP
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/classifications-cpc/main-cpc/classification-cpc/subgroup
    • APS
      •  ?
  • CLASSIFICATION_NATIONAL_COUNTRY
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //us-bibliographic-data-grant/classification-national/country
      • //us-bibliographic-data-grant/us-field-of-classification-search/classification-national/country
    • APS
      •  ?
  • CLASSIFICATION_NATIONAL_CLASS
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //us-bibliographic-data-grant/classification-national/main-classification
      • //us-bibliographic-data-grant/us-field-of-classification-search/classification-national/country
    • APS
      •  ?
  • PRIMARY_EXAMINER_FIRST_NAME
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/examiners/primary-examiner/first-name
    • APS
      • PATN:EXP (whole name)
  • PRIMARY_EXAMINER_LAST_NAME
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/examiners/primary-examiner/last-name
    • APS
      • N/A (PATN:EXP has whole name)
  • PRIMARY_EXAMINER_DEPARTMENT
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/examiners/primary-examiner/department
    • APS
      •  ?

numbers

  • NUMBER_OF_CLAIMS
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/number-of-claims
    • APS
      • PATN:NCL

citations (all are strings)

  • SEQUENCE
    • XML 4.3, 4.4, 4.5
      • //us-parties/inventors/inventor/sequence
    • XML 4.0, 4.1, 4.2
      • //parties/applicants/applicant/sequence
    • APS
      •  ?
  • LAST_NAME
    • XML 4.3, 4.4, 4.5
      • //us-parties/inventors/inventor/addressbook/last-name
    • XML 4.0, 4.1, 4.2
      • //parties/applicants/applicant/addressbook/last-name
    • APS
      • N/A (see ORG_NAME)
  • FIRST_NAME
    • XML 4.3, 4.4, 4.5
      • //us-parties/inventors/inventor/addressbook/first-name
    • XML 4.0, 4.1, 4.2
      • //parties/applicants/applicant/addressbook/first-name
    • APS
      • N/A (see ORG_NAME)
  • ORG_NAME
    • XML 4.3, 4.4, 4.5
      • //us-parties/inventors/inventor/addressbook/orgname
    • XML 4.0, 4.1, 4.2
      • //parties/applicants/applicant/addressbook/orgname
    • APS
      • INVT:NAM (whole name, might be of an individual)
  • CITY
    • XML 4.3, 4.4, 4.5
      •  ?
    • XML 4.0, 4.1, 4.2
      •  ?
    • APS
      • INVT:CTY
  • COUNTRY
    • XML 4.3, 4.4, 4.5
      •  ?
    • XML 4.0, 4.1, 4.2
      •  ?
    • APS
      • INVT:CNT
  • STATE
    • XML 4.3, 4.4, 4.5
      •  ?
    • XML 4.0, 4.1, 4.2
      •  ?
    • APS
      • INVT:STA
  • ADDRESS
    • XML 4.3, 4.4, 4.5
      •  ?
    • XML 4.0, 4.1, 4.2
      •  ?
    • APS
      • INVT:STR
  • POSTCODE
    • XML 4.3, 4.4, 4.5
      •  ?
    • XML 4.0, 4.1, 4.2
      •  ?
    • APS
      • INVT:ZIP

scirefs (all are strings)

  • CITATION_DESCRIPTION
    • XML 4.3, 4.4, 4.5
      • //us-bibliographic-data-grant/us-references-cited/us-citation/nplcit/othercit
    • XML 4.0, 4.1, 4.2
      • //us-bibliographic-data-grant/references-cited/citation/nplcit/othercit
    • APS
      •  ?

inventors (all are strings)

  • SEQUENCE
    • XML 4.3, 4.4, 4.5
      • //us-parties/inventors/inventor/sequence
    • XML 4.0, 4.1, 4.2
      • //parties/applicants/applicant/sequence
  • LAST_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //us-parties/inventors/inventor/addressbook/last-name
      • //us-parties/inventors/inventor/last-name
    • XML 4.0, 4.1, 4.2 (first listed is highest preference)
      • //parties/applicants/applicant/addressbook/last-name
      • //parties/applicants/applicant/last-name
    • APS
      • N/A (see ORG_NAME)
  • FIRST_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //us-parties/inventors/inventor/addressbook/first-name
      • //us-parties/inventors/inventor/first-name/
    • XML 4.0, 4.1, 4.2 (first listed is highest preference)
      • //parties/applicants/applicant/addressbook/first-name
      • //parties/applicants/applicant/first-name
    • APS
      • N/A (see ORG_NAME
  • ORG_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //us-parties/inventors/inventor/addressbook/orgname
      • //us-parties/inventors/inventor/orgname/
    • XML 4.0, 4.1, 4.2 (first listed is highest preference)
      • //parties/applicants/applicant/addressbook/orgname
      • //parties/applicants/applicant/orgname
    • APS
      • INVT:NAM
  • CITY
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • INVT:CTY
  • COUNTRY
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • INVT:CNT
  • STATE
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • INVT:STA
  • ADDRESS
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • INVT:STR
  • POSTCODE
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • INVT:ZIP

assignments (all are strings)

  • LAST_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //assignees/assignee/addressbook/last-name
      • //assignees/assignee/last-name
    • APS
      • N/A (see ORG_NAME)
  • FIRST_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //assignees/assignee/addressbook/first-name
      • //assignees/assignee/first-name/
    • APS
      • N/A (see ORG_NAME)
  • ORG_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //assignees/assignee/addressbook/orgname
      • //assignees/assigneeorgname/
    • APS
      • ASSG:NAM
  • CITY
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • ASSG:CTY
  • COUNTRY
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • ASSG:CNT
  • STATE
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • ASSG:STA
  • ADDRESS
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • ASSG:STR
  • POSTCODE
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • ASSG:ZIP

lawyers (all are strings)

Note in the below queries, for XML results only those XML agent nodes with rep-type="attorney" are looked at.

  • SEQUENCE
    • XML 4.3, 4.4, 4.5
      • //us-parties/agents/agent/sequence
    • XML 4.0, 4.1, 4.2
      • //parties/agents/agent/sequence
  • LAST_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //us-parties/agents/agent/addressbook/last-name
      • //us-parties/agents/agent/last-name
    • XML 4.0, 4.1, 4.2 (first listed is highest preference)
      • //parties/applicants/applicant/addressbook/last-name
      • //parties/applicants/applicant/last-name
    • APS
      • N/A (see FIRST_NAME)
  • FIRST_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //us-parties/agents/agent/addressbook/first-name
      • //us-parties/agents/agent/first-name/
    • XML 4.0, 4.1, 4.2 (first listed is highest preference)
      • //parties/agents/agent/addressbook/first-name
      • //parties/agents/agent/first-name
    • APS
      • LREP:NAM (whole name of an individual)
  • ORG_NAME
    • XML 4.3, 4.4, 4.5 (first listed is highest preference)
      • //us-parties/agents/agent/addressbook/orgname
      • //us-parties/agents/agent/orgname/
    • XML 4.0, 4.1, 4.2 (first listed is highest preference)
      • //parties/agents/agent/addressbook/orgname
      • //parties/agents/agent/orgname
    • APS
      • LREP:FRM
  • CITY
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • LREP:CTY
  • COUNTRY
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • LREP:CNT
  • STATE
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • LREP:STA
  • ADDRESS
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • LREP:STR
  • POSTCODE
    • XML 4.3, 4.4, 4.5
      •  ?
    • APS
      • LREP:ZIP

Examples

  • APS

The first APS entry of E:\McNair\Projects\SimplerPatentData\data\extracts\granted\pftaps19760106_wk01.txt as a GrantedPatent is

strings: {PCT_DOCUMENT_NUMBER=null, PATENT_NUMBER=RE0286710, PATENT_GRANT_DATE=19760106, TITLE=Hydrophone damper assembly, APPLICATION_FILING_DATE=19740826, PRIMARY_EXAMINER_FIRST_NAME=Blix; Trygve M., PATENT_TYPE=2, APPLICATION_NUMBER=500649&, PRIORITY_CLAIMS_DATE=null, PRIORITY_CLAIMS_COUNTRY=null, PRIORITY_CLAIMS_PATENT_NUMBER=null}
integers: {NUMBER_OF_CLAIMS=18}
extras: {class org.bakerinstitute.mcnair.models.Citation=[], class org.bakerinstitute.mcnair.models.Sciref=[], class org.bakerinstitute.mcnair.models.Inventor=[strings: {STATE=MI, ORG_NAME=Widenhofer; James W., COUNTRY=null, CITY=Jackson, ADDRESS=null, POSTCODE=null, CITING_PATENT=RE0286710}
integers: {}], class org.bakerinstitute.mcnair.models.AssignmentSummary=[strings: {COUNTRY=null, CITING_PATENT=RE0286710, CITY=Jackson, POSTCODE=null, ORG_NAME=Sparton Corporation, STATE=MI}
integers: {}], class org.bakerinstitute.mcnair.models.Lawyer=[strings: {FIRST_NAME=null, CITY=null, POSTCODE=null, COUNTRY=null, STATE=null, CITING_PATENT=RE0286710, ORG_NAME=Beaman & Beaman, ADDRESS=null}
integers: {}]}
  • XML 4.5

The first XML entry of E:\McNair\Projects\SimplerPatentData\data\extracts\granted\ipg161227.xml as a GrantedPatent is

strings: {CLASSIFICATION_NATIONAL_COUNTRY=US, PCT_DOCUMENT_NUMBER=, PATENT_COUNTRY=US, APPLICATION_FILING_DATE=20150422, PRIMARY_EXAMINER_FIRST_NAME=Karen, PATENT_NUMBER=, PRIMARY_EXAMINER_LAST_NAME=Kearney, PATENT_KIND=, CLASSIFICATION_NATIONAL_CLASS=D 2614, PATENT_TYPE=design, PATENT_GRANT_DATE=20161227, APPLICATION_NUMBER=29524659, PRIMARY_EXAMINER_DEPARTMENT=2923, TITLE=Smart sports glove}
integers: {NUMBER_OF_CLAIMS=1}
extras: {class org.bakerinstitute.mcnair.models.Citation=[strings: {CITED_PATENT_NUMBER=4051552, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=A, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=6016103, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=A, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=D512549, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=S, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=9189022, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=B2, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=2009/0323316, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=A1, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=2012/0069552, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=A1, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=2012/0081884, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=A1, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=2013/0184092, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=A1, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=2015/0157065, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=A1, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}, strings: {CITED_PATENT_NUMBER=2015/0237181, CITED_PATENT_CATEGORY=, CITED_PATENT_KIND=A1, CITING_PATENT=D0774723, CITED_PATENT_COUNTRY=US}
integers: {}], class org.bakerinstitute.mcnair.models.Sciref=[strings: {CITING_PATENT=D0774723, CITATION_DESCRIPTION=Cheat Code Central | Extra | Predictions | Video Game Foresight—Power Glove 2, posted on Feb. 10, 2011, © 1997-2016 Cheat Code Central [online], [site visited Aug. 15, 2016]. Available from Internet, <URL: http://www.cheatcc.com/extra/predictions-powerglove.html>.}
integers: {}, strings: {CITING_PATENT=D0774723, CITATION_DESCRIPTION=Lazure | 2014: A bleeding edge effort in chips . . . , posted on Dec. 11, 2014, no copyright date posted [online], [site visited Aug. 17, 2016]. Available from Internet, <URL: https://lazure2.wordpress.com>.}
integers: {}, strings: {CITING_PATENT=D0774723, CITATION_DESCRIPTION=SoyaCincau | 2014 | 04 | 01 | Samsung Fingers Smart Glove with Flexible Display, posted on Apr. 1, 2014, © 2016 SoyaCincau.com [online], [site visited Aug. 17, 2016]. Available from Internet, <URL: http://www.soyacincau.com/2014/04/01/samsung-fingers-smart-glove-with-flexible-display/>.}
integers: {}, strings: {CITING_PATENT=D0774723, CITATION_DESCRIPTION=Tech Ticking | 2014 | 12 | Top 10 of the Best 2014 Wearables, posted by Rami Raouagi on Dec. 19, 2014, © 2016 Tech Ticking [online], [site visited Aug. 17, 2016]. Available from Internet, <URL: http://www.techticking.com/2014/12/top-10-of-best-2014-wearables.html/>.}
integers: {}], class org.bakerinstitute.mcnair.models.Inventor=[strings: {ORG_NAME=, CITING_PATENT=D0774723, LAST_NAME=Daniel, FIRST_NAME=Isaac S., SEQUENCE=}
integers: {}], class org.bakerinstitute.mcnair.models.AssignmentSummary=[], class org.bakerinstitute.mcnair.models.Lawyer=[]}