Changes

Jump to navigation Jump to search
I'm currently looking at the DTD for USPTO patent data (which has to be opened by Microsoft Visual Studio) to ascertain if there are any fields we're not currently pulling from the bulk data that we should. I am using the following link to figure out how to read a DTD: http://www.ldodds.com/delta/dtd_guide.html
 
For example, USPTO patent data for 2005 seems to include multiple paragraphs for the abstract under an element called "abstract". I've included the line from the DTD below:
 
<!--A concise summary of the disclosure.-->
<!ELEMENT abstract (doc-page+ | (abst-problem , abst-solution) | p+)>
 
I will keep looking through DTDs and figure out how far back we have abstracts.
== Test Plan ==

Navigation menu