Changes

Jump to navigation Jump to search
#* Aim to create a completely naive schema with as few constraints as possible--iteratively add more constraints in the future
It takes about 3Since writing raw SQL is a bit cumbersome and error-5 minutes prone, I have added some abstraction layers that make it much easier to quickly add each batch of patentsbulk data. By using Postgres's <code>CopyManager</code> class, we buffer SQL copy commands in memory (as many as possible) and then flush these rows. That isTo understand how the abstraction layers work, see the naive serial implementation with code in <code>COPYE:\McNair\Projects\SimplerPatentData\src\main\java\org\bakerinstitute\mcnair\postgres</code> operates at about 1000 patents per minute along with citations. For a concrete example, assignorssee <code>E:\McNair\Projects\SimplerPatentData\src\main\java\org\bakerinstitute\mcnair\uspto_assignments\GeonamesZips.java</code> for a simple, lawyers, etcself-contained example or <code>E:\McNair\Projects\SimplerPatentData\src\main\java\org\bakerinstitute\mcnair\models\GrantedPatent.java</code> for an example of how to extend the abstraction layer to deal with more complex scenarios.
== Address Data ==

Navigation menu