Changes

Jump to navigation Jump to search
# <del>Database Insert (modify <code>models/</code> files with some mapping to database fields)</del> ''done''
# <del>Create tooling for minions</del> ''skipped''
# <del>Investigate parallel speedup (e.g. multithread, mmap)</del> ''done''
# Create XPath queries for reissue, design patents (only utility right now)
# Create semantic parser for APS files
# Data Cleanup (reference [[Patent_Assignment_Data_Restructure|Marcela and Sonia's work]])
# Investigate parallel speedup (e.g. multithread, mmap)
# Data Source Merger (''only USPTO granted, maintfee, assignment'' not USPTO applications or Harvard Dataverse or Lex Machina currently)
# Setup pipeline script to complete all of these steps in series
# Add constraints to database tables, e.g. correct types, foreign keys, not null, lookup tables
# Add deduplication
== Directory Layout ==

Navigation menu