This project is a continuation of [[Redesigning Patent Database]]. It aims to provide faster, more centralized code for processing the USPTO data. With an end-to-end pipeline, we can reproduce or update the data without worrying about unintended side effects or missing data.
 
== Progress ==
 
# <del>Downloader</del> ''done''
# <del>Splitter</del> ''done''
# <del>Parser</del> ''done''
# Data Source Merger
# Database Insert
# Data Cleanup
== Directory Layout ==
