Difference between revisions of "Shelby Bice (Research Plan)"
Jump to navigation
Jump to search
ShelbyBice (talk | contribs) (Created page with "2/16/2017 - Talked over project with Ed, began reading existing wiki pages related to patent data and databases") |
|||
| (24 intermediate revisions by one other user not shown) | |||
| Line 1: | Line 1: | ||
| − | 2/ | + | == Overview == |
| + | |||
| + | '''Overall goals:''' | ||
| + | |||
| + | * Create better database that includes all the patent data to which the McNair Center has access. | ||
| + | * More importantly, create documentation of process so it can improved upon/replicated in the future. | ||
| + | |||
| + | '''General Outline - updated 2/21/2017''' | ||
| + | |||
| + | * Familiarize myself with SQL, Perl, and database design | ||
| + | * Familiarize myself with existing scripts and schema for existing database | ||
| + | * Design a better representation for database | ||
| + | * Fix scripts if necessary | ||
| + | * Start moving data into new database by querying existing databases (using SQL) | ||
| + | * Use scripts to query new data | ||
| + | * Test database | ||
| + | * Remove extraneous information from database (copies, patents that we're not interested in, etc.) | ||
| + | |||
| + | '''Documentation I need to include:''' | ||
| + | |||
| + | * Schema of new database (with justification of design), would like to include a visual representation | ||
| + | * SQL commands that were used to fill database with explanation of what they do | ||
| + | * Clear instructions on where to find scripts in bulk drive and an explanation of what each script does | ||
| + | * Visual representation of example table entries that isn't just copied and pasted from a CSV file | ||
| + | |||
| + | '''Project Pages:''' | ||
| + | [[Redesigning Patent Database]] | ||
| + | |||
| + | == Log == | ||
| + | [[Category:Work Log]] | ||
Latest revision as of 16:17, 21 March 2017
Overview
Overall goals:
- Create better database that includes all the patent data to which the McNair Center has access.
- More importantly, create documentation of process so it can improved upon/replicated in the future.
General Outline - updated 2/21/2017
- Familiarize myself with SQL, Perl, and database design
- Familiarize myself with existing scripts and schema for existing database
- Design a better representation for database
- Fix scripts if necessary
- Start moving data into new database by querying existing databases (using SQL)
- Use scripts to query new data
- Test database
- Remove extraneous information from database (copies, patents that we're not interested in, etc.)
Documentation I need to include:
- Schema of new database (with justification of design), would like to include a visual representation
- SQL commands that were used to fill database with explanation of what they do
- Clear instructions on where to find scripts in bulk drive and an explanation of what each script does
- Visual representation of example table entries that isn't just copied and pasted from a CSV file
Project Pages: Redesigning Patent Database