Changes

Jump to navigation Jump to search
2,945 bytes added ,  15:04, 8 December 2017
===Fall 2017===<onlyinclude>
[[Shelby Bice]] [[Work Logs]] [[Shelby Bice (Work Log)|(log page)]]
===Spring 2017 Work===2/14/2017 10-12-08 1:00 am pm - 122:00 45 pm Set up personal wiki- updated the ER Diagram on my project page to include the three tables for reissue, plant, set and design patents respectively. Finished typing up work logthe status of the project as I am leaving it with notes to Oliver and Ed
2/16/2017 10-12-07 3:30 am 15 PM - 124:00 pm Researched past work 45 PM (came in late due to finals) - finished debugging additions to Oliver's code for the tables that are related to design, reissue, and plant patents, added a troubleshooting section to Oliver's page with instructions on databases, discussed how to deal with issues importing the project with Ed.
2017-12-04 2/21/2017 9:00 am 45 pm - 124:00 pm Set - continued debugging and started typing up work page, reviewed SQL, researched designing database, continued going through wikitroubleshooting tips for the next person who alters the patent code
2/23/2017 9-12-01 3:30 am 15 pm - 125:00 pm Reviewed Perl- ran code (and ran into errors) which I have been working on fixing. If I don't finish today, read about I'll continue doing so on Monday. I plan to write up some of the mistakes I made in adding the tables to the database designand add them to Oliver's page, with his permission so that people in the future who are not familiar with the code (like I wasn't) hopefully won't fall into the same pitfalls. The main things are 1) how to set up a maven project page (if IntelliJ doesn't automatically set it up for redesigning you when you open/import the project, and 2) how to set up the data source so you can run SQL scripts and actually load data into the database, started documenting processon the RDP.
3/2/2017 9-11-30-17 1:15 am 55 pm - 123:15 55 pm Started excel spreadsheet - continued altering code. Wrote creation tables script in SQL for creating tables for the design, reissue, and plant patents, and went through the checklist to make sure I had done everything to document current schema design create these new tables based on Oliver's Reproducible Patent Data page. Will definitely run code tomorrow and improvements will type up the exact process I went through to be made, updated project pagescreate new tables.
3/7/2017 9-11-17 2:30 am 15 pm - 125:00 pm Continued working on spreadsheet- continued altering code to include the special fields for plant, reissue, added and design patent. Their models and the changes to XmlParser relevant page links to project page, took notes on what those models should be done. I will go through the rest of the code and check where else I want documentation need to look like make alterations next time I am in the future.
3/9/2017 9-11-16 8:00 45 am - 1210:00 pm Finished first draft of spreadsheet describing 30 am - continued altering code to include the current schema (special fields for plant, reissue, and possible changes) to the Patent databasedesign patents
3/21/2017 9-11-14 8:30 am - 1210:00 pm 30 am - Worked met with Oliver to go over his code, began working on determining "core" tables scripts and finding paths for new patent databasethe fields qunie to design, reissue, and plant patents.
3/22/2017 -11-10: 2:00 - 5:30 pm to 6:30 00 pm - Patent Data meetingupdated format of worklog, finished researching fields of the new design, utility, and reissue tables, and documenting what each table contains
2017-11-03: 2:30 - 3:30 pm, 3/23/2017 9:15 am 45 pm - 125:00 pm (I went to get a snack because I was starving!) - Narrowed down core tables Finished up finding unique fields, added them to database design, and researched what the fields represented - again, this is taking a while since there is little to no documentation online about what fieldson these XML files actually represent.
4/4/2017 9-11-02:00 am 2:15 pm - 114:45 00 pm - Worked on updating documentation, found documentation on pulling made a main data/making tables page and databases, started looking continued going through DTDs to find extra fields to pullthe xpaths Oliver found. Notified Michelle so she can continue adding patent pages
4/6/2017 -10-31: 9:30 00 am - 1110:30 pm am - Kept looking through DTDsstarted trying to determine from Oliver's research what paths are unique to utility, reissue, kept updating documentationand plant patents that we have not yet accounted for in our design of the table
4/11/2017 9-10-27:15 am 2:00 pm - 123:00 30 pm - Worked Kept updating tables. For some of the new values I've been adding to tables (I didn't realize we had XPaths for some fields) I'm struggling to find out exactly what the field means (for example, the date on trying a citation - is this the date the citedpatent was granted? or is the date that the citingpatent cited the citedpatent?) so the process is slow going. I want to get the documentation right though so in the future someone can look up and see exactly what each field in the table represents and not have to update patent data through 2016guess
4/13/2017 9-10-26:30 am 2:00 pm - 124:00 pm - Continued working worked on trying to update updating the designs for the patent data through 2016, databases tables based on what was discussed on Friday (specifically parsing the dataadding inventors and lawyers tables, worked with Ed to update perl scriptsaltering fields on patent tables) - will continue this on Friday
4/18/2017 9-10-20:45 am - 122:00 pm - 5:30 00 pm - Cleaned up documentation more, kept working through the process of parsing the datamet with Ed and Oliver about patent database design
4/20/2017 -10-19: 2:00 am pm - 114:30 00 pm - wrote copy statements finished up ER diagram and description of all tables for copying patent database, started reading papers concerning creating an inventor's database (looks like other research groups have merged the USPTO data with the Harvard Dataverse data from RDP in order to database, continued working on documentation.create an inventors table)
4/25/2017 -10-17: 8:00 45 am - 1210:00 pm 30 am - continued trying to solve twitter issues, worked on documentationER diagram, tried skimmed through a paper I found (linked at the top of the page for Redesign Assignment and Patent Database) to determine see how they cleaned the data, since I assume that will be the next step to clean up research after finishing the USPTO Assignee DataER diagrams
4/27/2017 1-10-12: 2:00 30 pm - 35:00 pm - worked write blog post on documentation more, tried to figure out how Grace Hopper Celebration and attempted to clean citation datasolve Twitter issue
===Fall 2017 Work===9/15/2017 2-10-03: 8:00 pm 45 am - 510:00 pm 30 am - introduced finished adding descriptions to new the fields for each table in the patent database projected, reviewed and took notes on USPTO Assignment data (notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes started work on USPTO Assignment Data Paper)an ER diagram
9/22/2017 8-09-29: 2:30 am 00 pm - 105:30 am 00 pm - continued looking at paper on USPTO assignment data and finished adding to the notes on what design for the design of that Patent database should look liketo the project page, specifically on what I need added descriptions of the fields for different tables and what each table to the project page including the datatype that I donthink the field will be when it't know yet about s loaded into the design. Had to set up connection to RDP again due to technical issues. database
2017-09-28: 9/23/2017 2:00 pm am - 410:00 pm 30 am - continued working on going over and documenting last semester's patent database design of and adding the details to the Redesign Assignment database and Patent Database project page. Additionally, I began trying to determine how it will connect to Patent database by writing out what will be match up the information in each the Document_Info table in Assignment and questions about different possible structures of tables that we will have to address before finalizing the design - match up with a patent_id in the notes can be found under McNair/Projects/Redesigning Patent Database/New table in Patent Database Project as Notes on USPTO Assignment Data Paper. Questions are highlighted in yellow throughout the document[[Category:Work Log]]
9Tomorrow I will finish adding the design for the Patent database to the project page, add descriptions of the fields for each table to the project page, and start working on ER diagrams for the two databases. Links for creating ER diagrams: https:/26/erdplus.com/#/standalone or https://creately.com/app/?tempID=hqdgwjki1&login_type=demo# 2017 -09-26: 8:45 am - 10:00 am - continued worked on design of Assignment database by checking my design against the work done last semester on the assignment data restructure to make sure I didn't miss anything major. Began going over my patent database design from last semester to tweak it. Will need to sync up with Joe Reilly to see if there are any new fields that we are pulling from the data. Additionally, I made a new project page called Redesign Assignment and Patent Database that encompasses the new design for the Assignment database and Patent database redesign and moved some of the notes from McNair/Projects/Redesigning Patent Database/New Patent Database Project/Notes on USPTO Assignment Data Paper to the project page.
The main takeaway from looking over Patent Assignment Data Restructure is that, after assembling the table according to my design (which doesn't seem to have any contradictions with the Patent Assignment Data Restructure) that there will by multiple steps for cleaning the data, specifically the fields relating to location and address in the assignment table. While the Patent Assignment Data Restructure mentions connecting to the Patent database, it is not clear from the page what field would be used to connect to the Patent database.
9/28/2017 9-09-23: 2:00 am pm - 104:30 am 00 pm - continued going over and documenting last semester's patent database working on design and adding the details to the Redesign of Assignment database and Patent Database project page. Additionally, I began trying to determine how it will connect to match up the information Patent database by writing out what will be in the Document_Info each table in Assignment and questions about different possible structures of tables that we will have to match up with a patent_id in address before finalizing the design - the notes can be found under McNair/Projects/Redesigning Patent table in Database/New PatentDatabase Project as Notes on USPTO Assignment Data Paper.Questions are highlighted in yellow throughout the document[[Category:Work Log]]
Tomorrow I will finish 2017-09-22: 8:30 am - 10:30 am - continued looking at paper on USPTO assignment data and adding to the notes on what the design of that database should look like, specifically on what I need for different tables and what I don't know yet about the design. Had to set up connection to RDP again due to technical issues.  2017-09-15: 2:00 pm - 5:00 pm - introduced to new patent database projected, reviewed and took notes on USPTO Assignment data (notes can be found under McNair/Projects/Redesigning Patent database Database/New Patent Database Project as Notes on USPTO Assignment Data Paper) </onlyinclude> ===Spring 2017=== 2017-04-27: 1:00 pm - 3:00 pm - worked on documentation more, tried to figure out how to clean citation data 2017-04-25: 10:00 am - 12:00 pm - worked on documentation, tried to determine how to clean up the project pageUSPTO Assignee Data 2017-04-20: 10:00 am - 11:30 pm - wrote copy statements for copying data from RDP to database, add descriptions continued working on documentation. 2017-04-18: 9:45 am - 12:30 pm - Cleaned up documentation more, kept working through the process of parsing the fields for each table data 2017-04-13: 9:30 am - 12:00 pm - Continued working on trying to update patent data through 2016, specifically parsing the project pagedata, and start working worked with Ed to update perl scripts 2017-04-11: 9:15 am - 12:00 pm - Worked on ER diagrams for the two databases.trying to update patent data through 2016 2017-04-06: 9:30 am - 11:30 pm - Kept looking through DTDs, kept updating documentation
Links for creating ER diagrams2017-04-04: https9://erdplus.com/#/standalone or https00 am - 11:45 pm - Worked on updating documentation, found documentation on pulling data//creately.com/app/?tempID=hqdgwjki1&login_type=demo#making tables and databases, started looking through DTDs to find extra fields to pull
2017-03-23: 9/29/2017 2:00 pm 15 am - 512:00 pm - finished adding the design for the Patent database to the project page, added descriptions of the Narrowed down core tables and fields for each table to the project page including the datatype that I think the field will be when it's loaded into the database
10/2017-03/2017 8-22: 5:45 am - 1030 pm to 6:30 am pm - finished adding descriptions to the fields for each table in the patent database, started work on an ER diagramPatent Data meeting
10/12/2017 2-03-21: 9:30 pm am - 512:00 pm write blog post - Worked on Grace Hopper Celebration and attempted to solve Twitter issuedetermining "core" tables for new patent database
10/17/2017 8-03-09: 9:45 00 am - 1012:30 am - continued trying to solve twitter issues, worked on ER diagram, skimmed through a paper I found (linked at the top 00 pm Finished first draft of spreadsheet describing the page for Redesign Assignment current schema (and Patent Databasepossible changes) to see how they cleaned the data, since I assume that will be the next step to research after finishing the ER diagramsPatent database
10/19/2017 2-03-07: 9:00 pm 30 am - 412:00 pm - finished up ER diagram and description of all tables for patent databaseContinued working on spreadsheet, added relevant page links to project page, started reading papers concerning creating an inventor's database (looks took notes on what I want documentation to look like other research groups have merged the USPTO data with in the Harvard Dataverse data in order to create an inventors table)future
10/20/2017 - 203-02:00 pm 9:15 am - 512:00 15 pm - met with Ed Started excel spreadsheet to document current schema design and Oliver about patent database designimprovements to be made, updated project pages
10/26/2017 2-02-23:00 pm 9:30 am - 412:00 pm - worked on updating the designs Reviewed Perl, read about database design, set up project page for the patent databases tables based on what was discussed on Friday (specifically adding inventors and lawyers tablesredesigning database, altering fields on patent tables) - will continue this on Fridaystarted documenting process
10/27/2017 2-02-21: 9:00 pm am - 312:30 00 pm - Kept updating tables. For some of the new values I've been adding to tables (I didn't realize we had XPaths for some fields) I'm struggling to find out exactly what the field means (for exampleSet up work page, reviewed SQL, researched designing database, the date on a citation - is this the date the citedpatent was granted? or is the date that the citingpatent cited the citedpatent?) so the process is slow continued going. I want to get the documentation right though so in the future someone can look up and see exactly what each field in the table represents and not have to guessthrough wiki
10/31/2017 9-02-16:00 am - 10:30 am - started trying to determine from Oliver's research what paths are unique to utility, reissue12:00 pm Researched past work on databases, and plant patents that we have not yet accounted for in our design of the tablediscussed project with Ed
11/2/2017 2-02-14: 10:15 pm 00 am - 412:00 pm - made a main data page and continued going through the xpaths Oliver found. Notified Michelle so she can continue adding patent pagesSet up personal wiki, set up work log
11/3/2017 2[[Category:30 - 5:00 pm Finished up finding unique fieldsWork Log]]

Navigation menu