Changes

3,170 bytes added, 15:04, 8 December 2017
===Fall 2017===
<onlyinclude>
[[Shelby Bice]] [[Work Logs]] [[Shelby Bice (Work Log)|(log page)]]
2017-12-08 1:00 pm - 2:45 pm - updated the ER Diagram on my project page to include the three tables for reissue, plant, and design patents respectively. Finished typing up the status of the project as I am leaving it, with notes to Oliver and Ed.
2017-12-07 3:15 pm - 4:45 pm (came in late due to finals) - finished debugging additions to Oliver's code for the tables that are related to design, reissue, and plant patents, added a troubleshooting section to Oliver's page with instructions on how to deal with issues importing the project.
2017-12-04 2:45 pm - 4:00 pm - continued debugging and started typing up troubleshooting tips for the next person who alters the patent code
2017-12-01 3:15 pm - 5:00 pm - ran code (and ran into errors), which I have been working on fixing. If I don't finish today, I'll continue doing so on Monday. I plan to write up some of the mistakes I made in adding the tables to the database and add them to Oliver's page, with his permission, so that people in the future who are not familiar with the code (like I wasn't) hopefully won't fall into the same pitfalls. The main things are 1) how to set up a Maven project (if IntelliJ doesn't automatically set it up for you when you open/import the project), and 2) how to set up the data source so you can run SQL scripts and actually load data into the database on the RDP.
2017-11-30 1:55 pm - 3:55 pm - continued altering code. Wrote the table-creation script in SQL for the design, reissue, and plant patents, and went through the checklist to make sure I had done everything to create these new tables based on Oliver's Reproducible Patent Data page. Will definitely run code tomorrow and will type up the exact process I went through to create new tables.
2017-11-17 2:15 pm - 5:00 pm - continued altering code to include the special fields for plant, reissue, and design patents. Their models and the changes to XmlParser relevant to those models should be done. I will go through the rest of the code and check where else I need to make alterations next time I am in.
2017-11-16 8:45 am - 10:30 am - continued altering code to include the special fields for plant, reissue, and design patents
2017-11-14 8:30 am - 10:30 am - met with Oliver to go over his code, began working on scripts and finding paths for the fields unique to design, reissue, and plant patents.
2017-11-10: 2:00 pm - 5:00 pm - updated format of worklog, finished researching fields of the new design, utility, and reissue tables, and documenting what each table contains
2017-11-03: 2:30 pm - 3:30 pm, 3:45 pm - 5:00 pm (I went to get a snack because I was starving!) - Finished up finding unique fields, added them to database design, and researched what the fields represented - again, this is taking a while since there is little to no documentation online about what fields on these XML files actually represent.
2017-11-02: 2:15 pm - 4:00 pm - made a main data page and continued going through the xpaths Oliver found. Notified Michelle so she can continue adding patent pages
2017-10-31: 9:00 am - 10:30 am - started trying to determine from Oliver's research what paths are unique to utility, reissue, and plant patents that we have not yet accounted for in our design of the table
2017-10-27: 2:00 pm - 3:30 pm - Kept updating tables. For some of the new values I've been adding to tables (I didn't realize we had XPaths for some fields) I'm struggling to find out exactly what the field means (for example, the date on a citation - is this the date the citedpatent was granted, or is it the date that the citingpatent cited the citedpatent?) so the process is slow going. I want to get the documentation right, though, so in the future someone can look up and see exactly what each field in the table represents and not have to guess
2017-10-26: 2:00 pm - 4:00 pm - worked on updating the designs for the patent database tables based on what was discussed on Friday (specifically adding inventors and lawyers tables, altering fields on patent tables) - will continue this on Friday
2017-10-20: 2:00 pm - 5:00 pm - met with Ed and Oliver about patent database design
2017-10-19: 2:00 pm - 4:00 pm - finished up ER diagram and description of all tables for patent database, started reading papers concerning creating an inventor's database (looks like other research groups have merged the USPTO data with the Harvard Dataverse data in order to create an inventors table)
2017-10-17: 8:45 am - 10:30 am - continued trying to solve Twitter issues, worked on ER diagram, skimmed through a paper I found (linked at the top of the page for Redesign Assignment and Patent Database) to see how they cleaned the data, since I assume that will be the next step to research after finishing the ER diagrams
2017-10-12: 2:30 pm - 5:00 pm - wrote blog post on Grace Hopper Celebration and attempted to solve Twitter issue
2017-10-03: 8:45 am - 10:30 am - finished adding descriptions to the fields for each table in the patent database, started work on an ER diagram
2017-09-29: 2:00 pm - 5:00 pm - finished adding the design for the Patent database to the project page, added descriptions of the fields for each table to the project page including the datatype that I think the field will be when it's loaded into the database
2017-09-28: 8:00 am - 10:30 am - continued going over and documenting last semester's patent database design and adding the details to the Redesign Assignment and Patent Database project page. Additionally, began trying to determine how to match the information in the Document_Info table in Assignment with a patent_id in the Patent table in Patent.
Tomorrow I will finish adding the design for the Patent database to the project page, add descriptions of the fields for each table to the project page, and start working on ER diagrams for the two databases. Links for creating ER diagrams: https://erdplus.com/#/standalone or https://creately.com/app/?tempID=hqdgwjki1&login_type=demo#
2017-09-26: 8:45 am - 10:00 am - continued working on design of Assignment database by checking my design against the work done last semester on the assignment data restructure to make sure I didn't miss anything major. Began going over my patent database design from last semester to tweak it. Will need to sync up with Joe Reilly to see if there are any new fields that we are pulling from the data. Additionally, I made a new project page called Redesign Assignment and Patent Database that encompasses the new design for the Assignment database and Patent database redesign and moved some of the notes from McNair/Projects/Redesigning Patent Database/New Patent Database Project/Notes on USPTO Assignment Data Paper to the project page.
The main takeaway from looking over Patent Assignment Data Restructure is that, after assembling the table according to my design (which doesn't seem to have any contradictions with the Patent Assignment Data Restructure), there will be multiple steps for cleaning the data, specifically the fields relating to location and address in the assignment table. While the Patent Assignment Data Restructure mentions connecting to the Patent database, it is not clear from the page what field would be used to connect to the Patent database.
2017-09-23: 2:00 pm - 4:00 pm - continued working on design of Assignment database and how it will connect to Patent database by writing out what will be in each table in Assignment and questions about different possible structures of tables that we will have to address before finalizing the design - the notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes on USPTO Assignment Data Paper. Questions are highlighted in yellow throughout the document. [[Category:Work Log]]
2017-09-22: 8:30 am - 10:30 am - continued looking at paper on USPTO assignment data and adding to the notes on what the design of that database should look like, specifically on what I need for different tables and what I don't know yet about the design. Had to set up connection to RDP again due to technical issues.
2017-09-15: 2:00 pm - 5:00 pm - introduced to new patent database project, reviewed and took notes on USPTO Assignment data (notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes on USPTO Assignment Data Paper)
</onlyinclude>
===Spring 2017===
2017-04-27: 1:00 pm - 3:00 pm - worked on documentation more, tried to figure out how to clean citation data
2017-04-25: 10:00 am - 12:00 pm - worked on documentation, tried to determine how to clean up the USPTO Assignee Data
2017-04-20: 10:00 am - 11:30 am - wrote copy statements for copying data from RDP to database, continued working on documentation.
2017-04-18: 9:45 am - 12:30 pm - Cleaned up documentation more, kept working through the process of parsing the data
2017-04-13: 9:30 am - 12:00 pm - Continued working on trying to update patent data through 2016, specifically parsing the data, worked with Ed to update Perl scripts
2017-04-11: 9:15 am - 12:00 pm - Worked on trying to update patent data through 2016
2017-04-06: 9:30 am - 11:30 am - Kept looking through DTDs, kept updating documentation
2017-04-04: 9:00 am - 11:45 am - Worked on updating documentation, found documentation on pulling data/making tables and databases, started looking through DTDs to find extra fields to pull
2017-03-23: 9:15 am - 12:00 pm - Narrowed down core tables and fields
2017-03-22: 5:30 pm to 6:30 pm - Patent Data meeting
2017-03-21: 9:30 am - 12:00 pm - Worked on determining "core" tables for the new patent database
2017-03-09: 9:00 am - 12:00 pm - Finished first draft of spreadsheet describing the current schema (and possible changes) to the Patent database
2017-03-07: 9:30 am - 12:00 pm - Continued working on spreadsheet, added relevant page links to project page, took notes on what I want documentation to look like in the future
2017-03-02: 9:15 am - 12:15 pm - Started Excel spreadsheet to document current schema design and improvements to be made, updated project pages
2017-02-23: 9:30 am - 12:00 pm - Reviewed Perl, read about database design, set up project page for redesigning database, started documenting process
2017-02-21: 9:00 am - 12:00 pm - Set up work page, reviewed SQL, researched designing database, continued going through wiki
2017-02-16: 10:30 am - 12:00 pm - Researched past work on databases, discussed project with Ed
2017-02-14: 10:00 am - 12:00 pm - Set up personal wiki, set up work log
[[Category:Work Log]]
