Changes

Jump to navigation Jump to search
2,614 bytes added ,  15:04, 8 December 2017
[[Shelby Bice]] [[Work Logs]] [[Shelby Bice (Work Log)|(log page)]]
9/15/2017 2-12-08 1:00 pm - 52:00 45 pm - introduced updated the ER Diagram on my project page to new patent database projectedinclude the three tables for reissue, plant, reviewed and took design patents respectively. Finished typing up the status of the project as I am leaving it with notes on USPTO Assignment data (notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes on USPTO Assignment Data Paper)to Oliver and Ed
9/22/2017 8-12-07 3:30 am 15 PM - 104:30 am 45 PM (came in late due to finals) - continued looking at paper on USPTO assignment data and adding finished debugging additions to Oliver's code for the notes on what the tables that are related to design of that database should look like, specifically on what I need for different tables reissue, and what I donplant patents, added a troubleshooting section to Oliver't know yet about the design. Had to set up connection to RDP again due s page with instructions on how to technical deal with issuesimporting the project.
9/23/2017 -12-04 2:00 45 pm - 4:00 pm - continued working on design of Assignment database debugging and how it will connect to Patent database by writing out what will be in each table in Assignment and questions about different possible structures of tables that we will have to address before finalizing started typing up troubleshooting tips for the design - next person who alters the notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes on USPTO Assignment Data Paper. Questions are highlighted in yellow throughout the document[[Category:Work Log]]patent code
9/26/2017 8-12-01 3:45 am 15 pm - 105:00 am pm - continued worked ran code (and ran into errors) which I have been working on design fixing. If I don't finish today, I'll continue doing so on Monday. I plan to write up some of Assignment database by checking my design against the work done last semester on mistakes I made in adding the assignment data restructure tables to make sure I didn't miss anything major. Began going over my patent the database design from last semester and add them to tweak it. Will need to sync up Oliver's page, with Joe Reilly to see if there are any new fields his permission so that we people in the future who are pulling from not familiar with the code (like I wasn't) hopefully won't fall into the datasame pitfalls. Additionally, I made The main things are 1) how to set up a new maven project page called Redesign Assignment and Patent Database that encompasses the new design (if IntelliJ doesn't automatically set it up for you when you open/import the Assignment database project, and Patent database redesign 2) how to set up the data source so you can run SQL scripts and moved some of actually load data into the notes from McNair/Projects/Redesigning Patent Database/New Patent Database Project/Notes database on USPTO Assignment Data Paper to the project pageRDP.
The main takeaway from looking over 2017-11-30-17 1:55 pm - 3:55 pm - continued altering code. Wrote creation tables script in SQL for creating tables for the design, reissue, and plant patents, and went through the checklist to make sure I had done everything to create these new tables based on Oliver's Reproducible Patent Assignment Data Restructure is thatpage. Will definitely run code tomorrow and will type up the exact process I went through to create new tables.  2017-11-17 2:15 pm - 5:00 pm - continued altering code to include the special fields for plant, reissue, and design patent. Their models and the changes to XmlParser relevant to those models should be done. I will go through the rest of the code and check where else I need to make alterations next time I am in.  2017-11-16 8:45 am - 10:30 am - continued altering code to include the special fields for plant, reissue, and design patents 2017-11-14 8:30 - 10:30 am - met with Oliver to go over his code, after assembling began working on scripts and finding paths for the table according fields qunie to my design , reissue, and plant patents. 2017-11-10: 2:00 - 5:00 pm - updated format of worklog, finished researching fields of the new design, utility, and reissue tables, and documenting what each table contains 2017-11-03: 2:30 - 3:30 pm, 3:45 pm - 5:00 pm (which doesn't seem I went to get a snack because I was starving!) - Finished up finding unique fields, added them to have any contradictions with database design, and researched what the Patent Assignment Data Restructure) that fields represented - again, this is taking a while since there will by multiple steps for cleaning the is little to no documentation online about what fields on these XML files actually represent. 2017-11-02: 2:15 pm - 4:00 pm - made a main data, specifically page and continued going through the fields relating xpaths Oliver found. Notified Michelle so she can continue adding patent pages 2017-10-31: 9:00 am - 10:30 am - started trying to determine from Oliver's research what paths are unique to location utility, reissue, and address plant patents that we have not yet accounted for in our design of the assignment table 2017-10-27: 2:00 pm - 3:30 pm - Kept updating tables. While For some of the Patent Assignment Data Restructure mentions connecting new values I've been adding to tables (I didn't realize we had XPaths for some fields) I'm struggling to find out exactly what the Patent databasefield means (for example, it the date on a citation - is this the date the citedpatent was granted? or is not clear from the page date that the citingpatent cited the citedpatent?) so the process is slow going. I want to get the documentation right though so in the future someone can look up and see exactly what each field would be used to connect in the table represents and not have to the Patent database.guess
9/28/2017 9-10-26: 2:00 am pm - 104:30 am 00 pm - continued going over and documenting last semester's worked on updating the designs for the patent database design and databases tables based on what was discussed on Friday (specifically adding the details to the Redesign Assignment inventors and Patent Database project page. Additionallylawyers tables, I began trying to determine how to match up the information in the Document_Info table in Assignment to match up with a patent_id in the Patent table in Patent.altering fields on patent tables) - will continue this on Friday
Tomorrow I will finish adding the 2017-10-20: - 2:00 pm - 5:00 pm - met with Ed and Oliver about patent database design for the Patent database to the project page, add descriptions of the fields for each table to the project page, and start working on ER diagrams for the two databases.
Links for creating ER diagrams2017-10-19: https2://erdplus.com/#/standalone or https00 pm - 4://creately.com/app/?tempID=hqdgwjki1&login_type=demo#00 pm - finished up ER diagram and description of all tables for patent database, started reading papers concerning creating an inventor's database (looks like other research groups have merged the USPTO data with the Harvard Dataverse data in order to create an inventors table)
9/29/2017 2-10-17: 8:00 pm 45 am - 510:00 pm 30 am - finished adding the design for the Patent database continued trying to solve twitter issues, worked on ER diagram, skimmed through a paper I found (linked at the project page, added descriptions top of the fields page for each table Redesign Assignment and Patent Database) to see how they cleaned the project page including the datatype data, since I assume that I think the field will be when it's loaded into the databasenext step to research after finishing the ER diagrams
10/03/2017 8:45 am - 10-12: 2:30 am pm - finished adding descriptions 5:00 pm write blog post on Grace Hopper Celebration and attempted to the fields for each table in the patent database, started work on an ER diagramsolve Twitter issue
2017-10-03: 8:45 am - 10/12/2017 2:30 pm am - 5:00 pm write blog post finished adding descriptions to the fields for each table in the patent database, started work on Grace Hopper Celebration and attempted to solve Twitter issuean ER diagram
10/17/2017 8-09-29: 2:45 am 00 pm - 105:30 am 00 pm - continued trying finished adding the design for the Patent database to solve twitter issuesthe project page, worked on ER diagram, skimmed through a paper I found (linked at the top added descriptions of the page fields for Redesign Assignment and Patent Database) each table to see how they cleaned the data, since project page including the datatype that I assume that think the field will be when it's loaded into the next step to research after finishing the ER diagramsdatabase
10/19/2017 2-09-28: 9:00 pm am - 410:00 pm 30 am - finished up ER diagram continued going over and description of all tables for documenting last semester's patent databasedesign and adding the details to the Redesign Assignment and Patent Database project page. Additionally, started reading papers concerning creating an inventor's database (looks like other research groups have merged I began trying to determine how to match up the USPTO data with information in the Harvard Dataverse data Document_Info table in order Assignment to create an inventors match up with a patent_id in the Patent table)in Patent.
10/20/2017 - 2:00 pm - 5:00 pm - met with Ed Tomorrow I will finish adding the design for the Patent database to the project page, add descriptions of the fields for each table to the project page, and Oliver about patent database designstart working on ER diagrams for the two databases.
10Links for creating ER diagrams: https://26erdplus.com/2017 2#/standalone or https:00 pm - 4:00 pm - worked on updating the designs for the patent databases tables based on what was discussed on Friday (specifically adding inventors and lawyers tables, altering fields on patent tables) - will continue this on Friday//creately.com/app/?tempID=hqdgwjki1&login_type=demo#
10/27/2017 2-09-26:00 pm 8:45 am - 310:30 pm 00 am - Kept updating tables. For some continued worked on design of Assignment database by checking my design against the work done last semester on the new values I've been adding assignment data restructure to tables (make sure I didn't realize miss anything major. Began going over my patent database design from last semester to tweak it. Will need to sync up with Joe Reilly to see if there are any new fields that we had XPaths for some fields) are pulling from the data. Additionally, I'm struggling to find out exactly what made a new project page called Redesign Assignment and Patent Database that encompasses the field means (new design for example, the date Assignment database and Patent database redesign and moved some of the notes from McNair/Projects/Redesigning Patent Database/New Patent Database Project/Notes on a citation - USPTO Assignment Data Paper to the project page. The main takeaway from looking over Patent Assignment Data Restructure is this that, after assembling the date table according to my design (which doesn't seem to have any contradictions with the citedpatent was granted? or is the date Patent Assignment Data Restructure) that there will by multiple steps for cleaning the citingpatent cited data, specifically the citedpatent?) so fields relating to location and address in the process is slow goingassignment table. I want While the Patent Assignment Data Restructure mentions connecting to get the documentation right though so in Patent database, it is not clear from the future someone can look up and see exactly page what each field in would be used to connect to the table represents and not have to guessPatent database.
10/31/2017 9-09-23: 2:00 am pm - 104:30 am 00 pm - started trying continued working on design of Assignment database and how it will connect to determine from Oliver's research Patent database by writing out what paths are unique to utility, reissue, will be in each table in Assignment and plant patents questions about different possible structures of tables that we will have not yet accounted for to address before finalizing the design - the notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes on USPTO Assignment Data Paper. Questions are highlighted in our design of yellow throughout the tabledocument[[Category:Work Log]]
11/2/2017 2-09-22: 8:15 pm 30 am - 410:00 pm 30 am - made a main continued looking at paper on USPTO assignment data page and continued going through adding to the notes on what the design of that database should look like, specifically on what I need for different tables and what I don't know yet about the xpaths Oliver founddesign. Had to set up connection to RDP again due to technical issues. Notified Michelle so she can continue adding patent pages
11/3/2017 2:30 - 309-15:30 pm, 32:45 00 pm - 5:00 pm (I went to get a snack because I was starving!) - Finished up finding unique fields, added them introduced to new patent database designprojected, reviewed and researched what the fields represented - again, this is taking a while since there is little to no documentation online about what fields took notes on USPTO Assignment data (notes can be found under McNair/Projects/Redesigning Patent Database/New Patent Database Project as Notes on these XML files actually represent.USPTO Assignment Data Paper)
</onlyinclude>
===Spring 2017 Work===
2/14/2017 10-04-27: 1:00 am pm - 123:00 pm Set up personal wiki- worked on documentation more, set up work logtried to figure out how to clean citation data
2/16/2017 -04-25: 10:30 00 am - 12:00 pm Researched past work - worked on databasesdocumentation, discussed project with Edtried to determine how to clean up the USPTO Assignee Data
2/21/2017 9-04-20: 10:00 am - 1211:00 30 pm Set up work page, reviewed SQL, researched designing - wrote copy statements for copying data from RDP to database, continued going through wikiworking on documentation.
2/23/2017 -04-18: 9:30 45 am - 12:00 30 pm Reviewed Perl, read about database design, set - Cleaned up project page for redesigning databasedocumentation more, started documenting kept working through the processof parsing the data
3/2/2017 -04-13: 9:15 30 am - 12:15 00 pm Started excel spreadsheet - Continued working on trying to document current schema design and improvements update patent data through 2016, specifically parsing the data, worked with Ed to be made, updated project pagesupdate perl scripts
3/7/2017 -04-11: 9:30 15 am - 12:00 pm Continued working on spreadsheet, added relevant page links to project page, took notes - Worked on what I want documentation trying to look like in the futureupdate patent data through 2016
3/9/2017 -04-06: 9:00 30 am - 1211:00 30 pm Finished first draft of spreadsheet describing the current schema (and possible changes) to the Patent database- Kept looking through DTDs, kept updating documentation
3/21/2017 -04-04: 9:30 00 am - 1211:00 45 pm - Worked on determining "core" updating documentation, found documentation on pulling data/making tables for new patent databaseand databases, started looking through DTDs to find extra fields to pull
3/22/2017 5-03-23:30 pm to 69:15 am - 12:30 00 pm - Patent Data meetingNarrowed down core tables and fields
3/23/2017 9-03-22:15 am - 125:30 pm to 6:00 30 pm - Narrowed down core tables and fieldsPatent Data meeting
4/4/2017 -03-21: 9:00 30 am - 1112:45 00 pm - Worked on updating documentation, found documentation on pulling data/making determining "core" tables and databases, started looking through DTDs to find extra fields to pullfor new patent database
4/6/2017 -03-09: 9:30 00 am - 1112:30 00 pm - Kept looking through DTDs, kept updating documentationFinished first draft of spreadsheet describing the current schema (and possible changes) to the Patent database
4/11/2017 -03-07: 9:15 30 am - 12:00 pm - Worked Continued working on spreadsheet, added relevant page links to project page, took notes on trying what I want documentation to update patent data through 2016look like in the future
4/13/2017 -03-02: 9:30 15 am - 12:00 15 pm - Continued working on trying Started excel spreadsheet to document current schema design and improvements to update patent data through 2016be made, specifically parsing the data, worked with Ed to update perl scriptsupdated project pages
4/18/2017 -02-23: 9:45 30 am - 12:30 00 pm - Cleaned Reviewed Perl, read about database design, set up documentation moreproject page for redesigning database, kept working through the started documenting process of parsing the data
4/20/2017 10-02-21: 9:00 am - 1112:30 00 pm - wrote copy statements for copying data from RDP to Set up work page, reviewed SQL, researched designing database, continued working on documentation.going through wiki
4/25/2017 -02-16: 10:00 30 am - 12:00 pm - worked Researched past work on documentationdatabases, tried to determine how to clean up the USPTO Assignee Datadiscussed project with Ed
4/27/2017 1-02-14: 10:00 pm am - 312:00 pm - worked on documentation moreSet up personal wiki, tried to figure out how to clean citation dataset up work log
[[Category:Work Log]]

Navigation menu