Difference between revisions of "Sonia Zhang (Work Log)"

From edegan.com

Revision as of 17:51, 13 March 2017


Sonia Zhang Work Logs (log page)

Summer Work

02/23/2017 - Set up the user page and the work log page. Got an overview of the patent data.

02/27/2017 - Started working on the issues listed on Patent Data Issues.

02/28/2017 - Cleaned assigneeinfo, msalist, etc.

03/01/2017 - Had a meeting to discuss problems in the patent data.

03/02/2017 - Cleaned some of the 'name' and 'city' records in ptoassigneend2. Created the ptoassigneend_country table to store country information. Worked out some methods for filling in empty 'city' and 'country' fields.

03/06/2017 - Updated the ptoassigneend table. Filled some of the missing 'country' values with 'UNITED STATES' based on the 'state' information.
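
The fill step above can be illustrated with a minimal Python sketch. It is not the query that was actually run: it assumes that 'state' holds two-letter U.S. state abbreviations and that each record can be treated as a dict, and the names US_STATES and fill_country are hypothetical.

```python
# Two-letter abbreviations for the 50 U.S. states plus DC.
# Assumption: the 'state' column uses this abbreviation format.
US_STATES = {
    "AL", "AK", "AZ", "AR", "CA", "CO", "CT", "DE", "FL", "GA",
    "HI", "ID", "IL", "IN", "IA", "KS", "KY", "LA", "ME", "MD",
    "MA", "MI", "MN", "MS", "MO", "MT", "NE", "NV", "NH", "NJ",
    "NM", "NY", "NC", "ND", "OH", "OK", "OR", "PA", "RI", "SC",
    "SD", "TN", "TX", "UT", "VT", "VA", "WA", "WV", "WI", "WY",
    "DC",
}


def fill_country(rows):
    """Return copies of rows with an empty 'country' set to
    'UNITED STATES' when 'state' is a recognized U.S. state."""
    filled = []
    for row in rows:
        row = dict(row)  # copy so the input records are left untouched
        state = (row.get("state") or "").strip().upper()
        if not row.get("country") and state in US_STATES:
            row["country"] = "UNITED STATES"
        filled.append(row)
    return filled
```

Records that already have a country, or whose state is not recognized, pass through unchanged, so only genuinely missing values are filled.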

03/10/2017 - Extracted U.S. address information from the ptoassigneend table. The extracted records are stored in the new table 'ptoassigneend_missus'. For details, see Patent Data Restructure.

03/12/2017 - Applied similar methods to extract address information from Japanese patents. The results are stored in 'ptoassigneend_missjapan'. Matched postal code patterns for the 200 distinct countries that exist in the patent table.
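
As a rough illustration of postal code pattern matching, the sketch below checks an address string against two well-known formats: U.S. ZIP codes (five digits, optionally followed by a hyphen and four digits) and Japanese postal codes (three digits, a hyphen, four digits). The log does not record the actual patterns or queries used, so POSTCODE_PATTERNS and guess_countries are hypothetical names, and the sample addresses in the usage note are made up.

```python
import re

# Assumed patterns, one per country. The lookarounds stop a match from
# starting or ending inside a longer run of digits and hyphens.
POSTCODE_PATTERNS = {
    "UNITED STATES": re.compile(r"(?<![\d-])\d{5}(?:-\d{4})?(?![\d-])"),
    "JAPAN": re.compile(r"(?<![\d-])\d{3}-\d{4}(?![\d-])"),
}


def guess_countries(address):
    """Return the countries whose postal code pattern occurs in address."""
    return [
        country
        for country, pattern in POSTCODE_PATTERNS.items()
        if pattern.search(address)
    ]
```

For example, guess_countries("6100 MAIN ST HOUSTON TX 77005") matches only the U.S. pattern, and guess_countries("TOKYO 100-0005") matches only the Japanese one. Many countries share a postal code format, so a pattern match alone is weak evidence and would normally be combined with other address fields.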