
==Restructure Address Information (First Stage)==
Note: This section was worked on by [[Sonia Zhang]]
*'''Both the state name and its abbreviation exist.'''
 
==Restructure Address Information (Second Stage)==
 
Note: This section was worked on by [[Sonia Zhang]]
 
The database is '''patent'''.
 
The table is '''ptoassigneend'''.
 
SQL code and other project files are in:
E:/McNair/Projects/PatentAddress
 
Input/output files are on the database server in:
Z:/PatentAddress
 
====To do====
 
In no particular order:
*Remove city, state, zip, country from addrline1 & addrline2 to get clean addrlines.
*Maybe concatenate addrline1 and addrline2 to make a single addrline
*Identify clean data (e.g. City that is a city, zip that is a zip, state that is a state)
**By pattern, length, match to list
*Try some more patterns, perhaps with a slightly higher false positive rate, on the remaining uncleaned data
**Iterate!
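
The to-do items above could be prototyped along the following lines before being ported to SQL. This is only a rough sketch in Python, not the project's actual code: the function names, the US-only ZIP pattern, and the three-entry state list are all assumptions made for illustration (a real run would load the full state list and whatever other patterns turn out to be needed). It shows identification by pattern (zip), by match to a list (state), and removal of already-identified values from an address line.

```python
import re

# Hypothetical, illustrative subset; a real run would load the full list.
US_STATES = {
    "TX": "Texas",
    "CA": "California",
    "NY": "New York",
}
# Reverse lookup from upper-cased full name to abbreviation.
STATE_NAMES = {name.upper(): abbr for abbr, name in US_STATES.items()}

# Assumed US ZIP format: five digits, optionally a hyphen and four more.
ZIP_RE = re.compile(r"\d{5}(-\d{4})?")


def is_clean_zip(value):
    """Identify a zip by pattern and length."""
    return bool(value) and ZIP_RE.fullmatch(value.strip()) is not None


def normalize_state(value):
    """Identify a state by match to list; return its abbreviation or None."""
    if not value:
        return None
    key = value.strip().upper()
    if key in US_STATES:
        return key
    return STATE_NAMES.get(key)


def strip_tokens(addrline, tokens):
    """Remove known city/state/zip/country values from an address line."""
    cleaned = addrline
    for token in tokens:
        if token:
            pattern = r"\b" + re.escape(token) + r"\b"
            cleaned = re.sub(pattern, "", cleaned, flags=re.IGNORECASE)
    # Collapse runs of commas and whitespace left behind by the removals.
    cleaned = re.sub(r"\s*,\s*(,\s*)*", ", ", cleaned)
    cleaned = re.sub(r"\s+", " ", cleaned)
    return cleaned.strip(" ,")


def concat_addrlines(addrline1, addrline2):
    """Join the non-empty address lines into a single addrline."""
    parts = [p.strip() for p in (addrline1, addrline2) if p and p.strip()]
    return ", ".join(parts)
```

For example, <code>strip_tokens("6100 Main St, Houston, TX 77005", ["Houston", "TX", "77005"])</code> leaves only the street part. Looser patterns (with a higher false positive rate) can then be tried on whatever rows these strict checks do not classify.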
 