Changes

Jump to navigation Jump to search
As mentioned, the ptoassigneend_us_extracted is cleaned. This section works on the remaining records which are stored in ptoassigneend_us_temp.
NextFirst, filter out records with city that is a city, zip that is a zip, state that is a state. Note: The consistency between city and state or city and postcode is not checked in this section.
*zip that is a zip
*city that is a city
 
No comma, no dot
The feature city is messy. In this way, we first clean feature city before identifying clean data.
* IS NOT NULL AND city != '' AND city !~* '([,]|[.])'
 
Problem: can't identify cities
-- 23501

Navigation menu