*Load() - Expects a file named GNS-XX.txt (where XX is an ISO 3166 code) with the standard GNS column names, and loads it.
*Index - Builds the master index and all sub-indices
*GetIndexKeys() - Takes a specific GNS NT FC code (e.g. P, L, A) or ALL and returns a set of index keys
*GetUNIs() - Takes a place name and a type (e.g. P,L,A,ALL); returns a list of corresponding UNIs
*GetLongLat() - Takes a UNI, returns a longitude, latitude pair
*Load() - load the data file(s)
*CleanAndParse() - Performs a first round of cleaning and parsing (calls internal methods): extracts the postcode and replaces stop words.
*UnMatched() - Takes an NT FC code (e.g. P,L,A,ALL) and returns the set of currently unmatched country name keys for that type
*ReturnMatches() - Marks country name keys with their new match sets
*Exact Matching - Case-insensitive matching of the entire source string against the entire reference string
*LCS - Longest Common Subsequence based matching (See below)
*Place and administrative area - a location identified as NTFC=P or NTFC=A respectively in the GNS data. Unless otherwise specified, matches are performed for place and administrative area separately and in series.
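
As a rough illustration of the two matching styles listed above, the following Python sketch shows a case-insensitive exact match and a Longest Common Subsequence length computed by dynamic programming. The function names and the normalisation of the LCS length by the longer string's length are assumptions made for this example only; they are not taken from the project code, and the scoring actually used may differ.

```python
def exact_match(source: str, reference: str) -> bool:
    """Case-insensitive comparison of the entire source and reference strings."""
    return source.casefold() == reference.casefold()


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence of a and b.

    Standard dynamic programme keeping only the previous row,
    so memory use is proportional to len(b).
    """
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                # Characters agree: extend the subsequence found so far.
                curr.append(prev[j - 1] + 1)
            else:
                # Otherwise carry forward the better of the two neighbours.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_score(source: str, reference: str) -> float:
    """Hypothetical similarity score in [0, 1]: LCS length / longer length.

    The normalisation is an assumption for illustration only.
    """
    s, r = source.casefold(), reference.casefold()
    if not s or not r:
        return 0.0
    return lcs_length(s, r) / max(len(s), len(r))
```

In a staged process like the one described here, exact_match would typically be tried first, with lcs_score applied only to the names that remain unmatched and accepted above some chosen threshold.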
The sequence of processing is as follows (matching only the remaining unmatched locations at each stage):