Hard and Soft African are artificial constructs based on the sounds that occur in names, though Hard African correlates to some degree with Eastern African nations. (Note that some African nations, particularly North-Eastern ones, are classified as Arab.) Romanian and Pakistani may be difficult to identify in some datasets but do appear to be recognisably unique given sufficient volume. The various [http://en.wikipedia.org/wiki/Slavic_peoples Slavic definitions] do not tie precisely to their accepted geographical and linguistic definitions but are close enough to be used.
Culture Based Classifications
Revision as of 23:08, 16 July 2009
Western European should be further decomposed into English, German, French, Italian, and perhaps Portuguese. It is unlikely to be possible to distinguish Spanish from Hispanic. Furthermore, Israel (and more particularly Jewish) is likely to be very hard to distinguish in most data sources. Israel has had a large amount of immigration, particularly from Russia and Europe, and is perhaps the definitive land of immigrants.
==Classification by Egan (2009)==
From a manual inspection by a researcher of names around the world, it appears that the following classes can be identified:
*European
**North Slavic
*African
**Hard East African
**Soft Other African
*Asian
**Arab
**Vietnamese
The source file was not sufficiently detailed for many countries for them to be included in the classification. This is not necessarily a problem: the remaining countries account for a very low percentage of the world's population, and for most applications we are only interested in classifications that a human could make in a real-world context. The definition file for this classification system ([http://www.edegan.com/repository/Culture-EganClassification.txt Culture-EganClassification.txt]) uses [[UN GeoRegion Codes | UN recognized country names]] and excludes unclassifiable countries.
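As a rough illustration of how such a definition file might be used, the sketch below loads a country-to-class mapping and looks up a country. The layout of Culture-EganClassification.txt is not described here, so the tab-separated "country, class" format is an assumption, as are the function names <code>load_classification</code> and <code>classify</code>. The sample rows are illustrative placeholders and are not taken from the actual file.

```python
import csv
import io

# ASSUMPTION: one "country<TAB>class" pair per line. The real
# Culture-EganClassification.txt may be laid out differently.
# These rows are illustrative placeholders, not data from that file.
SAMPLE = "Vietnam\tVietnamese\nEgypt\tArab\nPoland\tNorth Slavic\n"


def load_classification(text):
    """Return a dict mapping country name to culture class."""
    mapping = {}
    for row in csv.reader(io.StringIO(text), delimiter="\t"):
        if len(row) < 2 or not row[0].strip():
            continue  # skip blank or malformed lines
        mapping[row[0].strip()] = row[1].strip()
    return mapping


def classify(country, mapping):
    """Look up a country; countries absent from the file return None."""
    return mapping.get(country)


classification = load_classification(SAMPLE)
```

Returning <code>None</code> for a country that is missing from the mapping mirrors the fact that the definition file excludes unclassifiable countries, so callers need to handle that case explicitly.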