Difference between revisions of "Culture Based Classifications"


Revision as of 20:48, 11 July 2009

There are many possible 'culture'-based classes that one might want to use. At the finest-grained level (and in the most ambitious case), one might want to predict actual countries of origin (standardized country names are provided by the United Nations). At every coarser-grained level, countries must be aggregated into meaningful units.

Two commonly used (and meaningful) aggregations are:

  1. UN GeoRegions
  2. Ethnologue Classification - A language-development-based aggregation

Custom Classifications

For the purpose of economic analysis, it is not necessary to know which country an individual's name has its roots in; broad areas of origin are sufficient. Areas that are of particular interest might include:

  1. Western European (including the colonies)
  2. Scandinavian
  3. Slavic
  4. Hispanic/Latin
  5. Chinese (including Chinese 'dependencies')
  6. Arab (Muslim)
  7. Israel (Jewish)
  8. African
  9. India/Pakistan
  10. Asia-Pacific (Philippines, Vietnam...)
  11. Korean
  12. Japanese
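
As a minimal sketch of how countries might be aggregated into the broad areas above, the following Python snippet rolls per-country counts up into areas. The country-to-area assignments in AREA_BY_COUNTRY and the function names area_of and aggregate are illustrative assumptions, not assignments made in this page; a real table would have to cover every country and settle the ambiguous cases.

```python
from collections import Counter

# Hypothetical, deliberately incomplete country-to-area table.
# The assignments are assumptions made for illustration only.
AREA_BY_COUNTRY = {
    "France": "Western European",
    "Sweden": "Scandinavian",
    "Norway": "Scandinavian",
    "Russia": "Slavic",
    "Mexico": "Hispanic/Latin",
    "China": "Chinese",
    "Egypt": "Arab (Muslim)",
    "Israel": "Israel (Jewish)",
    "Nigeria": "African",
    "India": "India/Pakistan",
    "Pakistan": "India/Pakistan",
    "Philippines": "Asia-Pacific",
    "Vietnam": "Asia-Pacific",
    "South Korea": "Korean",
    "Japan": "Japanese",
}


def area_of(country):
    """Return the broad area for a country, or 'Unclassified' if unknown."""
    return AREA_BY_COUNTRY.get(country, "Unclassified")


def aggregate(country_counts):
    """Sum per-country counts into per-area counts."""
    totals = Counter()
    for country, n in country_counts.items():
        totals[area_of(country)] += n
    return dict(totals)
```

For example, aggregate({"Sweden": 3, "Norway": 2, "Japan": 1}) combines the two Scandinavian countries into a single count. Countries missing from the table fall into an explicit 'Unclassified' bucket rather than being silently dropped.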

Other Classifications

Another potentially useful classification of names is based on differences in writing systems (http://en.wikipedia.org/wiki/Writing_system). The following is a loose list of the major writing systems of the world:

  • Latin (alphabetic)
  • Cyrillic (alphabetic)
  • Hangul (featural alphabetic)
  • Other alphabets
  • Arabic (abjad)
  • Other abjads
  • Devanagari (abugida)
  • Other abugidas
  • Syllabaries
  • Chinese characters (logographic)
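
A name written in its original script could be assigned to one of these classes roughly as follows. This is a sketch, assuming that the first word of each character's Unicode name (as returned by Python's standard unicodedata module) identifies its script. The SCRIPT_CLASSES table and the writing_system function are hypothetical, and the table covers only a handful of scripts.

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from the first word of a Unicode character name
# to the writing-system classes listed above. Incomplete by design.
SCRIPT_CLASSES = {
    "LATIN": "Latin (alphabetic)",
    "CYRILLIC": "Cyrillic (alphabetic)",
    "HANGUL": "Hangul (featural alphabetic)",
    "GREEK": "Other alphabets",
    "ARMENIAN": "Other alphabets",
    "GEORGIAN": "Other alphabets",
    "ARABIC": "Arabic (abjad)",
    "HEBREW": "Other abjads",
    "DEVANAGARI": "Devanagari (abugida)",
    "BENGALI": "Other abugidas",
    "TAMIL": "Other abugidas",
    "THAI": "Other abugidas",
    "HIRAGANA": "Syllabaries",
    "KATAKANA": "Syllabaries",
    "CJK": "Chinese characters (logographic)",
}


def writing_system(name):
    """Return the most common writing-system class among the letters of a name."""
    counts = Counter()
    for ch in name:
        if not ch.isalpha():
            continue  # skip spaces, digits, punctuation, combining marks
        try:
            prefix = unicodedata.name(ch).split()[0]
        except ValueError:
            continue  # character has no Unicode name
        counts[SCRIPT_CLASSES.get(prefix, "Unclassified")] += 1
    if not counts:
        return "Unclassified"
    return counts.most_common(1)[0][0]
```

Taking the majority class over the letters keeps a name with a stray character from another script in its dominant class. Note that this only works on names in their original script; a name that has already been romanized will always be classified as Latin.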