Changes

Jump to navigation Jump to search
*This page is a part of series in [[Classifying Names by Culture]] This page details the (currently 1st draft version) construction of the Cultural Homogenteity in Venture Capital Data. This data is posted for reference (and debate) by project members. The choices made in constraints, variables, processing, and so forth are often arbitrary and will no doubt be changed as the project progresses.
==Constraints on Retrieved Data==
==Retrieved Data by VentureXpert Perspective==
The data was processed by the [[Normalizer.pl]] perl script to produce 3rd normal form relational database tables. In the case of executive titles this was problematic and required rekeying in the database.
===Portfolio Companies===
*Executives (both Company and Fund executives) have multiple job titles - the first listed title was taken (requires rekeying)
*Executive Name Prefixes were parsed for Dr and Ms (recorded as binary codes)
*Anglicized First Names and genders were (incompletely) identified using the [[SSA Baby Names | U.S. Social Security Administration Baby Names list]]
*Company Executives with the following SQL constraint were marked as founders:
**Like "*Founder*" Or Like "*Chief Exec*" Or Like "*CEO*" Or (Like "President*" Or "Co-President") Or Like "*Chairman*"
*Company Executives marked as "N" for Non-Managing Board Members were coded as Managing
*Fund Executives meeting the following SQL constraint were marked as "Partners" (broadly defined):
**(Like "*Partner*" And Not Like "Admin*" And Not Like "Advisor*" And Not Like "Limited*") Or "Principal" Or "Director" Or "Managing Director" Or "CEO" Or Like "*Founder*" Or Like "*Investment*" Or Like "*Executive Officer*"
*Fund Executives were marked as ServedOnBoard if they appeared on the board of any portfolio company in the dataset at any time
*The Company and fund zip codes were standardized to 5 digits
*Anglicized First Names Companies must not have null or "Undisclosed" names*Funds must not be named "Undisclosed Fund" and genders must have participated in a first round of investment in the period of interest (i.e. within the constraint on date of first investment on companies) ==Output Files== Based on the above processing, the following comma-seperated, text-quoted, text files were created (incompletelyNote requires ftp login) identified using the :*[ftp://ftp.edegan.com/repository/CulturalVC-CoExecutives.txt CulturalVC-CoExecutives.txt]*[SSA Baby Names | Uftp://ftp.edegan.com/repository/CulturalVC-FundExecutives.Stxt CulturalVC-FundExecutives. Social Security Administration Baby Names listtxt]*[ftp://ftp.edegan.com/repository/CulturalVC-Rounds.txt CulturalVC-Rounds.txt]
Anonymous user

Navigation menu