Difference between revisions of "Contruction of the Cultural Homogeneity in VC Dataset"
Jump to navigation
Jump to search
(New page: This page details the (currently 1st draft version) construction of the Cultural Homogenteity in Venture Capital Data. ==Constraints on Retrieved Data== The following constraints were pl...) |
|||
Line 40: | Line 40: | ||
==Lookup Tables== | ==Lookup Tables== | ||
− | Lookup tables for Stage, | + | Lookup tables for Stage, State and VentureXpert Minor Industry Code are provided on the [[VentureXpert]] page. The numeric values were recorded in the database. |
+ | |||
+ | ==Processing the Data== | ||
+ | |||
+ | The following problems were addressed and steps undertaken: | ||
+ | *Executives (both Company and Fund executives) have multiple job titles | ||
+ | *Executive Name Prefixes were parsed for Dr and Ms (recorded as binary codes) |
Revision as of 19:37, 1 August 2009
This page details the (currently 1st draft version) construction of the Cultural Homogenteity in Venture Capital Data.
Contents
Constraints on Retrieved Data
The following constraints were placed in the retrieval of data from VentureXpert (via SDC Platinum):
- Portfolio Company Date of First Investment: 2003-2007 inclusive
- Portfolio Company Nation: US
- Venture Capital Fund Nation: US
- Portfolio Company Standard US Venture Disbursement: Yes (Note: Correlates almost perfectly with PWC Moneytree VC Deals - excludes private equity, angel investment, and other non-VC, as recognised by Thomson)
- Venture Capital Fund PWC Moneytree Deals: Yes (see above)
Retrieved Data by VentureXpert Perspective
The data was processed by the Normalizer.pl perl script to produce 3rd normal form relational database tables.
Portfolio Companies
- Company (Primary Key: CoExex-1)
- CoExec-1, DateCompanyReceivedFirstInvestment, CompanyFoundingDate, CompanyZipCode, CompanyCity, CompanyAreaCode, CompanyCounty, CompanyName, CompanyNation, CompanyNationCode, CompanyState, CompanyStateCode, CompanyStreetAddressLine1, CompanyStreetAddressLine2, CompanyIndustryClass, CompanyIndustryMajorGroup, CompanyIndustryMinorGroup, CompanyIndustrySubGroup1, CompanyIndustrySubGroup2, CompanyIndustrySubGroup3, StandardUSVentureDisbursement
- CoExec (Primary Key: Co-Exec-3)
- CoExec-1, CoExec-3, ExecutiveisNonManagingBoardMember, ExecutiveisPrimaryContact, ExecutivesCity, ExecutivesFirstName, ExecutivesLastName, ExecutivesNamePrefix, ExecutivesEMailAddress, ExecutivesPhoneNumber, ExecutivesPreviousPosition
- CoExecTitle (Primary Key: Co-Exec-2)
- CoExec-1, CoExec-2, ExecutivesJobTitle
Fund
- Fund
- FundExec-1, FirmFundList1stCloseDateofeachFund, FirmFoundingDate, FundInitialClosing, FundAreaCode, FundCity, FundCounty, FundStageFocus, FundInvestmentType, FundMSA, FundMSACode, FundName, FundNation, FundNationCode, FundRaisingStatus, FundSequenceNo, FundSequenceType, FundSizeMil, FundState, FundStateCode, FundTargetSize000, FundTypeLongDescription, FundTypeShortDescription, FundYear, FundZipCode, FirmReportedCapitalunderMgmtMil, FirmNation, FirmName, FirmGeographyPreference, FirmIndustryPreference, FirmPreferredInvestmntRoleCode, FirmInvestmentStagePreference, FirmPreferredMaxInvestmentMil, FirmPreferredMinInvestmentMil, FirmState, FirmZipCode, FirmStateCode, PWCMoneytreeDealsYN, FirmFundListNameofeachFund
- FunExec
- FundExec-1, FundExec-3, ExecutiveisNonManagingBoardMember, ExecutiveisPrimaryContact, ExecutivesCity, ExecutivesFirstName, ExecutivesLastName, ExecutivesNamePrefix, ExecutivesPreviousPosition
- FundExecTitle
- FundExec-1, FundExec-2, ExecutivesJobTitle
Disbursement
- Round
- Round-1, Round-2, RoundDates, CompanyStageLevel1ateachRoundDate, RoundAmtDisclosed000, RoundAmtEstimated000, RoundNumbers, NumberofInvestorseaRound
- RoundCompany
- Round-1, CompanyName, CompanyState, CompanyStateCode, TotalKnownAmtInvestedinCompany000
- RoundInvestor
- Date, DisclosedAmtk, Investor, Round-1, Round-3
Lookup Tables
Lookup tables for Stage, State and VentureXpert Minor Industry Code are provided on the VentureXpert page. The numeric values were recorded in the database.
Processing the Data
The following problems were addressed and steps undertaken:
- Executives (both Company and Fund executives) have multiple job titles
- Executive Name Prefixes were parsed for Dr and Ms (recorded as binary codes)