Changes

The objective is to add city ranking and serials to a linear model, and possibly also the number of coinvestors and the interaction of VC experience with the number of coinvestors.
The data for the linear model should include real and synthetic matches. However, to make it comparable to Chenyu's data, we likely need to exclude some markets.
Marcos used Z:\VentureXpertDB\vcdb3\MasterRealOneSynth.txt as a base dataset, which contained only a single synth. However, in lpm_full.do, he loads MasterWithSynthcode20.txt instead. Note that some of Marcos' do files were not in the dropbox but were in E:\mcnair\Projects\MatchingEntrepsToVC\Stata\DoFiles
Data notes:
*Exit, exitvalue and related measures are going to be right censored
 
In data_import2.m, Chenyu applies the following restrictions, which I replicate in Stata (counts are mine):
*He starts with MasterC20YearFull.txt, rather than MasterRealC20YearFullPlus.txt (which suggests he isn't using the latest data)
*Markets are defined by pccode20 and dealminroundyear
*Removes unmatched VCs and startups (shouldn't be in latest dataset?)
*Requires that matched VCs have synthetics with all startups in the market (should be redundant now?)
*Requires there to be >=5 and <=10 real matches in a market
**This reduces the number of obs from 445,710 to just 59,205
*Requires the year to be >=1990 and <=2001
**This keeps 142,738 out of 445,710 obs, or 18,055 out of the 59,205 left after the market-size restriction
*Removes duplicates (should be redundant with revised data?)
*Removes markets with marketid NaN (not clear why this happens)
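
The market-size, year, and missing-market restrictions above can be sketched as follows. This is a minimal illustration in plain Python, not Chenyu's data_import2.m or the Stata clone. The column names pccode20 and dealminroundyear come from the notes above; the real flag (True for a real match, False for a synthetic one), the use of None for a missing market, and the function names are assumptions made for the example.

```python
from collections import Counter


def market_of(row):
    """Market = (pccode20, dealminroundyear), as in the notes above."""
    return (row["pccode20"], row["dealminroundyear"])


def restrict_sample(rows, min_real=5, max_real=10,
                    first_year=1990, last_year=2001):
    """Keep dyads in markets with min_real..max_real real matches
    and a market year within first_year..last_year (inclusive)."""
    # Drop rows whose market cannot be identified (the NaN marketid case;
    # None stands in for NaN here, which is an assumption).
    rows = [r for r in rows if None not in market_of(r)]

    # Count real (non-synthetic) matches per market.
    # "real" is a hypothetical column name.
    real_counts = Counter(market_of(r) for r in rows if r["real"])

    # Keep both real and synthetic dyads in the surviving markets.
    return [
        r for r in rows
        if min_real <= real_counts[market_of(r)] <= max_real
        and first_year <= r["dealminroundyear"] <= last_year
    ]
```

Applying the market-size filter before the year filter, or the reverse, gives the same result here because both conditions are evaluated per row against counts taken from the same cleaned data.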
 
In master_dyad.m, Chenyu has year bounds of 2002 and 2016. With an upper bound of 2016, exits are likely to be right censored.
 
===Notes from Conference Call===
