Changes

Jump to navigation Jump to search
no edit summary
Also, for all count variables, I've log-transformed it (adding 1 before to account for zeros) as suggested by Ed Egan. For the distance variable, I've also log-transformed it. Continuous variables are not log-transformed because most of them contains zeros, and adding 1 doesn't seem to make much sense.
 
 
==Building new Dataset in SQL for a Linear Probability Model==
 
To run the linear probability model, we need to build a new dataset. This was partially done in the Stata Do-File explained above, but doing it in SQL will give the opportunity to be more flexible when choosing the synthetic match.
 
The idea is to build, for each startup, a new dataset that matches all VCs that were active in the year it matched with the real VC AND that were in the same Industry 100 code. This new set of synthetic firm for each startup will then be appended to the real match dataset.
44

edits

Navigation menu