Changes

Jump to navigation Jump to search
141 bytes added ,  15:57, 4 August 2017
*The stata do file '''hubs_load_data.do''' uploads the final hubs data file. This is the raw data that includes variables early funding, late funding, number of early funding deals, number of late funding deals, dummies for whether a hub is present etc. The do file processes data, creates more variables. It also estimates the hazard rate model, and matches the sample based on estimated hazard ratios. It eliminates all city_states that were never matched with a neighbor in ANY year from 2000-2017. The output from this file is '''matched_hubs.dta'''.
*We need a panel data with controls and treatments, such that for each matched control-treatment pair, with pairid = n, in '''matched_hubs.dta''' we assign the same pairid = n to observations of city_states 2 periods before and after they were assigned control and treatment status originally. The matlab code in '''time_matching_hubs.m''' performs this. However the resulting panel has problems of overlap. For instance if chicago_2008 was treatment and boulder_2014 was control with pair id 200 in the original hazard rate match, chicago_2009 is a control in some pair 203 and this observation gets matched with our pair id 200. Need to resolve this.  *The stata do file '''hubs_load_clean.do''' uploads this final panel data set - but this panel has the problem of overlaps discussed above.
76

edits

Navigation menu