Changes

Jump to navigation Jump to search
2,339 bytes added ,  13:46, 21 September 2020
no edit summary
{{Project
|Has project output=Data,Tool,How-to
|Has sponsor=Kauffman Incubator Project
|Has title=vcdb4
|Has owner=Ed Egan,
and others.
The build should be done as quickly but cleanly as possible, as it is needed right away but also will likely need to be updated in January of 2020 to reflect 2019's year -end. ===Notes=== There appears to be an error in the M&A data processing, which needs exploration. Compare: SELECT * FROM portcoexit WHERE coname ILIKE 'Augmentix%' OR coname ILIKE 'CheMatch%' OR coname ILIKE 'Hispanic Television%' OR coname ILIKE 'IDev T%' OR coname ILIKE 'Merrick%' OR coname ILIKE 'RosettaMed%' OR coname ILIKE 'SAT Corp%' OR coname ILIKE 'Visualase%'; SELECT * FROM mas WHERE targetname ILIKE 'Augmentix%' OR targetname ILIKE 'CheMatch%' OR targetname ILIKE 'Hispanic Television%' OR targetname ILIKE 'IDev T%' OR targetname ILIKE 'Merrick%' OR targetname ILIKE 'RosettaMed%' OR targetname ILIKE 'SAT Corp%' OR targetname ILIKE 'Visualase%'; 
==SDC Platinum Requests==
We will use the old Tiger Places, but it is worth exploring how to rebuild this from scratch. First, visit https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-line-file.html. Then download each state's place file. Load them and merge them. See [[PostGIS_Installation#Bulk_Download_TIGER_Shapefiles]] for more information. It is also worth reading the [[Tiger Geocoder]] and [[Jeemin Sim (Work Log)]].
 
====CPI====
 
We loaded the old CPI file, updating it with real numbers for 2018 and an estimate for 2019 based on extrapolating the last four months from the first eight, and then computing an annual average. The inflator is normalized to 2017 dollars. Data was taken from: https://www.bls.gov/cpi/tables/supplemental-files/historical-cpi-u-201908.pdf
====Other things to do====
**Agglomeration!
**Note: Needs updating to reflect latest decisions
 
===BuildBaseTables===
 
To this end, the base tables, which are used by multiple different projects, are built in BuildBaseTables.sql
 
Build notes:
*Common code from MatchingEntrepsV3.sql was moved to BuildBaseTables.sql
*Almost all of the women's code now appears to be in the new base
*The super tables still need building
*The code for the Matching VC-Entreps project is now almost entirely in RevisedDBaseCode.sql
 
==Agglomeration==
 
This build supports the [[Urban Start-up Agglomeration and Venture Capital Investment]] paper. It uses the sql script: AgglomerationVcdb4.sql, which is in E:\projects\agglomeration. The notes are here: [[Urban_Start-up_Agglomeration_and_Venture_Capital_Investment#vcdb4]]
==Ranking==
A new Ranking.sql file was created in E:\projects\vcdb4 and run.
 
There are imperfect matches between cities in VentureXpert and places in Tiger. In the past, this hasn't mattered as we'd considered such a small sub set of cities (and had gotten lucky on coverage). Geocoding covers 97.2607% (47.760 out of 49,136) of companybasecore, 48,815 (99.35%) of which have a non-null city. This drops to 96.42% of firms with datefirstinv >= 2010. Using just geocoded would therefore cost us a small number of firms, but these may not be randomly distributed. What we need, then is a complete lookup table...

Navigation menu