Changes

Jump to navigation Jump to search
670 bytes added ,  13:52, 23 July 2020
*Change the path to ../vcdb20h1/sdc (usually 3 places per file).
These Originally these SDC requests have the following constraints:*Venture related deals only (see below)
*Deals from 1/1/1980 to 07/20/2020
*US companies (also targets and issuers)
*100pc owned after the acquisition (eliminated, see below)
Note However, it appears that Thomson-Reuters have drastically redefined their 'venture related' flag, as about 25% of the MA data appears missing now: Using the venture-related flag, we now have 42,005 portco records, whereas we did have 49,250 in the last pull around 10 months ago...I tried re-pulling PortCo without the venture-related constraint but with a flag for venture-related and buyout (STD-VC and STD-Buyout), which appear the only two such options now (there used to be many more to choose from 1985 ). This gave 68,596 records, of which 42,068 were venture, 26,527 were not venture, 20,208 were buyout, 48,387 were not buyouts, and 2,2256 were both and 8,575 were neither. I noticed that zozi.com was missing from the new pull, and that it had some early stage deals. It is marked as neither venture nor buyout. I therefore repulled everything without the constraint but with a venture-related and buyout flag at the basics only failsportco level. I omitted "VC" from the new file names. Delete Note that you should select Columinar Grid and not text as the script output method, and move on! The other pull adds data to an older pulldon't both appending the session details.  I redid also re-did the M&A queries, pulling more info from 1980 to 2020H1 in four goes. This I removed the 100pc owned constraint from the search, as I'm concerned that when it isn't reported, observations are omitted. It can be added back later. I also included completed vs. withdrawn, pc cash and stock, and some other useful measures. Then I pulled all the non-private acquisitions in two goes as well. After these changes, there are only two constraints:*Deals from 1/1/1980 to 07/20/2020*US companies (also targets and issuers) The four private M&A pulls were normalized and then combined to create:
USMAPrivate80-20H1-normal.txt
We might want The two public (actually this includes everything that isn't private) M&A pulls were normalized and then combined to take a look at what NOT venture related deals might yield..create: USMAPublic80-20H1-normal.txt
Then normalize Normalizing the files. Generally, this is generally straight-forward. Only Use NormalizeFixedWidth.pl and only copy down the missing keys (e.g., coname, statecode, datefirst, etc.), for . For most files there is nothing to fix, just select the last header row and go. For RoundOnOneLine, remove the footer, run NormalizeFixedWidth.pl first then RoundOnOneLine.pl, and then fix the header. For fund, fund execs, and portco, remove double quotes after normalization.
For PortCoLongDescription (see [[Vcdb4#Long_Description]]):
*Remove the header and footer and save as Process.txton the dbase server, making sure it is encoded as UNIX and UTF-8.
*Run the following:
cat Process.txt | perl -pe 's/^([^ ])/###\1/g' > Out1.txt
*Remove the following from just the description field: ". Save as Out4wHeaderClean.txt, making sure it is UNIX and UTF-8. Then
cat Out4wHeaderClean.txt | perl -pe 's/\s{2,}/ /g' > Out5.txt
*Copy to USVCPortCoLongDesc1980CleanedUSPortCoLongDesc1980Cleaned.txt and make any manual fixes necessary to load Do some clean up:*Remove quotes from fund, fund execs and description field of portco. Issue: *It appears that Thomson-Reuters have drastically redefined their 'venture related' flag, as about 25% of the data appears missing now: We now have 42,005 portco records, whereas we did have 49,250 (open it in the last pull around 10 months ago...*I tried re-pulling PortCo without the venture-related constraint but with a flag for venture-related excel and buyout (STD-VC and STD-Buyout), which appear the only two such options now (there used filter based on ISTEXT on row F to be many more to choose fromidentify issues).*This gave 68,596 records, of which 42,068 were venture, 26,527 were not venture, 20,208 were buyout, 48,387 were not buyouts, and 2,2256 were both and 8,575 were neither. I noticed that zozi.com was missing from the new pull, and that it had some early stage deals. It is marked as neither venture nor buyout.
==Dbase==

Navigation menu