Changes

Jump to navigation Jump to search
1,533 bytes added ,  12:39, 23 September 2019
The two USMAPrivate100pc queries are different. The first pulls just date announced, date effective, target name, target state and tv. The second adds basic acquirer information from 2013 forward (to allow retroactive revision by Thomson for 5+ years) and can be combined with MAUSTargetComp100pc1985-July2018.txt (after adjusting the spacing) to make USMAPrivate100pc2013Full. For some reason, the query always fails with an out of memory message when trying to pull the whole thing.
USSDCRound1980 was updated to remove fields that should have been in USVCPortCos1980 only. When normalizing be sure to only copy down key fields. USMAPrivate100pc1985 was updated to reflect the MAs load in LoadingScriptsV1. There wasn't a good original. We are using 1985 forward as there are data issues that prevent download/extraction for the 1980-1984 data. Year completed was added as a check variable but might have been the source of issues and so was removed. Date Effective can be used instead. And USIPOComp1980 was updated to allow all exchanges (not just NNA). I couldn't require completion in the search, so that will have to be done in the dbase. USVCFund1980 was updated because some variables -- those concerned with the fund's name and fund address -- had changed name. USTRoundOnOneLine1980 was fixed so that it is just the key (coname,statecode,datefirst) and the round info field, so that it works with the RoundOnOneLine.pl script. Finally, note that USPortCoLongDesc1980 needs processing separately(see below).
===Long Description===
cat In5.txt | perl -pe 's/\s{2,}/ /g' > Out4.txt
 
===Round On One Line===
 
The process is run USVCRoundOnOneLine1980.ssh with USVCRoundOnOneLine1980.rpt to generate USVCRoundOnOneLine1980.txt, then remove the footer and:
perl Normalizer.pl -file="USVCRoundOnOneLine1980-NoFoot.txt"
copy down the key (0,1,2)
perl RoundOnOneLine.pl -file="USVCRoundOnOneLine1980-NoFoot-normal.txt"
then put the header back in!
 
===Everything else===
 
Just run the Normalizer. Only copy down key fields -- never copy down anything else as it will introduce data errors. The primary and foreign key fields, which may still need cleaning in SQL to be valid, are (they are marked with * if they should be copied down):
 
*USVCRound1980 -- coname*, statecode*, datefirst*, rounddate
*USVCPortCo1980 -- coname, statecode, datefirst
*USVCRoundOnOneLine1980 -- Coname*, statecode*, datefirst*, rounddate, fundname
*USVCFund1980 -- fundname, firmname
*USVCFirms1980 -- firmname
*USPortCoLongDesc1980 -- coname*, statecode*, datefirst*
*USVCFirmBranchOffices1980 --firmname
*USIPO1980 -- issuer
*USVCPortCoExecs1980 -- coname*, statecode*, datefirst*
*USVCFundExecs1980 -- fundname*, and maybe fundyear*
*USMAPrivate100pc2013 -- dateeffective, targetname, acquirorname
 
Not that USMAPrivate100pc2013 and USIPO1980 have some non-numerics in their value fields, and we are generally going to have take care of some type issues.

Navigation menu