All rights are reserved on my tools and scripts, and on my documentation of how to work with various data sources. The data sources themselves belong to their respective owners.

My four most popular tools are:
*'''The Matcher''': A firm name matching tool capable of joining very large datasets that use firm name identifiers. It has been used to join the NBER patent data to COMPUSTAT and CRSP for the NBER Patent Data Project, and has a loyal following.
*'''Normalizer.pl''': A script for processing SDC data into third normal form, ready for import into a database.
*'''STATA-Fix-Regressions.pl''': A script that had a loyal following before OUTREG2 (and other tools) got so good.
*'''BibTucker.pl''': A script for doing weird and wonderful things to large files of BibTeX entries.
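
To give a rough idea of what joining datasets on firm names involves, here is a minimal sketch in Python. It is not The Matcher and makes no claim about how The Matcher works. The suffix list and the function names <code>normalize_firm_name</code> and <code>match_firms</code> are assumptions made up for this illustration; it only shows the general idea of normalizing names before an exact join.

```python
import re
from collections import defaultdict

# Hypothetical, deliberately short list of legal-form suffixes to ignore.
# A real matching tool would need a far more complete list.
SUFFIXES = {"inc", "corp", "corporation", "co", "company", "ltd", "llc", "plc"}


def normalize_firm_name(name):
    """Lowercase, replace punctuation with spaces, drop trailing legal suffixes."""
    lowered = name.lower().replace("&", " and ")
    cleaned = re.sub(r"[^a-z0-9 ]", " ", lowered)
    tokens = cleaned.split()
    while tokens and tokens[-1] in SUFFIXES:
        tokens.pop()
    return " ".join(tokens)


def match_firms(left, right):
    """Pair raw names from two lists whose normalized forms are identical."""
    index = defaultdict(list)
    for name in right:
        key = normalize_firm_name(name)
        if key:  # skip names that normalize to nothing
            index[key].append(name)
    pairs = []
    for name in left:
        key = normalize_firm_name(name)
        for candidate in index.get(key, []):
            pairs.append((name, candidate))
    return pairs


if __name__ == "__main__":
    print(match_firms(["Acme Widgets, Inc."], ["ACME WIDGETS INC", "Other Co"]))
```

Building an index over one list keeps the join close to linear in the number of names, which matters once the datasets get large. Exact matching on normalized names will still miss misspellings and abbreviations.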

These tools are available to researchers who provide sufficient reassurances.

==Data==

Pages on data sources include:
*[[NBER Patent Data]]
*[[VentureXpert]]
*[[Data Dictionaries]]

==Tools and Scripts by application==

Geocoding:
*[[Geocoding Inventor Locations]]
*[[GEOnet Names Server]]
*[[UN GeoRegion Codes]]

Classifying surnames:
*[[Classifying Names by Culture]]
*[[Culture Based Classifications]]
*[[Normalizing Surnames]]
*[[Sources of Surname Data]]
*[[Extracting Features from Surnames]]

Political Contributions:
*[[Political Contributions By Venture Capitalists]]

General Perl:
*[[PhD Masterclass - How to Build a Web Crawler]]