Tools, Scripts, and Data

All rights are reserved on my tools and scripts, as well as on my documentation of how to work with various data sources. The data sources themselves belong to their respective owners.

My four most popular tools are:

The Matcher: A firm-name matching tool capable of joining very large datasets that use firm names as identifiers. It has been used to join the NBER patent data to COMPUSTAT and CRSP for the NBER Patent Data Project, and has a loyal following.
Normalizer.pl: A script for processing SDC data into third normal form, ready for import into a database.
STATA-Fix-Regressions.pl: A script that had a loyal following before OUTREG2 (and other tools) got so good.
BibTucker.pl: A script for doing weird and wonderful things to large files of BibTeX entries.

These tools are available to researchers who provide sufficient reassurances.

Data

Pages on data sources include:

Tools and Scripts (by application)

Geocoding:

Classifying surnames:

Political Contributions:

Political Contributions By Venture Capitalists

General Perl:

PhD Masterclass - How to Build a Web Crawler

