Hubs

From edegan.com
Revision as of 15:50, 18 March 2016 by imported>Rachel (→‎Supplementary Data Sets)

The Hubs Research Project is a full-length academic paper analyzing the effectiveness of "hubs", a component of the entrepreneurship ecosystem, in the advancement and growth of entrepreneurial success in a metropolitan area.

This research will primarily focus on large and mid-sized Metropolitan Statistical Areas (MSAs), as that is where the great majority of Venture Capital funding is located.

Primary Data Set

The Hubs data set, from SDC Platinum, is currently being constructed.

The data set includes all United States Venture Capital transactions (moneytree) from 1990 through 2015. Data has been accumulated at the portfolio company, fund, and round level, and will be analyzed at the MSA level. We will look at the number of companies funded, the number of funds active, and the flow of investment in a given MSA.


The data set has now been uploaded to the database server under the name Hubs. There are four tables: Rounds, CombinedRounds, Companies, and Funds.
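The MSA-level measures named in this section (companies funded, funds active, and flow of investment) can be illustrated with a minimal Python sketch. The field names (msa, year, company, fund, amount) and the sample rows are assumptions made up for illustration; they are not the actual columns or contents of the Rounds table.

```python
from collections import defaultdict

# Hypothetical round-level records. Field names and values are illustrative
# assumptions, not the real schema or data from the Hubs database.
SAMPLE_ROUNDS = [
    {"msa": "MSA A", "year": 2014, "company": "Alpha", "fund": "Fund I", "amount": 2.0},
    {"msa": "MSA A", "year": 2014, "company": "Alpha", "fund": "Fund II", "amount": 1.5},
    {"msa": "MSA A", "year": 2014, "company": "Beta", "fund": "Fund I", "amount": 3.0},
    {"msa": "MSA B", "year": 2015, "company": "Gamma", "fund": "Fund III", "amount": 4.0},
]


def summarize_by_msa(rounds):
    """Aggregate round-level records to one summary per (MSA, year)."""
    companies = defaultdict(set)   # distinct companies funded
    funds = defaultdict(set)       # distinct funds active
    invested = defaultdict(float)  # total amount invested

    for r in rounds:
        key = (r["msa"], r["year"])
        companies[key].add(r["company"])
        funds[key].add(r["fund"])
        invested[key] += r["amount"]

    return {
        key: {
            "companies_funded": len(companies[key]),
            "funds_active": len(funds[key]),
            "investment": invested[key],
        }
        for key in sorted(invested)
    }


if __name__ == "__main__":
    for key, row in summarize_by_msa(SAMPLE_ROUNDS).items():
        print(key, row)
```

Sets are used so that a company or fund appearing in several rounds is counted only once per MSA and year. Whether the analysis should be per year or over the whole period is a choice the sketch does not settle.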

Hub Candidates Data Set

The Hub candidates data set is a list of potential hubs found in MSAs throughout the country. Researchers are currently pulling qualitative and quantitative information from the candidates' websites in an attempt to categorize what can be identified as a hub. This is a difficult data set to assemble: little to no quantitative information is available for this category of institution, and what can be collected depends on how much information each candidate makes public online.

Characteristics/Variables

  • Year Founded
  • Square footage
  • LinkedIn self-identifiers (how the organization classifies itself on its LinkedIn profile)
  • Activeness on Twitter (binary)
  • Member Directory available online (binary)
  • Number of conference rooms
  • Price ($/month) for Flex desk
  • Offers Reserved desk (binary)
  • Offers office space for rent (binary)
  • Offers community membership, not for coworking but for community events, etc. (binary)
  • Number of events offered per month (estimate)
  • Offers code academy
  • Mission Statement/Vision (for qualitative or key-word analysis)

These characteristics/variables will be used to determine whether a candidate is or is not likely to be a Hub.
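As a rough illustration of how one candidate's variables might be recorded, and how the key-word analysis of a mission statement could start, here is a minimal Python sketch. The class name HubCandidate, its field names, the function keyword_counts, and the sample keywords are all hypothetical; the project's actual coding scheme and classification rule are not specified on this page.

```python
import re
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class HubCandidate:
    """One hub candidate. Fields default to None because much of this
    information may not be published on a candidate's website."""
    name: str
    year_founded: Optional[int] = None
    square_footage: Optional[int] = None
    active_on_twitter: Optional[bool] = None
    member_directory_online: Optional[bool] = None
    conference_rooms: Optional[int] = None
    flex_desk_price: Optional[float] = None  # $/month
    offers_reserved_desk: Optional[bool] = None
    offers_office_space: Optional[bool] = None
    offers_community_membership: Optional[bool] = None
    events_per_month: Optional[int] = None  # estimate
    offers_code_academy: Optional[bool] = None
    mission_statement: str = ""


def keyword_counts(text, keywords):
    """Count case-insensitive whole-word occurrences of each keyword."""
    words = Counter(re.findall(r"[a-z]+", text.lower()))
    return {k: words[k.lower()] for k in keywords}


if __name__ == "__main__":
    candidate = HubCandidate(
        name="Example Space",  # made-up candidate
        mission_statement="We build community. Our community supports startups.",
    )
    print(keyword_counts(candidate.mission_statement, ["community", "startups", "mentor"]))
```

Keeping unknown values as None, rather than False or 0, keeps "not offered" distinct from "not stated on the website", which matters given how uneven the public information is.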

As of March 10, 2016, the list contains 125 Hub candidates.

Supplementary Data Sets

Patent data: to be pulled from USPTO or SDC Platinum.

  • could not be found online; must be pulled from the larger data set

Number of STEM Graduate Students (NSF) and University R&D Spending (NSF): graduate student counts found for 2015 only, with no historical data; R&D spending found for the past 10 years.

  • universities categorized by MSA

Per Capita Income and Employment Data (US Census Bureau): complete for the most recent census; historical data could not be found.

Firm Births (BDS): data set found for 1990 to present; currently being cleaned up for use.

Resources

To Do

We need to find and clean up data sets at the MSA level:

  • Patent data (USPTO)
  • Number of STEM Graduate Students (NSF)
    • in progress
  • University R&D Spending (NSF)
  • Per Capita Income (US Census)
    • complete (Employment and Income_MSA.xls)
  • Employment (US Census)
    • complete (Employment and Income_MSA.xls)
  • Firm births (BDS)
  • select MSAs