Abstract

Academic Paper
Title	Hubs Scorecard (Academic Paper)
Author	Ed Egan, Yael Hochberg
RAs	Todd Rachowin, Ariel Sun
Status	Tabled
	© edegan.com, 2016

As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics.

Work in Progress

Our goal is to automate the collection of variables for potential hubs as much as possible. The key steps for the project are:

Creating a comprehensive list of potential hubs (Complete)
Determining the best variables for the scorecard (Complete)
Building "filters" for automating the collection (Complete)
Running and auditing of the automation (In Progress)
Collecting the remaining manual data (next step)

For the detailed current work in progress for building the Hubs datasheet for the scorecard go to: Hubs: Hubs Scorecard
For a tracker of work in progress for the dataset building for the scorecard go to Hubs: Hubs Data Building
For a high-level overview of the variables for the scorecard go to Hubs: Hubs Data
For more information on Mechanical Turks in general, see Mechanical Turk (Tool).
Comprehensive list of potential hubs can be found at:

1. E:\McNair\Projects\Hubs\Raw Program List Contains 600 entities - vast majority are firmly not hubs (file pedigree unknown)
2. E:\McNair\Projects\Hubs\Hubs Data - Contains 125 entities - many are not hubs (overlap with above file unknown, this file's pedigree from old Hubs project).

Hubs Data

(7/27 Onwards)

Collected variables for 30 hubs that are surely hubs. The results are here:

E:\McNair\Projects\Hubs\Hubs Variables -Ariel.xls

(Until 7/27)

See Hubs: Hubs Scorecard

(Week of 7/11)

1) We published the twitter count on mechanical turk and received results.

2) We have audited the results and updated the amazon.

3) We are creating additional potential turks on the amazon site (See Hubs: Hubs Scorecard)

4) We are finding more potential hubs from members of international national business innovation association

(Week of 7/4)

1) We have created the list and commented our thoughts after ---. For determining the variables, we have separated the list into two parts: a list of desired variables and ones that were previously collected, many of which are desired variables.

2) We have also created an example of how to write mechanical turks for collecting certain variables

Variables to be Used

Old variable list (see Hubs Data.xls) contains 18+3 variables. Overlap with new variable list is ~50%

Current Complete List

As of Week of 7/11

Onsite Venture Capital
- Assets Under Management
- Number
Onsite Angel Investors
Onsite Mentors
Founding Date
Site URL
Office hours investors
Office hours mentor/advisors
Onsite temporary workshops
Networking Meetups
Sponsors/Partners
- University
- Corporate
Curriculum
Onsite code school
Alumni Network
Nonprofit status
Mission statement
Specific Industry
Price for a space
Price for office
Twitter activity
Size (sqft)
Size (# companies)
Onsite accelerator
Community membership??
Franchise
Multiple locations within city

Grouping of Variables

There are a few categories the majority of the variables fall under

Group 1: Low Hanging Fruit Variables in this group are very easy to find and automate.

Twitter Activity
URL
Address
Mission Statement
Specific Industry
Nonprofit
Sponsors/Partners
Price for a space + office
Founding Date

Group 2: The Difficult to Find There are certain variables where the information is not readily available online or difficult to find.

Size (can try to find press releases)

Group 3: In Between 1 and 2 Variables that aren't too easy or difficult to find and automate.

Onsite accelerator
Alumni mentor

Group 4: The Hard to Differentiate The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate. In order to fix this, we will need to create filters akin to the DSM5 scorecard. See the below section.

Onsite VC v. Angel Investors
Onsite OH Investors v. mentors
Onsite temporary workshops v. networking events
Curriculum v. code school

General Approach Group 4

The Scorecard will be broken down into three main parts: description, characteristics, andTBD parts. The procedure for creating these will be as follows: the description will be determined, develop the characteristics after looking over examples, the creation of possible mechanical turks that have complete accuracy even if not comprehension (e.g. a task will that always guarantees that there is an onsite mentor that covers only 40% of firms, but never misspecifies the existence of mentors), and auditing of the results.

Group 5: The Need further Discussion Before Collection Variables that need to be developed more prior to collection.

Franchise and multiple locations within a city
Community Membership

Hubs Scorecard (Academic Paper)

Contents

Abstract

Work in Progress

Hubs Data

Variables to be Used

Current Complete List

Grouping of Variables

General Approach Group 4

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Sites

Sections

Organizations

Help

Tools