Changes

Jump to navigation Jump to search
no edit summary
#'''Running''' and '''auditing''' of the automation (In Progress)
#Collecting the remaining manual data (next step)
 
*For the current work in progress for building the Hubs datasheet for the scorecard go to: [[Hubs: Hubs Scorecard]]
##<code>E:\McNair\Projects\Hubs\Raw Program List</code> Contains 600 entities - vast majority are firmly not hubs (file pedigree unknown)
##<code>E:\McNair\Projects\Hubs\Hubs Data</code> - Contains 125 entities - many are not hubs (overlap with above file unknown, this file's pedigree from old Hubs project).
 
 
==Variables to be Used==
 
Old variable list (see Hubs Data.xls) contains 18+3 variables. Overlap with new variable list is ~50%
 
===Current Complete List===
'''As of Week of 7/11'''
#Onsite Venture Capital
#*Assets Under Management
#*Number
#Onsite Angel Investors
#Onsite Mentors
#Founding Date
#Site URL
#Office hours investors
#Office hours mentor/advisors
#Onsite temporary workshops
#Networking Meetups
#Sponsors/Partners
#*University
#*Corporate
#Curriculum
#Onsite code school
#Alumni Network
#Nonprofit status
#Mission statement
#Specific Industry
#Price for a space
#Price for office
#Twitter activity
#Size (sqft)
#Size (# companies)
#Onsite accelerator
#Community membership??
#Franchise
#Multiple locations within city
 
===Grouping of Variables===
There are a few categories the majority of the variables fall under
 
'''Group 1: Low Hanging Fruit'''
Variables in this group are very easy to find and automate.
#Twitter Activity
#URL
#Address
#Mission Statement
#Specific Industry
#Nonprofit
#Sponsors/Partners
#Price for a space + office
#Founding Date
 
 
'''Group 2: The Difficult to Find'''
There are certain variables where the information is not readily available online or difficult to find.
#Size (can try to find press releases)
 
 
'''Group 3: In Between 1 and 2'''
Variables that aren't too easy or difficult to find and automate.
#Onsite accelerator
#Alumni mentor
 
 
'''Group 4: The Hard to Differentiate'''
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate. In order to fix this, we will need to create filters akin to the DSM5 scorecard. See the below section.
#Onsite VC v. Angel Investors
#Onsite OH Investors v. mentors
#Onsite temporary workshops v. networking events
#Curriculum v. code school
 
=====General Approach Group 4=====
The Scorecard will be broken down into three main parts: description, characteristics, andTBD parts. The procedure for creating these will be as follows: the description will be determined, develop the characteristics after looking over examples, the creation of possible mechanical turks that have complete accuracy even if not comprehension (e.g. a task will that always guarantees that there is an onsite mentor that covers only 40% of firms, but never misspecifies the existence of mentors), and auditing of the results.
 
 
'''Group 5: The Need further Discussion Before Collection'''
Variables that need to be developed more prior to collection.
#Franchise and multiple locations within a city
#Community Membership
460

edits

Navigation menu