<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://www.edegan.com/mediawiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=ArielSun</id>
	<title>edegan.com - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="http://www.edegan.com/mediawiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=ArielSun"/>
	<link rel="alternate" type="text/html" href="http://www.edegan.com/wiki/Special:Contributions/ArielSun"/>
	<updated>2026-06-02T01:16:50Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.34.2</generator>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Work_Hours&amp;diff=9058</id>
		<title>Work Hours</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Work_Hours&amp;diff=9058"/>
		<updated>2016-09-26T18:18:45Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Please complete your preferred times for the Fall term of 2015 below.&lt;br /&gt;
&lt;br /&gt;
{|  class=&amp;quot;wikitable sortable&amp;quot; style=&amp;quot;border: 1px solid darkgray; background-color: #f9f9f9&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Name'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Mon'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Tues'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Wed'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Thurs'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Fri'''&lt;br /&gt;
|-&lt;br /&gt;
| Albert Nabiullin||3-4:30||12:45-3:15||3-4:30||12:45-3:15||3-4:30&lt;br /&gt;
|-&lt;br /&gt;
| Amir Kazempour||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Avesh Krishna||9-11:00||2:30-4:00||2-5:30||2:30-4:00||2-4:00&lt;br /&gt;
|-&lt;br /&gt;
| Ariel Sun||11-12||||11-12||||11-12, 1:15-2:45&lt;br /&gt;
|-&lt;br /&gt;
| Ben Baldazo||2-5:00||1:-5:00||||||2-5:00&lt;br /&gt;
|-&lt;br /&gt;
| Carlin Cherry||||10-12, 2:20-3:50||12:45-2, 3-5:30||2:20-3:50||12:45-2&lt;br /&gt;
|-&lt;br /&gt;
| Catherine Kirby||||1:00-3:00||3:00-5:00||2:00-5:00||1:00-4:00&lt;br /&gt;
|-&lt;br /&gt;
| Christy Warden||||2:00-4:45||||2:00-4:45||&lt;br /&gt;
|-&lt;br /&gt;
| Dylan Dickens||1-6:00||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Harsh Upadhyay||3-5:30||3-5:30||3-5:30||3-5:30||3-5:30&lt;br /&gt;
|-&lt;br /&gt;
| Jake Silberman||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| James Chen||||10:00-12:00||||3:00-5:00||10:00-4:00&lt;br /&gt;
|-&lt;br /&gt;
| Julia Wang||1-4:30||||1-4:30||||1-4:00&lt;br /&gt;
|-&lt;br /&gt;
| Marcela Interiano||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Meghana Gaur||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Ramee Saleh||3-5:00||3-5:00||3-5:00||3-5:00||3-5:00&lt;br /&gt;
|-&lt;br /&gt;
| Ravali Kruthiventi||3-6||||3-6||||3-6&lt;br /&gt;
|-&lt;br /&gt;
| Todd Rachowin||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Tay Jacobe||10:00-12:00||4:00-6:00||10:00-12:00||4:00-6:00||10:00-12:00&lt;br /&gt;
|-&lt;br /&gt;
| Will Cleland||||12:30-4||||12:30-4||2-5:00 &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Category: McNair Admin]]&lt;br /&gt;
[[admin_classification::Admin| ]]&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=8666</id>
		<title>Ariel Sun (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=8666"/>
		<updated>2016-09-07T19:29:40Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Work Log]]&lt;br /&gt;
[[Ariel Sun]] [[Work Logs]] [[Ariel Sun (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
06/01/2016 - Introduction/Wiki Building&lt;br /&gt;
&lt;br /&gt;
06/02/2016 - Refined Wiki Organization and Content&lt;br /&gt;
&lt;br /&gt;
06/03/2016 - Organized Topic Areas and Worked on Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/06/2016 - Continued Organizing Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/07/2016 - Draft of Women in Entrepreneurship blog post&lt;br /&gt;
&lt;br /&gt;
06/08/2016 - Work on Challenges Women Entrepreneurs Face wiki page&lt;br /&gt;
&lt;br /&gt;
06/09/2016 - Clean up content of patent trolls and put on the public page&lt;br /&gt;
&lt;br /&gt;
06/10/2016 - Put up resources for business dynamism in high tech issue brief page&lt;br /&gt;
&lt;br /&gt;
06/13/2016 - Clean up venture one data and LBO data&lt;br /&gt;
&lt;br /&gt;
06/14/2016 - Match venture one and LBO data to patent data&lt;br /&gt;
&lt;br /&gt;
06/15/2016 - Create tables that match patent information to each LBO/venture company&lt;br /&gt;
&lt;br /&gt;
06/16/2016 - Create and Finalize LBO/venture company and patent summary table&lt;br /&gt;
&lt;br /&gt;
06/17/2016 - Familiarize with Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/20/2016 - Analyze existing SQL code of Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/21/2016 - Clean up and rebuild Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/22/2016 - First draft of complete SQL script for Hubs datasets&lt;br /&gt;
&lt;br /&gt;
07/15/2016 - Help Ed send out VentureOne data, add grouping and considerations of Hubs scorecard variables&lt;br /&gt;
&lt;br /&gt;
07/18/2016 - Work on differentiating curriculum v. code school, redo Matching VentureOne&lt;br /&gt;
&lt;br /&gt;
07/19/2016 - Finish Matching VentureOne, update on Wiki, work on differentiating curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
07/20/2016 - Work on differentiating OH investor v. mentor, temporary workshop v. networking meetup&lt;br /&gt;
&lt;br /&gt;
07/21/2016 - Consolidate previously done hubs variables, work on number of onsite accelerators&lt;br /&gt;
&lt;br /&gt;
07/22/2016 - Work on Hubs variables: price of flexible/dedicated desk, Onsite VCs v. Angel Investors, number of members&lt;br /&gt;
&lt;br /&gt;
07/29/2016 - Code Hubs variables for entities that are hubs, E:/McNair/Projects/Hubs/Hubs Variables-Ariel&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
09/07/2016 - Moved scorecard related pages to hubs scorecard academic paper, edit hubs vc sql data script&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=The_Impact_of_Entrepreneurship_Hubs_on_Urban_Venture_Capital_Investment&amp;diff=8654</id>
		<title>The Impact of Entrepreneurship Hubs on Urban Venture Capital Investment</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=The_Impact_of_Entrepreneurship_Hubs_on_Urban_Venture_Capital_Investment&amp;diff=8654"/>
		<updated>2016-09-07T16:33:30Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Hubs(Academic Paper)&lt;br /&gt;
|Topic Area=Entrepreneurship Ecosystems&lt;br /&gt;
|Owner=Todd Rachowin, Ariel Sun&lt;br /&gt;
|Start Term=Spring 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Academic Paper&lt;br /&gt;
|Audience=Academics&lt;br /&gt;
|Keywords=Hubs, Incubators, Accelerators, Venture, Capital, Angel, Investor, Startups&lt;br /&gt;
|Primary Billing=AccNBER01&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Hubs Pages=&lt;br /&gt;
*This page [[Hubs (Academic Paper)]] is the main page for the Hubs project!&lt;br /&gt;
*The old work done by Rachael is on the [[Hubs]] page&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs Scorecard (Academic Paper)]]. This summarizes:&lt;br /&gt;
**Current work in progress for building the Hubs scorecard: [[Hubs: Hubs Scorecard]]&lt;br /&gt;
**Tracking of work in progress for the scorecard [[Hubs: Hubs Data Building]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Abstract=&lt;br /&gt;
&lt;br /&gt;
The Hubs Research Project is a full-length academic paper analyzing the effectiveness of &amp;quot;hubs&amp;quot;, a component of the entrepreneurship ecosystem, in the advancement and growth of entrepreneurial success in a metropolitan area.&lt;br /&gt;
&lt;br /&gt;
This research will primarily focus on large and mid-sized Metropolitan Statistical Areas (MSAs), as that is where the great majority of Venture Capital funding is located.&lt;br /&gt;
&lt;br /&gt;
A general overview of entrepreneurial ecosystems can be found here: [[Entrepreneurial Ecosystem]].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Current Work=&lt;br /&gt;
==General Overview==&lt;br /&gt;
Currently there are '''3''' major tasks being performed (list to be updated):&lt;br /&gt;
#'''Creation of VC data table''': '''UPDATE: Complete''' (see completed work section below)&lt;br /&gt;
#'''Creation of Hubs Dataset''': '''UPDATE: See current work in progress for updates''' We will collect key variables for potential Hubs.&lt;br /&gt;
#'''Hazard Rate Model''': '''UPDATE: (7/11) Spoke to Xun Tang, econometrics professor in Rice's Economics Department, and now looking for appropriate proportional hazards models with time-varying covariates.''' To perform our difference-in-differences (diff-in-diff) analysis, we need to match MSAs. To do so, we will use a hazard rate model to estimate the probability that an MSA gets a Hub, and then compare MSAs that do and don't have hubs but have similar probabilities.&lt;br /&gt;
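&lt;br /&gt;
A minimal Python sketch of the matching step described in item 3, assuming the hazard rate model has already produced one probability per MSA. The function name, the MSA labels, and the probabilities are hypothetical and for illustration only; nearest-neighbor matching without replacement is one common choice, not necessarily the one this project will adopt.&lt;br /&gt;

```python
def match_on_probability(treated, controls):
    """Pair each treated MSA with the unused control MSA closest in probability.

    treated, controls: dicts mapping an MSA name to its estimated
    probability of getting a hub (hypothetical inputs).
    """
    available = dict(controls)
    pairs = []
    # Process treated MSAs from highest to lowest probability.
    for msa, prob in sorted(treated.items(), key=lambda kv: -kv[1]):
        if not available:
            break
        nearest = min(available, key=lambda c: abs(available[c] - prob))
        gap = round(abs(available[nearest] - prob), 3)
        pairs.append((msa, nearest, gap))
        del available[nearest]  # match without replacement
    return pairs


# Made-up probabilities, for illustration only.
with_hub = {"MSA A": 0.62, "MSA B": 0.35}
without_hub = {"MSA C": 0.60, "MSA D": 0.30, "MSA E": 0.10}
print(match_on_probability(with_hub, without_hub))
```

Each tuple holds an MSA with a hub, its matched MSA without a hub, and the gap between their probabilities; pairs with a large gap may be worth dropping before the comparison.&lt;br /&gt;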
&lt;br /&gt;
==Work In Progress==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
==Venture Capital Data General Overview==&lt;br /&gt;
The main goal of the data set is to aggregate company, fund, and round level data to be analyzed at a combined MSA and year level. The data set is composed of two major parts: a granular company/fund/round table and an aggregated CMSA-Year table.  The data includes all United States Venture Capital transactions (moneytree) from 1990 through 2015.&lt;br /&gt;
&lt;br /&gt;
The Hubs data set, from SDC Platinum, has been constructed on the server:&lt;br /&gt;
 Data files are in 128.42.44.181/bulk/Hubs&lt;br /&gt;
 All files are in 128.42.44.182/bulk/Projects/Hubs&lt;br /&gt;
 psql Hubs2&lt;br /&gt;
&lt;br /&gt;
SQL files:&lt;br /&gt;
:&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Data Script v10.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
Note: We need to check that everything in '''Data Script v9 Ariel.txt''' has been incorporated into v10&lt;br /&gt;
&lt;br /&gt;
Table Header Rows + 5 lines:&lt;br /&gt;
:&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Data Table List v2.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
Note: This was generated by '''Data Script v10.txt'''&lt;br /&gt;
&lt;br /&gt;
===Procedure - Granular Table===&lt;br /&gt;
#Start with separate raw datasets for Companies, Funds, and Rounds - '''Locate Raw Datasets and Determine Pedigree'''&lt;br /&gt;
#Add Data to Each Individual dataset (e.g. add MSA code)&lt;br /&gt;
#Clean and standardize names (e.g. company or fund name) for each dataset&lt;br /&gt;
#Join the Datasets (here we need to exclude undisclosed companies)&lt;br /&gt;
&lt;br /&gt;
===Procedure - CMSA-Year Table===&lt;br /&gt;
#Create a consistent CMSA-Year table to be used later&lt;br /&gt;
#Using the tables from the granular table, parse out the right data&lt;br /&gt;
#Join the parsed out data with the CMSA-Year Table&lt;br /&gt;
#Join these Tables&lt;br /&gt;
&lt;br /&gt;
==VC Specific Tables and Procedure==&lt;br /&gt;
===Raw data tables===&lt;br /&gt;
#'''Funds''': fund name, first investment date, last investment date, fund closing date, address, known investment, average investment, number of companies invested, MSA, MSA code.&lt;br /&gt;
#'''Rounds''': round date, company name, state, round number, stage 1, stage 2, stage 3&lt;br /&gt;
#'''Combined Rounds''': company name, round date, disclosed amount, investor&lt;br /&gt;
#'''Companies''': company name, first investment, last investment, MSA, MSA code, address, state, date founded, known funding, industry&lt;br /&gt;
#'''MSA List''': MSA, MSA code, CMSA, CMSA code&lt;br /&gt;
#'''Industry List''': maps 6 industry categories to 4: ICT, Life Sciences, Semiconductors, Other&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Granular Table (Fund-Round-Company)===&lt;br /&gt;
The final table here contains all venture capital transactions by disclosed funds and portfolio companies, together with their CMSAs.&lt;br /&gt;
To get the table, we processed the raw data sets in the following steps:&lt;br /&gt;
#Clean '''Company''' data&lt;br /&gt;
##Import raw data companies&lt;br /&gt;
##Add variable 'CMSA' from data set MSA list, update variable 'industry' by joining data set industry list&lt;br /&gt;
##Remove duplicates and remove undisclosed companies &lt;br /&gt;
#Clean '''Fund''' data&lt;br /&gt;
##Import raw data funds&lt;br /&gt;
##Add variable 'CMSA'&lt;br /&gt;
##Remove duplicates and remove undisclosed funds&lt;br /&gt;
##Match fund names against themselves using [[The Matcher (Tool) |The Matcher]] to get the standard fund names&lt;br /&gt;
#Clean '''Round''' data&lt;br /&gt;
##Import raw data rounds and combined rounds&lt;br /&gt;
##Add variables 'number of investment', 'estimated investment' and 'year'&lt;br /&gt;
##Remove duplicates and remove undisclosed funds&lt;br /&gt;
#'''Combine''' '''Companies''' and '''Rounds'''&lt;br /&gt;
##Combine cleaned companies and rounds data table on company names&lt;br /&gt;
##Add variables 'round number' and 'stage'&lt;br /&gt;
##Remove duplicates&lt;br /&gt;
#'''Combine''' '''Funds''' and '''rounds-companies'''&lt;br /&gt;
##Match fund names in rounds data table with standard fund names using [[The Matcher (Tool) |The Matcher]] to standardize fund names in rounds data table&lt;br /&gt;
##Join standard fund names to rounds-companies table&lt;br /&gt;
##Join cleaned funds table to rounds-companies table on standard fund names&lt;br /&gt;
&lt;br /&gt;
Note: This was done by Ariel and then edited by Todd.&lt;br /&gt;
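&lt;br /&gt;
The name cleaning and joining steps above can be sketched in Python as follows. This is only an illustration: the simple 'standardize' rule is a stand-in for [[The Matcher (Tool) |The Matcher]], whose actual matching logic is not shown here, and the field names ('fund_name', 'investor', 'cmsa') and sample rows are assumptions.&lt;br /&gt;

```python
import re

# Legal suffixes to ignore when comparing names (an assumed, incomplete list).
SUFFIXES = {"inc", "llc", "lp", "ltd", "corp"}


def standardize(name):
    """Lowercase a name, drop punctuation and common legal suffixes."""
    text = name.lower().replace(".", "")
    words = re.sub(r"[^a-z0-9 ]", " ", text).split()
    return " ".join(w for w in words if w not in SUFFIXES)


def join_rounds_to_funds(rounds, funds):
    """Attach the fund's CMSA to each round via the standardized fund name."""
    fund_by_name = {standardize(f["fund_name"]): f for f in funds}
    joined = []
    for r in rounds:
        fund = fund_by_name.get(standardize(r["investor"]))
        if fund is not None:  # rounds with no matching fund are dropped
            joined.append({**r, "fund_cmsa": fund["cmsa"]})
    return joined


# Made-up rows, for illustration only.
funds = [{"fund_name": "Acme Ventures, L.P.", "cmsa": "Houston"}]
rounds = [
    {"company": "Widget Co", "investor": "ACME Ventures LP"},
    {"company": "Gadget Co", "investor": "Undisclosed Fund"},
]
print(join_rounds_to_funds(rounds, funds))
```

Only the first sample round survives the join, because its investor name standardizes to the same string as the fund name.&lt;br /&gt;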
&lt;br /&gt;
===CMSA-Year Aggregated Table===&lt;br /&gt;
&lt;br /&gt;
The original MSA to CMSA mapping was done by Rachel and is used here. '''LOCATE THE FILE!!!'''&lt;br /&gt;
&lt;br /&gt;
The final table contains the number of companies and the amount of investment for each CMSA, categorized by distance and stage. &lt;br /&gt;
&lt;br /&gt;
We processed data as follows:&lt;br /&gt;
#Create the '''CMSA-Year''' Table&lt;br /&gt;
##Create single variable tables: Distinct CMSA, year, stage, founding year of fund and founding year of company.&lt;br /&gt;
##Create the cross-product tables: CMSA-year, CMSA-year-fund year founded and CMSA-year-company year founded&lt;br /&gt;
#Draw data from cleaned companies, funds and rounds tables&lt;br /&gt;
##Create a table with 'CMSA', 'number of companies' and 'year Founded' from cleaned companies table and join it to CMSA-year-company year founded&lt;br /&gt;
##Create a table with 'Company CMSA', 'round year', 'disclosed amount' from rounds-companies combined table, and add stage binary variables. Join it to CMSA-year-company year founded&lt;br /&gt;
##Create a table with 'CMSA', 'fund year', 'number of investors' from cleaned funds table and join it to CMSA-year-fund year founded&lt;br /&gt;
#Create '''near-far''' and stages table&lt;br /&gt;
##Add fund data to rounds-companies&lt;br /&gt;
##Create near-far and stages binary variable&lt;br /&gt;
##Count investment and deals by CMSA and year, categorized by near-far and stages&lt;br /&gt;
#Combine all tables by CMSA and round-year&lt;br /&gt;
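&lt;br /&gt;
A minimal Python sketch of steps 1 and 4 above: build the CMSA-year cross product first, so that CMSA-years with no deals still appear with zero counts, then aggregate rounds into it. The function name, field names, and sample rows are assumptions for illustration; the project's own scripts are written in SQL.&lt;br /&gt;

```python
from itertools import product


def build_cmsa_year_table(cmsas, years, rounds):
    """Return one row per (CMSA, year), including pairs with no deals."""
    table = {key: {"deals": 0, "amount": 0.0} for key in product(cmsas, years)}
    for r in rounds:
        key = (r["cmsa"], r["year"])
        if key in table:  # ignore rounds outside the CMSA-year grid
            table[key]["deals"] += 1
            table[key]["amount"] += r["amount"]
    return table


# Made-up rows, for illustration only.
sample_rounds = [
    {"cmsa": "Houston", "year": 2014, "amount": 2.5},
    {"cmsa": "Houston", "year": 2014, "amount": 1.0},
    {"cmsa": "Austin", "year": 2015, "amount": 4.0},
]
cmsa_year = build_cmsa_year_table(["Houston", "Austin"], [2014, 2015], sample_rounds)
for key in sorted(cmsa_year):
    print(key, cmsa_year[key])
```

Starting from the full grid rather than from the rounds themselves keeps empty CMSA-years in the panel, which matters when the table is later joined to the supplementary data sets on CMSA and year.&lt;br /&gt;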
&lt;br /&gt;
==Supplementary Data Sets==&lt;br /&gt;
&lt;br /&gt;
Supplementary data sets are cleaned and joined back to the CMSA-Year table on CMSA and year:&lt;br /&gt;
&lt;br /&gt;
#Number of STEM graduate students, by university and year (2005 to 2014). &lt;br /&gt;
#University R&amp;amp;D spending, by university and year (2004 to 2014).&lt;br /&gt;
#Income per capita, by MSA and year (2000 to 2012)&lt;br /&gt;
#Wages and salaries, by MSA and year (2000 to 2012)&lt;br /&gt;
&lt;br /&gt;
All of these files were created originally by Rachel. Some were cleaned in Excel. No new data was added (some extra cols, no extra rows).&lt;br /&gt;
&lt;br /&gt;
The datasets can respectively be found at:&lt;br /&gt;
 E:\McNair\Projects\Hubs\STEM grads for upload v2.xls&lt;br /&gt;
   --Contains: university	zipcode	newmsacode	msa	msacode	cmsa	cmsacode	year	nostudents&lt;br /&gt;
   --CMSA code inside sheet seems to be ours. Check with Ariel.&lt;br /&gt;
 E:\McNair\Projects\Hubs\NSF spending for upload.xls&lt;br /&gt;
   --Contains: Institution	MSA	CMSA code	Year	Spending&lt;br /&gt;
   --We think the CMSA Code is ours. Check with Ariel. &lt;br /&gt;
 E:\McNair\Projects\Hubs\Income per capita upload.xls&lt;br /&gt;
   --Contains: Fips	Area	Year	Income&lt;br /&gt;
   --Lookup to CMSA was done using VLOOKUPs in Excel. See Matcher Helper vTR.xls, and other Matcher Helper ???.xls files&lt;br /&gt;
 E:\McNair\Projects\Hubs\Wage for upload v2.xls&lt;br /&gt;
   --Contains: Fips	MSA	Year	Wage&lt;br /&gt;
   --Lookup to CMSA was done using VLOOKUPs in Excel. See Matcher Helper vTR.xls, and other Matcher Helper ???.xls files&lt;br /&gt;
&lt;br /&gt;
=Resources=&lt;br /&gt;
&lt;br /&gt;
===Additional Resources===&lt;br /&gt;
* Yael Hochberg and Fehder (2015), located in dropbox&lt;br /&gt;
** Use this paper as a guideline on how to conduct the analysis&lt;br /&gt;
*US Census Bureau data on employment by MSA: http://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?pid=ACS_14_5YR_B23027&amp;amp;prodType=table&lt;br /&gt;
*USPTO utility patents by MSA: http://www.uspto.gov/web/offices/ac/ido/oeip/taf/cls_cbsa/allcbsa_gd.htm&lt;br /&gt;
*MSA level trends: http://www.metrotrends.org/data.cf&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;includeonly&amp;gt;&lt;br /&gt;
[[Category: McNair Projects]]&lt;br /&gt;
&amp;lt;/includeonly&amp;gt;&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8653</id>
		<title>Hubs Scorecard (Academic Paper)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8653"/>
		<updated>2016-09-07T16:33:04Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Hubs Scorecard(Academic Paper)&lt;br /&gt;
|Topic Area=Entrepreneurship Ecosystems&lt;br /&gt;
|Owner=Todd Rachowin, Ariel Sun&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Academic Paper&lt;br /&gt;
|Audience=Academics&lt;br /&gt;
|Keywords=Hubs, Incubators, Accelerators, Venture, Capital, Angel, Investor, Startups&lt;br /&gt;
|Primary Billing=AccNBER01&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Abstract=&lt;br /&gt;
As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
Our goal is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs (Complete)&lt;br /&gt;
#Determining the best variables for the scorecard (Complete)&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection (Complete)&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation (In Progress)&lt;br /&gt;
#Collecting the remaining manual data (next step)&lt;br /&gt;
&lt;br /&gt;
*For the detailed current work in progress for building the Hubs datasheet for the scorecard  go to: [[Hubs: Hubs Scorecard]]&lt;br /&gt;
*For a tracker of work in progress for the dataset building for the scorecard go to [[Hubs: Hubs Data Building]]&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs: Hubs Data]]&lt;br /&gt;
*For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
*The comprehensive list of potential hubs can be found at:&lt;br /&gt;
*#&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Raw Program List&amp;lt;/code&amp;gt; - Contains 600 entities - vast majority are firmly not hubs (file pedigree unknown)&lt;br /&gt;
*#&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Hubs Data&amp;lt;/code&amp;gt; - Contains 125 entities - many are not hubs (overlap with above file unknown, this file's pedigree from old Hubs project).&lt;br /&gt;
&lt;br /&gt;
==Hubs Data==&lt;br /&gt;
'''(7/27 Onwards)'''&lt;br /&gt;
&lt;br /&gt;
Collected variables for 30 hubs that are surely hubs. The results are here:&lt;br /&gt;
*&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Hubs Variables -Ariel.xls&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''(Until 7/27)'''&lt;br /&gt;
&lt;br /&gt;
See [[Hubs: Hubs Scorecard]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''(Week of 7/11)'''&lt;br /&gt;
&lt;br /&gt;
1) We published the Twitter count task on Mechanical Turk and received results.&lt;br /&gt;
&lt;br /&gt;
2) We have audited the results and updated the Amazon site.&lt;br /&gt;
&lt;br /&gt;
3) We are creating additional potential Mechanical Turk tasks on the Amazon site (See [[Hubs: Hubs Scorecard]])&lt;br /&gt;
&lt;br /&gt;
4) We are finding more potential hubs from members of the International Business Innovation Association&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''(Week of 7/4)'''&lt;br /&gt;
&lt;br /&gt;
1) We have created the list and commented our thoughts after ---. For determining the variables, we have separated the list into two parts: a list of desired variables and ones that were previously collected, many of which are desired variables.&lt;br /&gt;
&lt;br /&gt;
2) We have also created an example of how to write mechanical turks for collecting certain variables&lt;br /&gt;
&lt;br /&gt;
==Variables to be Used==&lt;br /&gt;
&lt;br /&gt;
Old variable list (see Hubs Data.xls) contains 18+3 variables. Overlap with new variable list is ~50%&lt;br /&gt;
&lt;br /&gt;
===Current Complete List===&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
===Grouping of Variables===&lt;br /&gt;
Most of the variables fall under a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#URL&lt;br /&gt;
#Address&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Founding Date&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  To fix this, we will need to create filters akin to the DSM-5 scorecard.  See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
=====General Approach Group 4=====&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and a third part still to be determined (TBD). The procedure for creating these will be as follows: determine the description; develop the characteristics after looking over examples; create possible Mechanical Turk tasks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, is always right and never misspecifies the existence of mentors); and audit the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: Needing Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Scorecard&amp;diff=8371</id>
		<title>Hubs: Hubs Scorecard</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Scorecard&amp;diff=8371"/>
		<updated>2016-09-02T19:38:24Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Hubs Pages=&lt;br /&gt;
*The main page for Hubs scorecard can be found: [[Hubs Scorecard (Academic Paper)]]&lt;br /&gt;
*For a tracker of work in progress for the dataset building for the scorecard go to [[Hubs: Hubs Data Building]]&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs: Hubs Data]]&lt;br /&gt;
&lt;br /&gt;
=Background=&lt;br /&gt;
This page represents the work used for creating the hubs data for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
Our goal is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs (Complete)&lt;br /&gt;
##&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Raw Program List&amp;lt;/code&amp;gt; Contains 600 entities - vast majority are firmly not hubs (file pedigree unknown)&lt;br /&gt;
##&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Hubs Data&amp;lt;/code&amp;gt; - Contains 125 entities - many are not hubs (overlap with above file unknown, this file's pedigree from old Hubs project).&lt;br /&gt;
#Determining the best variables for the scorecard (Complete)&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection (Complete)&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation (In Progress)&lt;br /&gt;
#*See section 4.2&lt;br /&gt;
#Collecting the remaining manual data (next step)&lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
&lt;br /&gt;
Old variable list (see Hubs Data.xls) contains 18+3 variables. Overlap with new variable list is ~50%&lt;br /&gt;
&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
Most of the variables fall under a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#URL&lt;br /&gt;
#Address&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Founding Date&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  To fix this, we will need to create filters akin to the DSM-5 scorecard.  See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
=====General Approach Group 4=====&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and a third part still to be determined (TBD). The procedure for creating these will be as follows: determine the description; develop the characteristics after looking over examples; create possible Mechanical Turk tasks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, is always right and never misspecifies the existence of mentors); and audit the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: Needing Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
=Steps Needed to Complete=&lt;br /&gt;
#Create Processes for Collecting Data&lt;br /&gt;
#*Status (7/27): Complete&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/27): Complete&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/27): In Progress (see section:)&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status (7/27): NS&lt;br /&gt;
&lt;br /&gt;
==How to Code the Variables==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''UPDATE (7/20)''': Gunny has created a tool to do this process&lt;br /&gt;
#URL&lt;br /&gt;
#*'''UPDATE (7/22)''': Veeral has code to do this procedure (search company, city in Google)&lt;br /&gt;
#Address&lt;br /&gt;
#*'''UPDATE (7/22)''': Code written. Difficulties occur with very large companies (e.g. Impact Hub). Will require Veeral's program. Expected time for each assignment is 10-20 seconds, so the recommended pay rate is $.05.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Using Veeral's code, take the cross product of allintext: (Group A) and site: (Group B), where '''Group A''' = Contact (high coverage), About Us, Find Us, Locations, Address and '''Group B''' = Company URLs.&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (allintext: About/Mission site: from Company's URL).&lt;br /&gt;
#*#Click on the first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from the company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''UPDATE (7/21)''': Given that most companies include their specialty in their mission statement and that this variable is difficult to Turk, we will manually check each mission statement and mark it accordingly.&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds because the list of sponsors/partners may be long, so the recommended pay rate is $.12.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose first result from Search Text 1 and Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If there does not exist a list or the link was for only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company’s URL&lt;br /&gt;
#*#On the homepage, look for the section related to pricing. If pricing is not found on the homepage, look for the links ‘coworking’, ‘work space’, ‘membership’, ‘pricing’, ‘join’ or ‘Apply for membership’, and look for the pricing information under those links. If there is no price-related section, record DNE for both ‘Flexible Desk’ and ‘Dedicated Desk’.&lt;br /&gt;
#*#If there is pricing information, look for the monthly price of a shared space, often denoted as Shared/Flexible desk/non-dedicated desk, and record the price in ‘Flexible Desk’. If the price is not found, record DNE.&lt;br /&gt;
#*#Look for the monthly price of a dedicated desk, often denoted as Reserved/dedicated desk/private desk, and record the price in ‘Dedicated Desk’. If the price is not found, record DNE.&lt;br /&gt;
#*#If price information is not found and there is a ‘locations’ link, click on it and choose the first location in the list. Repeat steps 3-4.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Keywords: 24/7 access, dedicated desk, pricing&lt;br /&gt;
#*#Google allintext:&amp;quot;keywords&amp;quot; site:URL&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#**'''UPDATE (7/21)''': Difficulties observed when figuring out how to Turk this; have a solution (whois.net)&lt;br /&gt;
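The cross product described in the Address procedure (Group A keywords against Group B company URLs) could be generated along the following lines. This is only a minimal sketch and not Veeral's actual code: the function name build_queries and the example.com / example.org URLs are placeholders invented for illustration, and the exact spacing of the search operators is an assumption.&lt;br /&gt;

```python
from itertools import product

# Group A keywords as listed in the Address procedure.
GROUP_A = ["Contact", "About Us", "Find Us", "Locations", "Address"]


def build_queries(company_urls, keywords=GROUP_A):
    """Return one search string per (URL, keyword) pair, grouped by URL."""
    # product() varies its last argument fastest, so every keyword for one
    # URL is emitted before moving on to the next URL.
    return [
        "allintext: {} site: {}".format(keyword, url)
        for url, keyword in product(company_urls, keywords)
    ]


if __name__ == "__main__":
    # example.com and example.org are placeholder URLs, not real hubs.
    for query in build_queries(["example.com", "example.org"]):
        print(query)
```

Because the URL is the outer loop, the resulting list is already grouped by company.&lt;br /&gt;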
&lt;br /&gt;
===Group 2===&lt;br /&gt;
#Size (SQFT)&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet site: company URL. 4) Company Name, city, square feet and then choose first result. Process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record, in SEARCH 1, the sentence in which the text appears&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
#Size (# Companies)&lt;br /&gt;
#*'''BRAINSTORM''': (7/22) Some companies list only selected members rather than all of them. Some companies do not separate current members from alumni and instead say something like: &amp;quot;we have served more than 120 startups...&amp;quot;&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for the link 'Members' or 'Residents'; these are usually under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#*#Count the number of members&lt;br /&gt;
#*#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#*#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search allintitle:&amp;quot;Members/Startups/Residents/Villagers/Ventures&amp;quot; site:URL in Google.&lt;br /&gt;
#*#If no result found, record DNE.&lt;br /&gt;
#*#If there are results, go to the first result, which is usually in a form like &amp;quot;Members - Company Name&amp;quot;.&lt;br /&gt;
#*#If the result directs you to a page that lists the members of the company, count the number of companies and record the number.&lt;br /&gt;
#*#If the result directs you to a page that does not give information on the number of members, record DNE.&lt;br /&gt;
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the keywords can be identified, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record urlhome in PAGE.&lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring.&lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record the link name clicked in PAGE.&lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text (Mentor/Mentorship) into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written - 2nd part, while more manual, appears to have greater range.  2nd code would only require Veeral's code.  1st code expected completion time is 30 seconds.&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company's URL&lt;br /&gt;
#*#Look for the link 'Accelerators' or 'Accelerating/Accelerator/Acceleration/Accelerate Programs'&lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number. **or also copy the names of the accelerators?&lt;br /&gt;
#*#If accelerators are not found in step 1, go to the links 'Services' , 'Benefit', 'Resources', 'For Entrepreneurs', 'Startups' and look for the section of 'Accelerator/Accelerating Programs' &lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number.&lt;br /&gt;
#*#If accelerators are not found, record 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search [allintitle:&amp;quot;accelerator&amp;quot;/&amp;quot;accelerate&amp;quot; site:URL] in Google&lt;br /&gt;
#*#Copy the titles of the results. **We have to scrutinize the titles ourselves to determine whether they are distinct onsite accelerators and record the number manually.&lt;br /&gt;
#*#If no result appears, record 0.&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
'''Curriculum and Code School'''&lt;br /&gt;
&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates, lasting XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs&lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that occur regularly&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no results are returned.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no results are returned.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Onsite OH Investors v. Mentors'''&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark ''office hours'' as 1 if there is a result, otherwise mark as 0.&lt;br /&gt;
#Click on the first five results&lt;br /&gt;
#On each of the five pages, search for two items:&lt;br /&gt;
##search for 'mentor'. (Ctrl + F) If 'mentor' appears in the description paragraph of office hours on any of the five pages, mark ''mentor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
##search for 'fund'. (Ctrl + F) If 'fund' appears in the description paragraph of office hours on any of the five pages, mark ''investor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
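The keyword check in the steps above could be sketched as follows once the office hours description paragraphs have been copied from the result pages. This is a rough illustration under stated assumptions: the function name mark_office_hours and the output keys are hypothetical, and treating the case of zero search results as DNE for both keyword marks is an assumption that the procedure does not spell out.&lt;br /&gt;

```python
def mark_office_hours(descriptions):
    """Apply the Ctrl + F style keyword check to office-hours paragraphs.

    descriptions: list of description paragraphs, one per result page
    (up to five). Returns a dict with the three marks.
    """
    lowered = [text.lower() for text in descriptions]
    # 1 if the search returned anything at all, otherwise 0.
    has_results = 1 if descriptions else 0

    def mark(keyword):
        # 1 if the keyword appears in any paragraph, otherwise "DNE".
        return 1 if any(keyword in text for text in lowered) else "DNE"

    return {
        "office hours": has_results,
        "mentor OH": mark("mentor"),
        "investor OH": mark("fund"),
    }
```

Like Ctrl + F, this is a plain substring match, so 'fund' also matches words such as 'funding' or 'fundamental'; the copied paragraphs would still need a manual read.&lt;br /&gt;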
&lt;br /&gt;
'''Onsite temporary workshops v. networking events'''&lt;br /&gt;
*'''Turk for Both 1 of 2''':&lt;br /&gt;
*#Search the Search Text 1 (allintext: events site: URL) and choose the link to &amp;quot;Events&amp;quot;, &amp;quot;Calendar&amp;quot;, or related. Record 'url' on SOURCE. If this does not exist, go to Step 7.&lt;br /&gt;
*#For all events that have dates, copy the events from today's date to the following month into ALL EVENTS&lt;br /&gt;
*#For all events that have Office Hours in the name, record the events in OFFICE HOURS. For all events that have summit in the name, record the events in SUMMITS.&lt;br /&gt;
*#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), copy the name of the events into WORKSHOPS&lt;br /&gt;
*#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), copy the name of the events into NETWORKING.  For all events that are unclear or did not fit into these descriptions&lt;br /&gt;
*#If a message explicitly says there are no events, mark as 0 for ALL EVENTS, OFFICE HOURS, SUMMITS, WORKSHOPS, and NETWORKING&lt;br /&gt;
*#If this does not exist, search Search Text 2 (allintext: Company Name site: meetup.com) and click on the meetup.com for the company if it exists.  If it does exist, record meetup on SOURCE.  If not, go to step 9.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
*#If this does not exist, search Search Text 3 (allintext: Company Name site: eventbrite.com) and click on the eventbrite.com for the company if it exists.  If it does exist, record eventbrite on SOURCE.  If not, mark DNE for all variables.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 2 of 2''':&lt;br /&gt;
#Go to Company URL&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events from today's date through the following month and record it in ALL EVENTS. If there is no information on events or their dates on the website, record DNE for all variables.&lt;br /&gt;
#For all events that have Office Hours in the name, count the events and record the number in OFFICE HOURS. For all events that have summit in the name, count the events and record the number in SUMMITS.&lt;br /&gt;
#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), count the events and record the number in WORKSHOPS&lt;br /&gt;
#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), count the events and record the number in NETWORKING&lt;br /&gt;
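The keyword sorting of event names used in both Turk procedures above could be pre-screened with something like the sketch below. The keyword lists are taken from the steps; the function names classify_event and count_events are hypothetical, and letting a single event count toward more than one column is an assumption.&lt;br /&gt;

```python
# Keywords copied from the workshop and networking steps of the procedure.
WORKSHOP_WORDS = ["training", "seminar", "class", "learn", "bootcamp",
                  "workshop", "pitch event"]
NETWORKING_WORDS = ["social", "meet up", "breakfast", "lunch", "happy hour",
                    "movie night", "bowling"]


def classify_event(name):
    """Return the list of columns an event name would be recorded under."""
    lowered = name.lower()
    columns = ["ALL EVENTS"]
    if "office hours" in lowered:
        columns.append("OFFICE HOURS")
    if "summit" in lowered:
        columns.append("SUMMITS")
    if any(word in lowered for word in WORKSHOP_WORDS):
        columns.append("WORKSHOPS")
    if any(word in lowered for word in NETWORKING_WORDS):
        columns.append("NETWORKING")
    return columns


def count_events(names):
    """Count events per column, as in the counting version of the Turk."""
    counts = {"ALL EVENTS": 0, "OFFICE HOURS": 0, "SUMMITS": 0,
              "WORKSHOPS": 0, "NETWORKING": 0}
    for name in names:
        for column in classify_event(name):
            counts[column] += 1
    return counts
```

Matching is by substring, so a word like 'class' would also match 'classic'; events that land only in ALL EVENTS are the unclear ones that still need a manual look.&lt;br /&gt;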
&lt;br /&gt;
'''Onsite VC v. Angel Investors'''&lt;br /&gt;
*Notes: Few companies have a section for their onsite VCs or angel investors. Even the one company (Innovation Pavilion) that has Angel programs and VC programs does not conduct the programs itself, but cooperates with external angel investors or VCs. Some companies have mentors or board members who are from VCs, but this does not mean they will invest in the member startups in those companies.&lt;br /&gt;
&lt;br /&gt;
===Group 5===&lt;br /&gt;
#Multiple Locations&lt;br /&gt;
#*Addresses are included in Group 1, but this still needs to be discussed&lt;br /&gt;
#*Getting&lt;br /&gt;
&lt;br /&gt;
==Generating the Data==&lt;br /&gt;
 All files can be found in the E:/Mcnair/Projects/Hubs/Searching&lt;br /&gt;
 Recommended to select the CSV and Excel worksheets because there are many JSON files&lt;br /&gt;
&lt;br /&gt;
There are generally '''6''' steps to complete for each variable when creating the data table:&lt;br /&gt;
*A good reference for this procedure is in the folder Address&lt;br /&gt;
#Run Veeral's Code on your search terms&lt;br /&gt;
#*A list of companies can be found in the file 'List of Companies'&lt;br /&gt;
#*Recommended to have the search file sorted by company (e.g. if searching 3 companies (A,B,C) with 2 search terms (S,T), recommend having your list as: A-S,A-T,B-S,etc.)&lt;br /&gt;
#*'''Procedure'''&lt;br /&gt;
#*# (Ariel or Veeral To Write)&lt;br /&gt;
#Check to see if output results are working properly&lt;br /&gt;
#*Recommend to do alt-d-f-f and choose only 1 and 2&lt;br /&gt;
#*Check at least 10 different companies and ensure desired result is in the results&lt;br /&gt;
#Clean table and format for Mechanical Turk&lt;br /&gt;
#*Ensure that mechanical turks are not getting error terms&lt;br /&gt;
#*We will likely use 1 row for each company and have specific headers that will allow for the inputs to be automatically populated (see [[Mechanical Turk (Tool)]])&lt;br /&gt;
#*You should also check to see if we need to find results manually&lt;br /&gt;
#Write the Turk on Amazon&lt;br /&gt;
#*See [[Mechanical Turk (Tool)]]&lt;br /&gt;
#Run and audit the Turk&lt;br /&gt;
#*Randomly choose ~30 companies (can use the above) and compare results with the Turkers&lt;br /&gt;
#*Check for AT LEAST the following:&lt;br /&gt;
#**% similar to manual&lt;br /&gt;
#**DNEs&lt;br /&gt;
#Post Results in [[Hubs: Hubs Data Building]]&lt;br /&gt;
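The two audit checks named in the steps above (% similar to manual, and DNEs) could be computed roughly as follows. This is a sketch under assumptions: the function name audit and the dictionary layout are hypothetical, and comparing answers after trimming whitespace and ignoring case is one possible definition of 'similar', not a rule taken from the procedure.&lt;br /&gt;

```python
def audit(manual, turk):
    """Compare manual and Turker answers for the same audited companies.

    manual, turk: dicts mapping company name to the recorded value.
    Only companies present in both dicts are compared.
    """
    shared = sorted(set(manual).intersection(turk))
    if not shared:
        return {"compared": 0, "percent_similar": None, "turk_dne": 0}
    # An answer counts as similar when it matches after trimming and
    # lowercasing (an assumed definition).
    matches = sum(
        1 for name in shared
        if str(manual[name]).strip().lower() == str(turk[name]).strip().lower()
    )
    # Number of audited companies for which the Turker recorded DNE.
    dne = sum(1 for name in shared if str(turk[name]).strip().upper() == "DNE")
    return {
        "compared": len(shared),
        "percent_similar": 100.0 * matches / len(shared),
        "turk_dne": dne,
    }
```

The roughly 30 audited companies would be passed in as the manual dictionary; a high DNE count alongside a low match rate suggests the search text, rather than the Turkers, is the problem.&lt;br /&gt;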
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
See Section 3 of [[Hubs (Academic Paper)]]&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8368</id>
		<title>Hubs Scorecard (Academic Paper)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8368"/>
		<updated>2016-09-02T19:37:09Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Work in Progress */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Hubs Scorecard(Academic Paper)&lt;br /&gt;
|Topic Area=Entrepreneurship Ecosystems&lt;br /&gt;
|Owner=Todd Rachowin, Ariel Sun&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Academic Paper&lt;br /&gt;
|Audience=Academics&lt;br /&gt;
|Keywords=Hubs, Incubators, Accelerators, Venture, Capital, Angel, Investor, Startups&lt;br /&gt;
|Primary Billing=AccNBER01&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Abstract=&lt;br /&gt;
As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
Our goal is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs (Complete)&lt;br /&gt;
#Determining the best variables for the scorecard (Complete)&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection (Complete)&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation (In Progress)&lt;br /&gt;
#Collecting the remaining manual data (next step)&lt;br /&gt;
&lt;br /&gt;
*For the detailed current work in progress for building the Hubs datasheet for the scorecard  go to: [[Hubs: Hubs Scorecard]]&lt;br /&gt;
*For a tracker of work in progress for the dataset building for the scorecard go to [[Hubs: Hubs Data Building]]&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs: Hubs Data]]&lt;br /&gt;
*For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
*Comprehensive list of potential hubs can be found at:&lt;br /&gt;
##&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Raw Program List&amp;lt;/code&amp;gt; Contains 600 entities - vast majority are firmly not hubs (file pedigree unknown)&lt;br /&gt;
##&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Hubs Data&amp;lt;/code&amp;gt; - Contains 125 entities - many are not hubs (overlap with above file unknown, this file's pedigree from old Hubs project).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Variables to be Used==&lt;br /&gt;
&lt;br /&gt;
Old variable list (see Hubs Data.xls) contains 18+3 variables. Overlap with new variable list is ~50%&lt;br /&gt;
&lt;br /&gt;
===Current Complete List===&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
===Grouping of Variables===&lt;br /&gt;
The majority of the variables fall under a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#URL&lt;br /&gt;
#Address&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Founding Date&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
For certain variables, the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither especially easy nor especially difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that it contains several similar variables, which would be difficult for a turk to differentiate. To fix this, we will need to create filters akin to the DSM5 scorecard. See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
=====General Approach Group 4=====&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible Mechanical Turk tasks that are completely accurate even if not comprehensive (e.g. a task that identifies onsite mentors for only 40% of firms, but never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8364</id>
		<title>Hubs Scorecard (Academic Paper)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8364"/>
		<updated>2016-09-02T19:36:31Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Hubs Scorecard(Academic Paper)&lt;br /&gt;
|Topic Area=Entrepreneurship Ecosystems&lt;br /&gt;
|Owner=Todd Rachowin, Ariel Sun&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Academic Paper&lt;br /&gt;
|Audience=Academics&lt;br /&gt;
|Keywords=Hubs, Incubators, Accelerators, Venture, Capital, Angel, Investor, Startups&lt;br /&gt;
|Primary Billing=AccNBER01&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Abstract=&lt;br /&gt;
As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
Our goal is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs (Complete)&lt;br /&gt;
#Determining the best variables for the scorecard (Complete)&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection (Complete)&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation (In Progress)&lt;br /&gt;
#Collecting the remaining manual data (next step)&lt;br /&gt;
&lt;br /&gt;
*For the current work in progress for building the Hubs datasheet for the scorecard  go to: [[Hubs: Hubs Scorecard]]&lt;br /&gt;
*For a tracker of work in progress for the dataset building for the scorecard go to [[Hubs: Hubs Data Building]]&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs: Hubs Data]]&lt;br /&gt;
*For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
*Comprehensive list of potential hubs can be found at:&lt;br /&gt;
##&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Raw Program List&amp;lt;/code&amp;gt; Contains 600 entities - vast majority are firmly not hubs (file pedigree unknown)&lt;br /&gt;
##&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Hubs Data&amp;lt;/code&amp;gt; - Contains 125 entities - many are not hubs (overlap with above file unknown, this file's pedigree from old Hubs project).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Variables to be Used==&lt;br /&gt;
&lt;br /&gt;
Old variable list (see Hubs Data.xls) contains 18+3 variables. Overlap with new variable list is ~50%&lt;br /&gt;
&lt;br /&gt;
===Current Complete List===&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
===Grouping of Variables===&lt;br /&gt;
The majority of the variables fall under a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#URL&lt;br /&gt;
#Address&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Founding Date&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
For certain variables, the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither especially easy nor especially difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that it contains several similar variables, which would be difficult for a turk to differentiate. To fix this, we will need to create filters akin to the DSM5 scorecard. See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
=====General Approach Group 4=====&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible Mechanical Turk tasks that are completely accurate even if not comprehensive (e.g. a task that identifies onsite mentors for only 40% of firms, but never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8357</id>
		<title>Hubs Scorecard (Academic Paper)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8357"/>
		<updated>2016-09-02T19:32:32Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Hubs Scorecard(Academic Paper)&lt;br /&gt;
|Topic Area=Entrepreneurship Ecosystems&lt;br /&gt;
|Owner=Todd Rachowin, Ariel Sun&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Academic Paper&lt;br /&gt;
|Audience=Academics&lt;br /&gt;
|Keywords=Hubs, Incubators, Accelerators, Venture, Capital, Angel, Investor, Startups&lt;br /&gt;
|Primary Billing=AccNBER01&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Abstract=&lt;br /&gt;
As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
Our goal is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs (Complete)&lt;br /&gt;
#Determining the best variables for the scorecard (Complete)&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection (Complete)&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation (In Progress)&lt;br /&gt;
#Collecting the remaining manual data (next step)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*For the current work in progress for building the Hubs datasheet for the scorecard  go to: [[Hubs: Hubs Scorecard]]&lt;br /&gt;
*For a tracker of work in progress for the dataset building for the scorecard go to [[Hubs: Hubs Data Building]]&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs: Hubs Data]]&lt;br /&gt;
*For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
*A comprehensive list of potential hubs can be found at:&lt;br /&gt;
**&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Raw Program List&amp;lt;/code&amp;gt; - Contains 600 entities - vast majority are firmly not hubs (file pedigree unknown)&lt;br /&gt;
**&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Hubs Data&amp;lt;/code&amp;gt; - Contains 125 entities - many are not hubs (overlap with above file unknown, this file's pedigree from old Hubs project).&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8354</id>
		<title>Hubs Scorecard (Academic Paper)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs_Scorecard_(Academic_Paper)&amp;diff=8354"/>
		<updated>2016-09-02T19:30:35Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: Created page with &amp;quot;{{McNair Projects |Project Title=Hubs Scorecard(Academic Paper) |Topic Area=Entrepreneurship Ecosystems |Owner=Todd Rachowin, Ariel Sun |Start Term=Summer 2016 |Status=Active...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Hubs Scorecard (Academic Paper)&lt;br /&gt;
|Topic Area=Entrepreneurship Ecosystems&lt;br /&gt;
|Owner=Todd Rachowin, Ariel Sun&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Academic Paper&lt;br /&gt;
|Audience=Academics&lt;br /&gt;
|Keywords=Hubs, Incubators, Accelerators, Venture, Capital, Angel, Investor, Startups&lt;br /&gt;
|Primary Billing=AccNBER01&lt;br /&gt;
}}&lt;br /&gt;
&lt;br /&gt;
=Abstract=&lt;br /&gt;
As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
Our goal is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs (Complete)&lt;br /&gt;
##&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Raw Program List&amp;lt;/code&amp;gt; Contains 600 entities - vast majority are firmly not hubs (file pedigree unknown)&lt;br /&gt;
##&amp;lt;code&amp;gt;E:\McNair\Projects\Hubs\Hubs Data&amp;lt;/code&amp;gt; - Contains 125 entities - many are not hubs (overlap with above file unknown, this file's pedigree from old Hubs project).&lt;br /&gt;
#Determining the best variables for the scorecard (Complete)&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection (Complete)&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation (In Progress)&lt;br /&gt;
#Collecting the remaining manual data (next step)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*For the current work in progress for building the Hubs datasheet for the scorecard  go to: [[Hubs: Hubs Scorecard]]&lt;br /&gt;
*For a tracker of work in progress for the dataset building for the scorecard go to [[Hubs: Hubs Data Building]]&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs: Hubs Data]]&lt;br /&gt;
*For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Work_Hours&amp;diff=8023</id>
		<title>Work Hours</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Work_Hours&amp;diff=8023"/>
		<updated>2016-08-31T19:20:13Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Please complete your preferred times for the Fall term of 2015 below.&lt;br /&gt;
&lt;br /&gt;
{|  class=&amp;quot;wikitable sortable&amp;quot; style=&amp;quot;border: 1px solid darkgray; bgcolor: #f9f9f9&amp;quot;&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Name'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Mon'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Tues'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Wed'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Thurs'''&lt;br /&gt;
| align=&amp;quot;center&amp;quot; style=&amp;quot;background:#f0f0f0;&amp;quot;|'''Fri'''&lt;br /&gt;
|-&lt;br /&gt;
| Albert Nabiullin||||||3-4:30||12:30-3||3-4:30&lt;br /&gt;
|-&lt;br /&gt;
| Amir Kazempour||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Ariel Sun||11-12, 1:15-2:45||||11-12, 1:15-2:45||||11-12, 1:15-2:45&lt;br /&gt;
|-&lt;br /&gt;
| Ben Baldazo||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Carlin Cherry||||||3-5:00||2:30-4||&lt;br /&gt;
|-&lt;br /&gt;
| Dylan Dickens||1-5:00||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Harsh Upadhyay||3-5:30||3-5:30||3-5:30||3-5:30||3-5:30&lt;br /&gt;
|-&lt;br /&gt;
| Jake Silberman||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| James Chen||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Julia Wang||||||1-4:30||||1-4:00&lt;br /&gt;
|-&lt;br /&gt;
| Marcela Interiano||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Meghana Gaur||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Ramee Saleh||||||3-5:00||3-5:00||3-5:00&lt;br /&gt;
|-&lt;br /&gt;
| Ravali Kruthiventi||3-6||||3-6||||3-6&lt;br /&gt;
|-&lt;br /&gt;
| Todd Rachowin||||||||||&lt;br /&gt;
|-&lt;br /&gt;
| Veeral Shah||12:30-2:30||||||||12:30-2:30&lt;br /&gt;
|-&lt;br /&gt;
| Will Cleland||||12:30-4||||12:30-4||2-5:00 &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Category: McNair Admin]]&lt;br /&gt;
[[admin_classification::Admin| ]]&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7872</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7872"/>
		<updated>2016-07-29T22:51:08Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* List of Variables */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Hubs Pages=&lt;br /&gt;
*The main page for Hubs can be found: [[Hubs (Academic Paper)]]&lt;br /&gt;
*For the current work in progress for building the Hubs datasheet for the scorecard  go to: [[Hubs: Hubs Scorecard]]&lt;br /&gt;
*For a tracker of work in progress for the dataset building for the scorecard go to [[Hubs: Hubs Data Building]]&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs: Hubs Data]]&lt;br /&gt;
&lt;br /&gt;
=List of Variables=&lt;br /&gt;
For a more in-depth discussion of the variables and procedure, please see: [[Hubs: Hubs Scorecard]].  This page reflects the variables being collected, separated into three categories.  Each variable includes a breakdown of the levels being collected (if the definition is not trivial) and an approximate approach.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''07/29''' Ariel: code for the Hubs variables is saved at E:/McNair/Projects/Hubs/Hubs Variable-Ariel&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''As of Week of 7/25'''&lt;br /&gt;
===Group 1===&lt;br /&gt;
'''Variables Difficult to Obtain'''&lt;br /&gt;
#'''Founding Date''' ''(date_founded)''&lt;br /&gt;
#*''' ''Difficulty:'' ''' Finding date based on our strategies&lt;br /&gt;
#*''' ''New Approach:'' ''' &lt;br /&gt;
#*#Whois.net Date&lt;br /&gt;
#*#Factiva/other press release searches&lt;br /&gt;
#'''Multiple locations within city + Franchise''' (as of now just addresses) ''(multi_address)''&lt;br /&gt;
#*''' ''Difficulty:'' ''' Company or establishment level will impact measurements&lt;br /&gt;
#*''' ''New Approach:'' ''' Will record all addresses at company level&lt;br /&gt;
#'''Onsite Venture Capital v. Angel Investors''' (e.g. # and Assets Under Management) ''(onsite_Vc_bin)/(onsite_vc_list)'' ''(onsite_angel_bin)/etc.''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary, list of investors&lt;br /&gt;
#*''' ''Difficulty:'' ''' Hub website usually does not include investors&lt;br /&gt;
#*''' ''New Approach:'' ''' &lt;br /&gt;
#*#Google key terms with address of Hub&lt;br /&gt;
#*#Start with partners and use google/crunchbase&lt;br /&gt;
&lt;br /&gt;
===Group 2===&lt;br /&gt;
'''Variables Comfortable, Not Complete''' (rough order of most difficult to least difficult)&lt;br /&gt;
#'''Onsite accelerator''' ''(onsite_accel_bin)/(onsite_accel_cnt)/(onsite_accel_list)''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary, count, list&lt;br /&gt;
#*''' ''Difficulty:'' ''' Usually not a list, which requires more scrubbing as many other variables just require us to find one page on a website. &lt;br /&gt;
#*''' ''Approach:'' '''&lt;br /&gt;
#*#Google searches and procedure to use on website yields decent results&lt;br /&gt;
#*#Similar procedure to onsite investors&lt;br /&gt;
#'''Size (# members)''' ''(num_members)''&lt;br /&gt;
#*''' ''Levels:'' ''' Count for companies (currently not planning to include list of companies given that some potential hubs have 200+ members)&lt;br /&gt;
#*''' ''Difficulty:'' ''' Some companies list only selected members rather than all of them, others do not separate current members from alumni, and some just write &amp;quot;we have served more than 120 startups...&amp;quot;&lt;br /&gt;
#*''' ''Approach:'' ''' For companies that have a list, we will count.  For those with select members, we will count those they listed and try to see if there is a comment about how many they have.  For those that just have a statement &amp;quot;with over,&amp;quot; we will write the number and + (e.g. &amp;quot;120+&amp;quot;).&lt;br /&gt;
#'''Office hours investors''' and '''Office hours mentor/advisors''' ''(OH_bin)/(OH_inv_bin)/(OH_inv_list)/etc.''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary for OH, binary for two separate OH, list of names/descriptions of OH&lt;br /&gt;
#*''' ''Difficulty:'' ''' Some companies do not list who OH are with, not always obvious if investor, mentor, or advisor, sometimes not clear if mentor is investor/future investor&lt;br /&gt;
#*''' ''Approach:'' ''' Google approach to get to OH pages, then look up keywords in the description to separate the two types&lt;br /&gt;
#'''Onsite temporary workshops and Networking Meetups''' (Count) ''(onsite_temp_events_bin)/(onsite_temp_workshop_bin)/(onsite_temp_workshop_cnt)/etc.''&lt;br /&gt;
#*''' ''Levels:'' '''  Binary for do they exist, count for each&lt;br /&gt;
#*''' ''Difficulty:'' ''' Difficult for Turkers to differentiate between these two and also other potential events (e.g. symposiums)&lt;br /&gt;
#*''' ''Approach:'' ''' Uses key search terms (e.g. Java/etc.) to separate out workshops and key terms (e.g. lunch/happy hour) for networking meetings&lt;br /&gt;
#'''Onsite code school''' and '''Curriculum''' ''(onsite_long_term_courses)/(onsite_code_school_bin)''&lt;br /&gt;
#*''' ''Levels:'' '''  Binary for do they exist, binary for each&lt;br /&gt;
#*''' ''Difficulty:'' ''' Difficult for Turkers to differentiate between long-term coding programs for individuals and curriculum for startups&lt;br /&gt;
#*''' ''Approach:'' ''' Uses key search terms (e.g. specific code schools) to separate out known code schools and also to look into key terms (e.g. leadership) for curriculum&lt;br /&gt;
#'''Sponsors/Partners''' (University, Corporate) ''(sponsors_cnt)/(sponsors_list)/etc.''&lt;br /&gt;
#*''' ''Levels:'' ''' Count, list of sponsors/partners (if exist), separate columns for university and corporate&lt;br /&gt;
#*''' ''Difficulty:'' ''' Not all companies will list sponsors, partners, or either.  The difference among sponsors, partners, and investors is not always clear.&lt;br /&gt;
#*''' ''Approach:'' ''' Use two different levels and a Google search; then, if a list exists, separate entries containing &amp;quot;college&amp;quot;/&amp;quot;university&amp;quot; from the rest&lt;br /&gt;
#'''Alumni Network''' ''(alumni_bin)/(alumni_list)''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary, list&lt;br /&gt;
#*''' ''Difficulty:'' ''' Not all companies list alumni, some only list &amp;quot;selected&amp;quot;&lt;br /&gt;
#*''' ''Approach:'' ''' Include all that have lists&lt;br /&gt;
#'''Size (sqft)''' ''(size_sqft)''&lt;br /&gt;
#*''' ''Levels:'' ''' Number in sqft&lt;br /&gt;
#*''' ''Difficulty:'' ''' Not all companies list square feet online&lt;br /&gt;
#*''' ''Approach:'' '''&lt;br /&gt;
#*#Google search with key words&lt;br /&gt;
#*#If results do not appear, use of press releases is possible&lt;br /&gt;
#'''Onsite Mentors''' ''(onsite_mentors_bin)/(onsite_mentors_cnt)/(onsite_mentors_list)''&lt;br /&gt;
#*''' ''Levels:'' ''' Count and list of mentors (if exist)&lt;br /&gt;
#*''' ''Difficulty:'' ''' Not all companies list mentors - bigger issue is onsite investors&lt;br /&gt;
#*''' ''Approach:'' ''' Use two different levels and use of google search&lt;br /&gt;
&lt;br /&gt;
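The recording rule for ''num_members'' in Group 2 can be sketched in Python as follows. This is only a minimal illustration under stated assumptions: the function name, its arguments, and the regular expression are hypothetical and are not part of the project's code. It also assumes that a full or selected list, when present, takes priority over a statement on the site.&lt;br /&gt;

```python
import re


def record_member_count(listed_members, statement=None):
    """Return the value to store in num_members, as a string.

    Hypothetical helper: names and the pattern below are illustrative
    assumptions, not the project's actual implementation.
    """
    # If the site lists members (all of them or a selection), count them.
    if listed_members:
        return str(len(listed_members))
    # Otherwise fall back to a phrase such as "more than 120 startups"
    # and record the number followed by a plus sign.
    if statement:
        match = re.search(r"(?:more than|over)\s+(\d[\d,]*)", statement, re.IGNORECASE)
        if match:
            return match.group(1).replace(",", "") + "+"
    # Nothing usable was found; leave the cell empty.
    return ""


print(record_member_count([], "we have served more than 120 startups"))
```

A selected list is still counted as-is here; any accompanying comment about the total would need to be noted separately by the person auditing the row.&lt;br /&gt;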
===Group 3===&lt;br /&gt;
'''Variables Easy to Obtain'''&lt;br /&gt;
#'''Twitter activity''' ''(twit_handle)/(twit_prev_mon_cnt_tweets)/(twit_cnt_followers)/(twit_cnt_retweets)''&lt;br /&gt;
#*''' ''Levels:'' ''' Twitter Handle, # Tweets in a Month, # Followers, # Retweets&lt;br /&gt;
#*''' ''Approach:'' ''' Easy to get twitter handle from Turk or Veeral's code that allows us to run a series of searches on google and then use Gunny's Twitter crawler to get other levels from handle&lt;br /&gt;
#'''Site URL''' ''(url)''&lt;br /&gt;
#*''' ''Levels:'' ''' URL&lt;br /&gt;
#*''' ''Approach:'' ''' Google using Veeral's code that allows us to search &lt;br /&gt;
#'''Whois Date''' ''(date_whois)''&lt;br /&gt;
#*''' ''Levels:'' ''' Date&lt;br /&gt;
#*''' ''Approach:'' ''' Date active website was registered&lt;br /&gt;
#'''Address''' ''(address)''&lt;br /&gt;
#*''' ''Levels:'' ''' Will include all addresses&lt;br /&gt;
#*''' ''Approach:'' ''' Google key terms (e.g. Contact Us) and URL using Veeral's code&lt;br /&gt;
#'''Nonprofit status''' ''(nonprofit_binary)''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary variable indicating if the potential Hub is a nonprofit organization&lt;br /&gt;
#*''' ''Approach:'' ''' http://www.guidestar.org/ is a site that we can use to search if a company is nonprofit or not&lt;br /&gt;
#'''Mission statement''' ''(missions_stmt)''&lt;br /&gt;
#*''' ''Levels:'' ''' Official mission statement or description of company (if mission does not exist)&lt;br /&gt;
#*''' ''Approach:'' ''' If there is no explicitly stated mission statement, will include the &amp;quot;About&amp;quot; text or statements on the main page&lt;br /&gt;
#'''Specific Industry''' ''(spec_industry)''&lt;br /&gt;
#*''' ''Levels:'' ''' Industry included in statement (no aggregation)&lt;br /&gt;
#*''' ''Approach:'' ''' Based on Mission Statement, not aggregated&lt;br /&gt;
#'''Price for a space/office''' ''(price_space)''&lt;br /&gt;
#*''' ''Levels:'' ''' Two prices: one for shared space, the other for private&lt;br /&gt;
#*''' ''Approach:'' ''' Uses google methodology with key terms and URL&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7871</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7871"/>
		<updated>2016-07-29T22:50:36Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* List of Variables */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Hubs Pages=&lt;br /&gt;
*The main page for Hubs can be found: [[Hubs (Academic Paper)]]&lt;br /&gt;
*For the current work in progress for building the Hubs datasheet for the scorecard  go to: [[Hubs: Hubs Scorecard]]&lt;br /&gt;
*For a tracker of work in progress for the dataset building for the scorecard go to [[Hubs: Hubs Data Building]]&lt;br /&gt;
*For a high-level overview of the variables for the scorecard go to [[Hubs: Hubs Data]]&lt;br /&gt;
&lt;br /&gt;
=List of Variables=&lt;br /&gt;
For a more in-depth discussion of the variables and procedure, please see: [[Hubs: Hubs Scorecard]].  This page reflects the variables being collected, separated into three categories.  Each variable includes a breakdown of the levels being collected (if the definition is not trivial) and an approximate approach.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''07/29''' Ariel E:/McNair/Projects/Hubs/Hubs Variale-Ariel&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''As of Week of 7/25'''&lt;br /&gt;
===Group 1===&lt;br /&gt;
'''Variables Difficult to Obtain'''&lt;br /&gt;
#'''Founding Date''' ''(date_founded)''&lt;br /&gt;
#*''' ''Difficulty:'' ''' Finding date based on our strategies&lt;br /&gt;
#*''' ''New Approach:'' ''' &lt;br /&gt;
#*#Whois.net Date&lt;br /&gt;
#*#Factiva/other press release searches&lt;br /&gt;
#'''Multiple locations within city + Franchise''' (as of now just addresses) ''(multi_address)''&lt;br /&gt;
#*''' ''Difficulty:'' ''' Company or establishment level will impact measurements&lt;br /&gt;
#*''' ''New Approach:'' ''' Will record all addresses at company level&lt;br /&gt;
#'''Onsite Venture Capital v. Angel Investors''' (e.g. # and Assets Under Management) ''(onsite_Vc_bin)/(onsite_vc_list)'' ''(onsite_angel_bin)/etc.''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary, list of investors&lt;br /&gt;
#*''' ''Difficulty:'' ''' Hub website usually does not include investors&lt;br /&gt;
#*''' ''New Approach:'' ''' &lt;br /&gt;
#*#Google key terms with address of Hub&lt;br /&gt;
#*#Start with partners and use google/crunchbase&lt;br /&gt;
&lt;br /&gt;
===Group 2===&lt;br /&gt;
'''Variables Comfortable, Not Complete''' (rough order of most difficult to least difficult)&lt;br /&gt;
#'''Onsite accelerator''' ''(onsite_accel_bin)/(onsite_accel_cnt)/(onsite_accel_list)''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary, count, list&lt;br /&gt;
#*''' ''Difficulty:'' ''' Usually not a list, which requires more scrubbing as many other variables just require us to find one page on a website. &lt;br /&gt;
#*''' ''Approach:'' '''&lt;br /&gt;
#*#Google searches and procedure to use on website yields decent results&lt;br /&gt;
#*#Similar procedure to onsite investors&lt;br /&gt;
#'''Size (# members)''' ''(num_members)''&lt;br /&gt;
#*''' ''Levels:'' ''' Count for companies (currently not planning to include list of companies given that some potential hubs have 200+ members)&lt;br /&gt;
#*''' ''Difficulty:'' ''' Some companies list only selected members rather than all of them, others do not separate current members from alumni, and some just write &amp;quot;we have served more than 120 startups...&amp;quot;&lt;br /&gt;
#*''' ''Approach:'' ''' For companies that have a list, we will count.  For those with select members, we will count those they listed and try to see if there is a comment about how many they have.  For those that just have a statement &amp;quot;with over,&amp;quot; we will write the number and + (e.g. &amp;quot;120+&amp;quot;).&lt;br /&gt;
#'''Office hours investors''' and '''Office hours mentor/advisors''' ''(OH_bin)/(OH_inv_bin)/(OH_inv_list)/etc.''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary for OH, binary for two separate OH, list of names/descriptions of OH&lt;br /&gt;
#*''' ''Difficulty:'' ''' Some companies do not list who OH are with, not always obvious if investor, mentor, or advisor, sometimes not clear if mentor is investor/future investor&lt;br /&gt;
#*''' ''Approach:'' ''' Google approach to get to OH pages, then look up keywords in the description to separate the two types&lt;br /&gt;
#'''Onsite temporary workshops and Networking Meetups''' (Count) ''(onsite_temp_events_bin)/(onsite_temp_workshop_bin)/(onsite_temp_workshop_cnt)/etc.''&lt;br /&gt;
#*''' ''Levels:'' '''  Binary for do they exist, count for each&lt;br /&gt;
#*''' ''Difficulty:'' ''' Difficult for Turkers to differentiate between these two and also other potential events (e.g. symposiums)&lt;br /&gt;
#*''' ''Approach:'' ''' Uses key search terms (e.g. Java/etc.) to separate out workshops and key terms (e.g. lunch/happy hour) for networking meetings&lt;br /&gt;
#'''Onsite code school''' and '''Curriculum''' ''(onsite_long_term_courses)/(onsite_code_school_bin)''&lt;br /&gt;
#*''' ''Levels:'' '''  Binary for do they exist, binary for each&lt;br /&gt;
#*''' ''Difficulty:'' ''' Difficult for Turkers to differentiate between long-term coding programs for individuals and curriculum for startups&lt;br /&gt;
#*''' ''Approach:'' ''' Uses key search terms (e.g. specific code schools) to separate out known code schools and also to look into key terms (e.g. leadership) for curriculum&lt;br /&gt;
#'''Sponsors/Partners''' (University, Corporate) ''(sponsors_cnt)/(sponsors_list)/etc.''&lt;br /&gt;
#*''' ''Levels:'' ''' Count, list of sponsors/partners (if exist), separate columns for university and corporate&lt;br /&gt;
#*''' ''Difficulty:'' ''' Not all companies will list sponsors, partners, or either.  The difference among sponsors, partners, and investors is not always clear.&lt;br /&gt;
#*''' ''Approach:'' ''' Use two different levels and a Google search; then, if a list exists, separate entries containing &amp;quot;college&amp;quot;/&amp;quot;university&amp;quot; from the rest&lt;br /&gt;
#'''Alumni Network''' ''(alumni_bin)/(alumni_list)''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary, list&lt;br /&gt;
#*''' ''Difficulty:'' ''' Not all companies list alumni, some only list &amp;quot;selected&amp;quot;&lt;br /&gt;
#*''' ''Approach:'' ''' Include all that have lists&lt;br /&gt;
#'''Size (sqft)''' ''(size_sqft)''&lt;br /&gt;
#*''' ''Levels:'' ''' Number in sqft&lt;br /&gt;
#*''' ''Difficulty:'' ''' Not all companies list square feet online&lt;br /&gt;
#*''' ''Approach:'' '''&lt;br /&gt;
#*#Google search with key words&lt;br /&gt;
#*#If results do not appear, use of press releases is possible&lt;br /&gt;
#'''Onsite Mentors''' ''(onsite_mentors_bin)/(onsite_mentors_cnt)/(onsite_mentors_list)''&lt;br /&gt;
#*''' ''Levels:'' ''' Count and list of mentors (if exist)&lt;br /&gt;
#*''' ''Difficulty:'' ''' Not all companies list mentors - bigger issue is onsite investors&lt;br /&gt;
#*''' ''Approach:'' ''' Use two different levels and use of google search&lt;br /&gt;
&lt;br /&gt;
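The keyword approach for separating temporary workshops from networking meetups in Group 2 could look roughly like the Python sketch below. The term lists are assumptions that extend the examples given above (Java, lunch, happy hour), and the function name is hypothetical; the real search terms would need to be agreed on and audited.&lt;br /&gt;

```python
# Illustrative term lists only: "java", "lunch" and "happy hour" come from the
# examples above; "workshop" is an added assumption.
WORKSHOP_TERMS = ("java", "workshop")
NETWORKING_TERMS = ("lunch", "happy hour")


def classify_event(description):
    """Label an event description as workshop, networking, or other."""
    text = description.lower()
    # Workshop terms are checked first, so an event matching both lists
    # is labelled a workshop in this sketch.
    if any(term in text for term in WORKSHOP_TERMS):
        return "workshop"
    if any(term in text for term in NETWORKING_TERMS):
        return "networking"
    # Events such as symposiums match neither list.
    return "other"


print(classify_event("Founders Happy Hour"))
```

Simple substring matching is used to keep the sketch short; it will also match longer words that contain a term, so results would still need an audit.&lt;br /&gt;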
===Group 3===&lt;br /&gt;
'''Variables Easy to Obtain'''&lt;br /&gt;
#'''Twitter activity''' ''(twit_handle)/(twit_prev_mon_cnt_tweets)/(twit_cnt_followers)/(twit_cnt_retweets)''&lt;br /&gt;
#*''' ''Levels:'' ''' Twitter Handle, # Tweets in a Month, # Followers, # Retweets&lt;br /&gt;
#*''' ''Approach:'' ''' Easy to get twitter handle from Turk or Veeral's code that allows us to run a series of searches on google and then use Gunny's Twitter crawler to get other levels from handle&lt;br /&gt;
#'''Site URL''' ''(url)''&lt;br /&gt;
#*''' ''Levels:'' ''' URL&lt;br /&gt;
#*''' ''Approach:'' ''' Google using Veeral's code that allows us to search &lt;br /&gt;
#'''Whois Date''' ''(date_whois)''&lt;br /&gt;
#*''' ''Levels:'' ''' Date&lt;br /&gt;
#*''' ''Approach:'' ''' Date active website was registered&lt;br /&gt;
#'''Address''' ''(address)''&lt;br /&gt;
#*''' ''Levels:'' ''' Will include all addresses&lt;br /&gt;
#*''' ''Approach:'' ''' Google key terms (e.g. Contact Us) and URL using Veeral's code&lt;br /&gt;
#'''Nonprofit status''' ''(nonprofit_binary)''&lt;br /&gt;
#*''' ''Levels:'' ''' Binary variable indicating if the potential Hub is a nonprofit organization&lt;br /&gt;
#*''' ''Approach:'' ''' http://www.guidestar.org/ is a site that we can use to search if a company is nonprofit or not&lt;br /&gt;
#'''Mission statement''' ''(missions_stmt)''&lt;br /&gt;
#*''' ''Levels:'' ''' Official mission statement or description of company (if mission does not exist)&lt;br /&gt;
#*''' ''Approach:'' ''' If there is no explicitly stated mission statement, will include the &amp;quot;About&amp;quot; text or statements on the main page&lt;br /&gt;
#'''Specific Industry''' ''(spec_industry)''&lt;br /&gt;
#*''' ''Levels:'' ''' Industry included in statement (no aggregation)&lt;br /&gt;
#*''' ''Approach:'' ''' Based on Mission Statement, not aggregated&lt;br /&gt;
#'''Price for a space/office''' ''(price_space)''&lt;br /&gt;
#*''' ''Levels:'' ''' Two prices: one for shared space, the other for private&lt;br /&gt;
#*''' ''Approach:'' ''' Uses google methodology with key terms and URL&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7870</id>
		<title>Ariel Sun (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7870"/>
		<updated>2016-07-29T22:48:07Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Work Log]]&lt;br /&gt;
[[Ariel Sun]] [[Work Logs]] [[Ariel Sun (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
06/01/2016 - Introduction/Wiki Building&lt;br /&gt;
&lt;br /&gt;
06/02/2016 - Refined Wiki Organization and Content&lt;br /&gt;
&lt;br /&gt;
06/03/2016 - Organized Topic Areas and Worked on Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/06/2016 - Continued Organizing Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/07/2016 - Draft of Women in Entrepreneurship blog post&lt;br /&gt;
&lt;br /&gt;
06/08/2016 - Work on Challenges Women Entrepreneurs Face wiki page&lt;br /&gt;
&lt;br /&gt;
06/09/2016 - Clean up content of patent trolls and put on the public page&lt;br /&gt;
&lt;br /&gt;
06/10/2016 - Put up resources for business dynamism in high tech issue brief page&lt;br /&gt;
&lt;br /&gt;
06/13/2016 - Clean up venture one data and LBO data&lt;br /&gt;
&lt;br /&gt;
06/14/2016 - Match venture one and LBO data to patent data&lt;br /&gt;
&lt;br /&gt;
06/15/2016 - Create tables that match patent information to each LBO/venture company&lt;br /&gt;
&lt;br /&gt;
06/16/2016 - Create and Finalize LBO/venture company and patent summary table&lt;br /&gt;
&lt;br /&gt;
06/17/2016 - Familiarize with Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/20/2016 - Analyze existing SQL code of Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/21/2016 - Clean up and rebuild Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/22/2016 - First draft of complete SQL script for Hubs datasets&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
07/15/2016 - Help Ed send out VentureOne data, add grouping and considerations of Hubs scorecard variables&lt;br /&gt;
&lt;br /&gt;
07/18/2016 - Work on differentiating curriculum v. code school, redo Matching VentureOne&lt;br /&gt;
&lt;br /&gt;
07/19/2016 - Finish Matching VentureOne, update on Wiki, work on differentiating curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
07/20/2016 - Work on differentiating OH investor v. mentor, temporary workshop v. networking meetup&lt;br /&gt;
&lt;br /&gt;
07/21/2016 - Consolidate previously done hubs variables, work on number of onsite accelerators&lt;br /&gt;
&lt;br /&gt;
07/22/2016 - Work on Hubs variables: price of flexible/dedicated desk, Onsite VCs v. Angel Investors, number of members&lt;br /&gt;
&lt;br /&gt;
07/29/2016 - Code for Hubs variables that are hubs, E:/McNair/Projects/Hubs/Hus Variables-Ariel&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data_Building&amp;diff=7492</id>
		<title>Hubs: Hubs Data Building</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data_Building&amp;diff=7492"/>
		<updated>2016-07-26T21:38:03Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Variables==&lt;br /&gt;
Procedure: copy the template below for each variable being worked on:&lt;br /&gt;
&lt;br /&gt;
#'''(URL)'''&lt;br /&gt;
#**'''STATUS''': Complete&lt;br /&gt;
#**'''Previously Collected''': Yes&lt;br /&gt;
#**'''Published on Mechanical Turk''': No&lt;br /&gt;
#**'''Audited''': NA&lt;br /&gt;
#***Audit Results: NA&lt;br /&gt;
#***#:&lt;br /&gt;
#**'''Updates''': The file is saved at E:/McNair/Projects/Hubs/Searching/URLs.csv&lt;br /&gt;
#'''(Address)'''&lt;br /&gt;
#**'''STATUS''': In Progress&lt;br /&gt;
#**'''Previously Collected''': No&lt;br /&gt;
#**'''Published on Mechanical Turk''': No&lt;br /&gt;
#**'''Audited''': No&lt;br /&gt;
#***Audit Results: &lt;br /&gt;
#***#:&lt;br /&gt;
#**'''Updates''': The file ready for Turk is saved at E:/McNair/Projects/Hubs/Mechanical Turk/Address.csv&lt;br /&gt;
#'''(Mission/About)'''&lt;br /&gt;
#**'''STATUS''': In Progress&lt;br /&gt;
#**'''Previously Collected''': Yes&lt;br /&gt;
#**'''Published on Mechanical Turk''': No&lt;br /&gt;
#**'''Audited''': No&lt;br /&gt;
#***Audit Results: &lt;br /&gt;
#***#:&lt;br /&gt;
#**'''Updates''': The file ready for Turk is saved at E:/McNair/Projects/Hubs/Mechanical Turk/Mission.csv&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data_Building&amp;diff=7491</id>
		<title>Hubs: Hubs Data Building</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data_Building&amp;diff=7491"/>
		<updated>2016-07-26T21:37:22Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Variables */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Variables==&lt;br /&gt;
Procedure: copy the template below for each variable being worked on:&lt;br /&gt;
&lt;br /&gt;
#'''(URL)'''&lt;br /&gt;
#**'''STATUS''': Complete&lt;br /&gt;
#**'''Previously Collected''': Yes&lt;br /&gt;
#**'''Published on Mechanical Turk''': No&lt;br /&gt;
#**'''Audited''': NA&lt;br /&gt;
#***Audit Results: NA&lt;br /&gt;
#***#:&lt;br /&gt;
#**'''Updates''': The file is saved at E:/McNair/Projects/Hubs/Searching/URLs.csv&lt;br /&gt;
#'''(Address)'''&lt;br /&gt;
#**'''STATUS''': In Progress&lt;br /&gt;
#**'''Previously Collected''': No&lt;br /&gt;
#**'''Published on Mechanical Turk''': No&lt;br /&gt;
#**'''Audited''': No&lt;br /&gt;
#***Audit Results: &lt;br /&gt;
#***#:&lt;br /&gt;
#**'''Updates''': The file ready for Turk is saved at E:/McNair/Projects/Hubs/Mechanical Turk/Address.csv&lt;br /&gt;
#'''(Mission/About)'''&lt;br /&gt;
#**'''STATUS''': In Progress&lt;br /&gt;
#**'''Previously Collected''': Yes&lt;br /&gt;
#**'''Published on Mechanical Turk''': No&lt;br /&gt;
#**'''Audited''': No&lt;br /&gt;
#***Audit Results: &lt;br /&gt;
#***#:&lt;br /&gt;
#**'''Updates''': The file is saved at E:/McNair/Projects/Hubs/Mechanical Turk/Mission.csv&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7409</id>
		<title>Ariel Sun (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7409"/>
		<updated>2016-07-22T21:49:59Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Work Log]]&lt;br /&gt;
[[Ariel Sun]] [[Work Logs]] [[Ariel Sun (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
06/01/2016 - Introduction/Wiki Building&lt;br /&gt;
&lt;br /&gt;
06/02/2016 - Refined Wiki Organization and Content&lt;br /&gt;
&lt;br /&gt;
06/03/2016 - Organized Topic Areas and Worked on Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/06/2016 - Continued Organizing Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/07/2016 - Draft of Women in Entrepreneurship blog post&lt;br /&gt;
&lt;br /&gt;
06/08/2016 - Work on Challenges Women Entrepreneurs Face wiki page&lt;br /&gt;
&lt;br /&gt;
06/09/2016 - Clean up content of patent trolls and put on the public page&lt;br /&gt;
&lt;br /&gt;
06/10/2016 - Put up resources for business dynamism in high tech issue brief page&lt;br /&gt;
&lt;br /&gt;
06/13/2016 - Clean up venture one data and LBO data&lt;br /&gt;
&lt;br /&gt;
06/14/2016 - Match venture one and LBO data to patent data&lt;br /&gt;
&lt;br /&gt;
06/15/2016 - Create tables that match patent information to each LBO/venture company&lt;br /&gt;
&lt;br /&gt;
06/16/2016 - Create and Finalize LBO/venture company and patent summary table&lt;br /&gt;
&lt;br /&gt;
06/17/2016 - Familiarize with Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/20/2016 - Analyze existing SQL code of Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/21/2016 - Clean up and rebuild Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/22/2016 - First draft of complete SQL script for Hubs datasets&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
07/15/2016 - Help Ed send out VentureOne data, add grouping and considerations of Hubs scorecard variables&lt;br /&gt;
&lt;br /&gt;
07/18/2016 - Work on differentiating curriculum v. code school, redo Matching VentureOne&lt;br /&gt;
&lt;br /&gt;
07/19/2016 - Finish Matching Venture One, update on Wiki, work on differentiating curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
07/20/2016 - Work on differentiating OH investor v. mentor, temporary workshop v. networking meetup&lt;br /&gt;
&lt;br /&gt;
07/21/2016 - Consolidate previously done hubs variables, work on number of onsite accelerators&lt;br /&gt;
&lt;br /&gt;
07/22/2016 - Work on Hubs variables: price of flexible/dedicated desk, Onsite VCs v. Angel Investors, number of members&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7408</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7408"/>
		<updated>2016-07-22T21:42:54Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite VC v. Angel Investors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
Most of the variables fall under a few categories:&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#URL&lt;br /&gt;
#Address&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Founding Date&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
Variables for which the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that have complete accuracy even if not complete coverage (e.g. a task that identifies an onsite mentor for only 40% of firms, but never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a procedure for automating Google searches over different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/21): G/Y: Founding date, size (members) issues&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/21): G/Y: much progress has been made, but issues with onsite venture capitalists/angel investors&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/21): Hannah working on this&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
&lt;br /&gt;
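The audit step can be summarized as agreement rates between our manual answers and the turkers' answers. The following is a minimal sketch only; the function name, the input layout (one list of turker answers per company), and the sample handles are assumptions for illustration, not the procedure actually used.&lt;br /&gt;

```python
def agreement_rates(reference, turker_answers):
    """Compare turker answers against our manually collected answers.

    reference: dict mapping company to our answer (hypothetical layout).
    turker_answers: dict mapping company to the list of turker answers.
    Returns (share where all turkers agree with us,
             share where at least 2 turkers agree with us).
    """
    all_agree = 0
    two_agree = 0
    for company, ours in reference.items():
        answers = turker_answers[company]
        matches = sum(1 for a in answers if a == ours)
        if matches == len(answers):
            all_agree += 1
        if matches >= 2:
            two_agree += 1
    total = len(reference)
    return all_agree / total, two_agree / total


# Made-up sample data, for illustration only.
ours = {"hub_a": "@hub_a", "hub_b": "@hub_b"}
turk = {
    "hub_a": ["@hub_a", "@hub_a", "@hub_a"],
    "hub_b": ["@hub_b", "@hub_b", "@hub_b_nyc"],
}
rates = agreement_rates(ours, turk)
```

With three turkers per company, the two returned shares correspond to "all 3 agreed" and "at least 2 agreed".&lt;br /&gt;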
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, which is not the same measure we are using&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared to 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/20)''': Gunny has created a tool to do this process&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost to either record the number of tweets in a month or look up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE''' (7/14)&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO/No Need (Veeral)&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''': Veeral has code to do this procedure &lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format ___.__/ (e.g. if url is example.us/other, record example.us/)&lt;br /&gt;
#Address&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''': Code written.  Difficulties occur with very large companies (e.g. Impact Hub).  Will require Veeral's program. Expected time for each assignment is 10-20 seconds, so the recommended pay rate is $.05&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Using Veeral's code, crossproduct allintext: (Group A) and site: (Group B), where '''Group A'''=Contact (high coverage), About Us, Find Us, Locations, Address, '''Group B'''= Company URLs.&lt;br /&gt;
#*#Click on first result.  If addresses exist, record in ADDRESS, STATE, and ZIP.&lt;br /&gt;
#*#If not, go to company's URL. If addresses exist, record in ADDRESS, STATE, and ZIP.&lt;br /&gt;
#*#If an address exists but the ZIP does not, enter the address into a search engine and record the ZIP.&lt;br /&gt;
#*#Otherwise, record DNE.&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (allintext: About/Mission site: from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Given that most companies include their specialty in mission statement and difficulty to turk, we will manually check each mission statement and mark it accordingly. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#NONE&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1, otherwise record 0, for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds because the list of sponsors/partners can be long, so the recommended pay rate is $.12. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose first result from Search Text 1 and Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If there does not exist a list or the link was for only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#*#Record all Partners from Search Text 2 into PARTNERS. If there does not exist a list or the link was for only 1 partner, record DNE.&lt;br /&gt;
#*#If any Partners from Search Text 2 include a University or College (will be listed in name), record them into UNIVERSITY PARTNERS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''':  Code 1 written, code 2 needs more work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company’s URL&lt;br /&gt;
#*#On the homepage, look for the section related to pricing. If pricing is not found on the homepage, look for the links ‘coworking’, ‘work space’, ‘membership’, ‘pricing’, ‘join’ or ‘Apply for membership’, and look for the pricing information under those links. If there is no price related section, record DNE for both ‘Flexible Desk’ and ‘Dedicated Desk’.&lt;br /&gt;
#*#If there is pricing information, look for the monthly price of a shared space, often denoted as Shared/Flexible desk/non-dedicated desk, and record the price in ‘Flexible Desk’. If the price is not found, record DNE.&lt;br /&gt;
#*#Look for the monthly price of a dedicated desk, often denoted as Reserved/dedicated desk/private desk, and record the price in ‘Dedicated Desk’.  If the price is not found, record DNE.&lt;br /&gt;
#*#If price information is not found and there is a ‘locations’ link, click on it and choose the first location in the list. Repeat steps 3-4. &lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Keywords: 24/7 access, dedicated desk, pricing&lt;br /&gt;
#*#Google allintext:&amp;quot;keywords&amp;quot; site:URL&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': To Be Discussed Further&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Difficulties observed when figuring out how to Turk this &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
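The URL format rule (record example.us/ for example.us/other) and the Address search step (cross product of Group A keywords with Group B company URLs) could be sketched as follows. This is an illustration under stated assumptions only: the function names are hypothetical, it is not Veeral's code, and the exact query spacing is a guess based on the search text described above.&lt;br /&gt;

```python
from itertools import product
from urllib.parse import urlsplit

# Group A keywords as listed in the Address code above.
GROUP_A = ["Contact", "About Us", "Find Us", "Locations", "Address"]


def normalize_url(url):
    """Reduce a URL to the 'domain/' format, e.g. example.us/other to example.us/."""
    # urlsplit only recognizes the domain when a scheme or '//' is present.
    if "//" not in url:
        url = "//" + url
    return urlsplit(url).netloc + "/"


def build_queries(keywords, urls):
    """Cross product: one 'allintext:KEYWORD site:DOMAIN' query per pair."""
    queries = []
    for keyword, url in product(keywords, urls):
        domain = normalize_url(url).rstrip("/")
        queries.append("allintext:{} site:{}".format(keyword, domain))
    return queries


# Hypothetical Group B list with a single company URL.
queries = build_queries(GROUP_A, ["example.us/other"])
```

Five keywords and one URL give five queries, one per keyword; a longer URL list multiplies the count accordingly.&lt;br /&gt;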
===Group 2===&lt;br /&gt;
#Size (SQFT)&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
#Size (# Companies)&lt;br /&gt;
#*'''BRAINSTORM''': (7/22) Some companies don’t list all members but only selected ones. Some companies do not separate current members and alumni, and say things like: &amp;quot;we have served more than 120 startups...&amp;quot;&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''': Brainstorm and code updated (Capital Factory 227)&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#*#Count the number of members&lt;br /&gt;
#*#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for the description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#*#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search allintitle:&amp;quot;Members/Startups/Residents/Villagers/Ventures&amp;quot; site:URL in Google.&lt;br /&gt;
#*#If no result found, record DNE.&lt;br /&gt;
#*#If there are results, go to the first result, which is usually in a form like &amp;quot;Members - Company Name&amp;quot;.&lt;br /&gt;
#*#If the result directs you to a page that lists the members of the company, count the number of companies and record the number.&lt;br /&gt;
#*#If the result directs you to a page that does not give information on number of members, record DNE.&lt;br /&gt;
&lt;br /&gt;
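Once the sentences for Size (SQFT) are recorded, the number could be pulled out with a pattern match. This is a rough sketch only: the pattern and the function name are assumptions for illustration, not part of the turk code, and the results would still need auditing.&lt;br /&gt;

```python
import re

# Matches a number followed by sqft / sq ft / sq. ft. / square foot / square feet.
SQFT_PATTERN = re.compile(
    r"(\d[\d,]*)\s*(?:sq\.?\s*ft\.?|square\s+(?:foot|feet))",
    re.IGNORECASE,
)


def extract_sqft(sentence):
    """Return the square footage found in a recorded sentence, or 'DNE'."""
    match = SQFT_PATTERN.search(sentence)
    if match is None:
        return "DNE"
    # Drop thousands separators before converting to an integer.
    return int(match.group(1).replace(",", ""))


# Made-up sample sentence, for illustration only.
size = extract_sqft("Our 50,000 square foot campus is open to members.")
```

Sentences without a recognizable size return DNE, matching the convention used in the steps above.&lt;br /&gt;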
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record the link name clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text (Mentor/Mentorship) into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written - 2nd part, while more manual, appears to have greater range.  2nd code would only require Veeral's code.  1st code expected completion time is 30 seconds.&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company's URL&lt;br /&gt;
#*#Look for the link 'Accelerators' or 'Accelerating/Accelerator/Acceleration/Accelerate Programs'&lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number. **or also copy the names of the accelerators?&lt;br /&gt;
#*#If accelerators are not found in step 1, go to the links 'Services' , 'Benefit', 'Resources', 'For Entrepreneurs', 'Startups' and look for the section of 'Accelerator/Accelerating Programs' &lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number.&lt;br /&gt;
#*#If accelerators are not found, record 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search [allintitle:&amp;quot;accelerator&amp;quot;/&amp;quot;accelerate&amp;quot; site:URL] in Google&lt;br /&gt;
#*#Copy the titles of the results. **We have to scrutinize the titles ourselves to determine whether they are distinct onsite accelerators and record the number manually.&lt;br /&gt;
#*#If no result appears, record 0.&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on 'mentor' pages/sections must all be mentors, and the same applies for investors/OH investors, although few companies list their investors. So here the only thing we are trying to differentiate is whether the mentor is an investor, maybe by checking whether they are from a VC firm? But even if they are from VC companies, that doesn't mean they are going to invest in the startups of the Hubs they are mentoring at.  Another way to think about it is differentiating between mentors and OH mentors. Mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark ''office hours'' as 1 if there is a result, otherwise mark as 0.&lt;br /&gt;
#Click on the first five results&lt;br /&gt;
#On each of the five pages, search for two items:&lt;br /&gt;
##search for 'mentor'. (Ctrl + F) If 'mentor' appears in the description paragraph of office hours on any of the five pages, mark ''mentor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
##search for 'fund'. (Ctrl + F) If 'fund' appears in the description paragraph of office hours on any of the five pages, mark ''investor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: a business issue (e.g. online marketing) or a technical skill (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
**Do we care about what particular workshops (e.g. coding, leadership, etc.)?&lt;br /&gt;
**Summits/major events&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet and network with fellow entrepreneurs and experts&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 1 of 2''':&lt;br /&gt;
*#Search the Search Text 1 (allintext: events site: URL) and choose the link to &quot;Events&quot;, &quot;Calendar&quot;, or related.  Record 'url' on SOURCE. If this does not exist, go to Step 7.&lt;br /&gt;
*#For all events that have dates, copy the events from today's date to the following month into ALL EVENTS&lt;br /&gt;
*#For all events that have Office Hours in the name, record the events in OFFICE HOURS.  For all events that have summit, record the events in SUMMITS.&lt;br /&gt;
*#For all events that are related to teaching or learning (e.g. contain &quot;Training,&quot; &quot;Seminar,&quot; &quot;Class,&quot; &quot;Learn,&quot; &quot;Bootcamp,&quot; &quot;Workshop,&quot; &quot;Pitch Event&quot;), copy the name of the events into WORKSHOPS&lt;br /&gt;
*#For all events that are related to social activities and networking (e.g. &quot;Social,&quot; &quot;Meet Up,&quot; &quot;Breakfast&quot;/&quot;Lunch&quot;/&quot;Happy Hour&quot;, &quot;Movie Night&quot;/&quot;Bowling&quot;), copy the name of the events into NETWORKING.  For all events that are unclear or did not fit into these descriptions&lt;br /&gt;
*#If a message explicitly says there are no events, mark as 0 for ALL EVENTS, OFFICE HOURS, SUMMITS, WORKSHOPS, and NETWORKING &lt;br /&gt;
*#If this does not exist, search Search Text 2 (allintext: Company Name site: meetup.com) and click on the meetup.com for the company if it exists.  If it does exist, record meetup on SOURCE.  If not, go to step 9.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
*#If this does not exist, search Search Text 3 (allintext: Company Name site: eventbrite.com) and click on the eventbrite.com for the company if it exists.  If it does exist, record eventbrite on SOURCE.  If not, mark DNE for all variables.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 2 of 2''':&lt;br /&gt;
#Go to Company URL&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events from today's date through the following month and record it in ALL EVENTS. If the website has no information on events or their dates, record DNE for all variables.&lt;br /&gt;
#For all events that have Office Hours in the name, count the number of events in OFFICE HOURS.  For all events that have summit, count the number of the events in SUMMITS.&lt;br /&gt;
#For all events that are related to teaching or learning (e.g. contain &quot;Training,&quot; &quot;Seminar,&quot; &quot;Class,&quot; &quot;Learn,&quot; &quot;Bootcamp,&quot; &quot;Workshop,&quot; &quot;Pitch Event&quot;), count the events and record the number in WORKSHOPS&lt;br /&gt;
#For all events that are related to social activities and networking (e.g. &quot;Social,&quot; &quot;Meet Up,&quot; &quot;Breakfast&quot;/&quot;Lunch&quot;/&quot;Happy Hour&quot;, &quot;Movie Night&quot;/&quot;Bowling&quot;), count the events and record the number in NETWORKING&lt;br /&gt;
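&lt;br /&gt;
The keyword rules in the two Turks above can be sketched in a few lines of Python. This is only an illustration, not part of the published Turk: the function names are hypothetical, the keyword lists are copied from the steps above, and matching is a plain case-insensitive substring test, so 'class' would also match 'classic' and 'meet up' would not match 'meetup'.&lt;br /&gt;

```python
# Keyword lists copied from the Turk steps; everything else is an assumption.
CATEGORY_KEYWORDS = {
    'OFFICE HOURS': ['office hours'],
    'SUMMITS': ['summit'],
    'WORKSHOPS': ['training', 'seminar', 'class', 'learn', 'bootcamp',
                  'workshop', 'pitch event'],
    'NETWORKING': ['social', 'meet up', 'breakfast', 'lunch', 'happy hour',
                   'movie night', 'bowling'],
}


def classify_event(name):
    # Return every category whose keywords appear in the event name.
    lowered = name.lower()
    return [category
            for category, keywords in CATEGORY_KEYWORDS.items()
            if any(keyword in lowered for keyword in keywords)]


def count_events(names):
    # Count events per category; ALL EVENTS counts every name once.
    counts = {category: 0 for category in CATEGORY_KEYWORDS}
    counts['ALL EVENTS'] = len(names)
    for name in names:
        for category in classify_event(name):
            counts[category] += 1
    return counts
```

An event that matches no keyword list comes back with an empty category list, which corresponds to the unclear cases that would still need a manual look.&lt;br /&gt;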
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
*Notes: Few companies have a section for their onsite VCs or angel investors. Even the company (Innovation Pavilion) that has Angel programs and VC programs does not conduct the programs by itself, but cooperates with external angel investors or VCs. Some companies have mentors or board members who are from VCs, but that does not mean they will invest in the member startups of those companies.&lt;br /&gt;
&lt;br /&gt;
===Group 5===&lt;br /&gt;
#Multiple Locations&lt;br /&gt;
#*Addresses are included in Group 1, but still needs to be discussed&lt;br /&gt;
#*Getting&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
See Section 3 of [[Hubs (Academic Paper)]]&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7402</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7402"/>
		<updated>2016-07-22T20:45:59Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Group 2 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#URL&lt;br /&gt;
#Address&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Founding Date&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that have complete accuracy even if they are not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, is always right and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
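&lt;br /&gt;
The accuracy versus coverage trade-off in the example can be made concrete with a small calculation. The sketch below is illustrative only: the function name and the firm labels are hypothetical, and it assumes an audit has produced the set of firms that really have the feature.&lt;br /&gt;

```python
def precision_and_coverage(flagged, truth):
    # flagged: firms the task marked as having the feature (hypothetical input)
    # truth: firms that really have it, e.g. from a manual audit
    correct = flagged.intersection(truth)
    # Precision: share of flagged firms that are right (1.0 = never wrong).
    precision = len(correct) / len(flagged) if flagged else 1.0
    # Coverage: share of firms with the feature that the task found.
    coverage = len(correct) / len(truth) if truth else 1.0
    return precision, coverage


# A task that finds 2 of 5 firms with mentors and never flags a wrong one.
example = precision_and_coverage({'a', 'b'}, {'a', 'b', 'c', 'd', 'e'})
```

A task like the one in the example would score a precision of 1.0 with a coverage of 0.4, which is the kind of filter the procedure is aiming for.&lt;br /&gt;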
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''':a discussion/learning of a group of people on specific subjects&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a procedure for automating Google searches for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/21): G/Y: Founding date, size (members) issues&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/21): G/Y: much progress has been made, but issues with onsite venture capitalists/angel investors&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/21): Hannah working on this&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not same as we are&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 companies done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/20)''': Gunny has created a tool to do this process&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Address&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''': Code written.  Difficulties occur with very large companies (e.g. Impact Hub).  Will require Veeral's program, expected time for each assignment is 10-20 seconds - pay rate, therefore, recommended $.05&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Using Veeral's code, take the cross product of allintext: (Group A) and site: (Group B), where '''Group A''' = Contact (high coverage), About Us, Find Us, Locations, Address and '''Group B''' = Company URLs.&lt;br /&gt;
#*#Click on the first result.  If addresses exist, record in ADDRESS, STATE, and ZIP.&lt;br /&gt;
#*#If not, go to company's URL. If addresses exist, record in ADDRESS, STATE, and ZIP.&lt;br /&gt;
#*#If address exists, but ZIP does not, plug in address into search engine and record ZIP.&lt;br /&gt;
#*#Otherwise, record DNE.&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Given that most companies include their specialty in mission statement and difficulty to turk, we will manually check each mission statement and mark it accordingly. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#NONE&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is found, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds due to a potential list of a lot of sponsors/partners - pay rate, therefore, recommended $.12. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose the first result from Search Text 1 and Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If there does not exist a list or the link was for only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#*#Record all Partners from Search Text 2 into PARTNERS. If there does not exist a list or the link was for only 1 partner, record DNE.&lt;br /&gt;
#*#If any Partners from Search Text 2 include a University or College (will be listed in name), record them into UNIVERSITY PARTNERS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''':  Code 1 written, code 2 needs more work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company’s URL&lt;br /&gt;
#*#On the homepage, look for the section related to pricing. If pricing is not found in the homepage, look for the links ‘coworking’, ‘work space’, ‘membership’, ‘pricing’ ,‘join’ or ‘Apply for membership’ , and look for the pricing information under those links. If there is no price related section, record DNE for both ‘Flexible Desk’ and ‘Dedicated Desk’.&lt;br /&gt;
#*#If there is pricing information, look for the price of sharing space per month, often denoted as Shared/Flexible desk/non-dedicated desk, record the price at ‘Flexible Desk’. If the price is not found, record DNE.&lt;br /&gt;
#*#Look for the price of a dedicated desk per month, often denoted as Reserved/dedicated desk/private desk record the price at ‘Dedicated Desk’.  If the price is not found, record DNE.&lt;br /&gt;
#*#If price information is not found and there is a ‘locations’ link, click on it and choose the first location in the list. Repeat steps 3-4. &lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Keywords: 24/7 access, dedicated desk, pricing&lt;br /&gt;
#*#Google allintext:&amp;quot;keywords&amp;quot; site:URL&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': To Be Discussed Further&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Difficulties observed when figuring out how to Turk this &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
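&lt;br /&gt;
Two of the Group 1 steps above are mechanical enough to sketch in Python: building the search strings for the Address step and reducing a URL to the www.___.__/ format of the URL step. This is not Veeral's program, which is not shown on this page; the function names are hypothetical and the code only illustrates the idea.&lt;br /&gt;

```python
from itertools import product
from urllib.parse import urlparse

# Group A terms as listed in the Address step.
GROUP_A = ['Contact', 'About Us', 'Find Us', 'Locations', 'Address']


def build_queries(terms, sites):
    # Cross product: one 'allintext: TERM site: SITE' query per pair.
    return ['allintext: {} site: {}'.format(term, site)
            for term, site in product(terms, sites)]


def normalize_url(url):
    # Reduce a URL to the www.___.__/ format used in the URL step.
    if '//' not in url:
        url = '//' + url  # lets urlparse treat the first part as the host
    host = urlparse(url).netloc.lower()
    if host.startswith('www.'):
        host = host[4:]
    return 'www.' + host + '/'
```

With five Group A terms, each company URL yields five queries, so the number of searches grows quickly with the list of potential hubs.&lt;br /&gt;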
&lt;br /&gt;
===Group 2===&lt;br /&gt;
#Size (SQFT)&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click first link in which result search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
#Size (# Companies)&lt;br /&gt;
#*'''BRAINSTORM''': (7/22) Some companies list only selected members rather than all of them. Some companies do not separate current members from alumni and say something like: &quot;we have served more than 120 startups...&quot;&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''': Brainstorm and code updated (Capital Factory 227)&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#*#Count the number of members&lt;br /&gt;
#*#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#*#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search allintitle:&amp;quot;Members/Startups/Residents/Villagers/Ventures&amp;quot; site:URL in Google.&lt;br /&gt;
#*#If no result found, record DNE.&lt;br /&gt;
#*#If there are results, go to the first result, which is usually in a form like &quot;Members - Company Name&quot;.&lt;br /&gt;
#*#If the result directs you to a page that lists the members of the company, count the number of companies and record the number.&lt;br /&gt;
#*#If the result directs you to a page that does not give information on the number of members, record DNE.&lt;br /&gt;
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence they appear in into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1  in BINARY, copy the sentence it is included in SENTENCE, and record the link name clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written - 2nd part, while more manual, appears to have greater range.  2nd code would only require Veeral's code.  1st code expected completion time is 30 seconds.&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company's URL&lt;br /&gt;
#*#Look for the link 'Accelerators' or 'Accelerating/Accelerator/Acceleration/Accelerate Programs'&lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number. **or also copy the names of the accelerators?&lt;br /&gt;
#*#If accelerators are not found in step 1, go to the links 'Services' , 'Benefit', 'Resources', 'For Entrepreneurs', 'Startups' and look for the section of 'Accelerator/Accelerating Programs' &lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number.&lt;br /&gt;
#*#If accelerators are not found, record 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search [allintitle:&amp;quot;accelerator&amp;quot;/&amp;quot;accelerate&amp;quot; site:URL] in Google&lt;br /&gt;
#*#Copy the titles of the results. **We have to scrutinize the titles ourselves to determine whether they are distinct onsite accelerators and record the number manually.&lt;br /&gt;
#*#If no result appears, record 0.&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
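The Google step above can be sketched in Python as follows. This is a minimal illustration, not the project's actual tooling: it assumes each comma-separated Search Text entry becomes its own quoted query, and the function name build_site_queries and the URL www.example.us are hypothetical.&lt;br /&gt;

```python
def build_site_queries(search_terms, url):
    """Return one quoted-phrase query per term, restricted to a single site.

    Assumption: every Search Text entry is searched separately.
    """
    return ['"{}" site:{}'.format(term.strip(), url) for term in search_terms]


# Search Text 1 for Code School, copied from the list above.
code_school_terms = ["website design", "coding", "web development", "software", "bootcamp"]

# www.example.us is a placeholder for a company URL.
queries = build_site_queries(code_school_terms, "www.example.us")
for query in queries:
    print(query)
```

Each printed line can be pasted into the search engine as-is; a query with no results corresponds to recording 0.&lt;br /&gt;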
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page/section must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether the mentor is an investor, maybe by checking whether they are from a VC firm? But even if they are from a VC firm, that doesn't mean they are going to invest in the startups of the Hubs where they are mentoring. Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark ''office hours'' as 1 if there is a result, otherwise mark as 0.&lt;br /&gt;
#Click on the first five results&lt;br /&gt;
#On each of the five pages, search for two items:&lt;br /&gt;
##search for 'mentor'. (Ctrl + F) If 'mentor' appears in the description paragraph of office hours on any of the five pages, mark ''mentor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
##search for 'fund'. (Ctrl + F) If 'fund' appears in the description paragraph of office hours on any of the five pages, mark ''investor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
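The marking rules above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the office hours description paragraphs have already been copied by hand from the first five results, and the function name mark_office_hours is hypothetical.&lt;br /&gt;

```python
def mark_office_hours(descriptions):
    """Apply the office hours marking rules to copied description paragraphs.

    descriptions: office hours paragraphs from the search results, in order.
    An empty list is treated as a search with no results.
    """
    pages = descriptions[:5]  # only the first five results are checked
    text = " ".join(pages).lower()
    return {
        "office hours": 1 if pages else 0,
        # Substring match, like Ctrl + F: 'fund' also matches 'funding'.
        "mentor OH": 1 if "mentor" in text else "DNE",
        "investor OH": 1 if "fund" in text else "DNE",
    }


example = mark_office_hours(["Meet a mentor every Tuesday during office hours."])
print(example)
```

Because the match is a plain substring search, unrelated words such as 'fundamental' would also trigger ''investor OH'', so results marked 1 may still need a manual check.&lt;br /&gt;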
&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: business issue (e.g. online marketing) or technique learning (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
**Do we care about what particular workshops (e.g. coding, leadership, etc.)?&lt;br /&gt;
**Summits/major events&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet fellow entrepreneurs and experts and network with them&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 1 of 2''':&lt;br /&gt;
*#Search the Search Text 1 (allintext: events site: URL) and choose link to &amp;quot;Events&amp;quot;, &amp;quot;Calendar&amp;quot;, or related.  Record 'url' in SOURCE.  If this does not exist, go to Step 7&lt;br /&gt;
*#For all events that have dates, copy the events from today's date to the following month into ALL EVENTS&lt;br /&gt;
*#For all events that have Office Hours in the name, record the events in OFFICE HOURS.  For all events that have summit, record the events in SUMMITS.&lt;br /&gt;
*#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), copy the name of the events into WORKSHOPS&lt;br /&gt;
*#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), copy the name of the events into NETWORKING.  For all events that are unclear or did not fit into these descriptions&lt;br /&gt;
*#If a message explicitly says there are no events, mark as 0 for ALL EVENTS, OFFICE HOURS, SUMMITS, WORKSHOPS, and NETWORKING &lt;br /&gt;
*#If this does not exist, search Search Text 2 (allintext: Company Name site: meetup.com) and click on the meetup.com for the company if it exists.  If it does exist, record meetup on SOURCE.  If not, go to step 9.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
*#If this does not exist, search Search Text 3 (allintext: Company Name site: eventbrite.com) and click on the eventbrite.com for the company if it exists.  If it does exist, record eventbrite on SOURCE.  If not, mark DNE for all variables.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 2 of 2''':&lt;br /&gt;
#Go to Company URL&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events from today's date through the following month and record it in ALL EVENTS. If there is no information about events or their dates on the website, record DNE for all variables.&lt;br /&gt;
#For all events that have Office Hours in the name, count the number of events in OFFICE HOURS.  For all events that have summit, count the number of the events in SUMMITS.&lt;br /&gt;
#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), count the number of the events into WORKSHOPS&lt;br /&gt;
#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), count the number of the events into NETWORKING&lt;br /&gt;
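The counting steps above can be sketched in Python as follows. This is a minimal illustration that assumes the event names for the date window have already been copied from the site; the function name count_events and the sample event names are hypothetical, and the keyword lists are taken from the steps above.&lt;br /&gt;

```python
WORKSHOP_KEYWORDS = ["training", "seminar", "class", "learn", "bootcamp", "workshop", "pitch event"]
NETWORKING_KEYWORDS = ["social", "meet up", "breakfast", "lunch", "happy hour", "movie night", "bowling"]


def count_events(event_names):
    """Count events per output column using case-insensitive keyword matching."""
    counts = {"ALL EVENTS": len(event_names), "OFFICE HOURS": 0,
              "SUMMITS": 0, "WORKSHOPS": 0, "NETWORKING": 0}
    for name in event_names:
        lowered = name.lower()
        if "office hours" in lowered:
            counts["OFFICE HOURS"] += 1
        if "summit" in lowered:
            counts["SUMMITS"] += 1
        if any(keyword in lowered for keyword in WORKSHOP_KEYWORDS):
            counts["WORKSHOPS"] += 1
        if any(keyword in lowered for keyword in NETWORKING_KEYWORDS):
            counts["NETWORKING"] += 1
    return counts


# Hypothetical event names for illustration only.
sample = ["Mentor Office Hours", "Intro to Java Workshop", "Founders Happy Hour",
          "Tech Summit", "Board Meeting"]
totals = count_events(sample)
print(totals)
```

An event can fall into more than one column, and events that match no keyword (such as the last sample name) are only counted in ALL EVENTS, so they would still need a manual look. The DNE case for sites without event information is not handled here.&lt;br /&gt;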
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Group 5===&lt;br /&gt;
#Multiple Locations&lt;br /&gt;
#*Addresses are included in Group 1, but still need to be discussed&lt;br /&gt;
#*Getting&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
See Section 3 of [[Hubs (Academic Paper)]]&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7385</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7385"/>
		<updated>2016-07-22T16:56:08Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Group 1 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#URL&lt;br /&gt;
#Address&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Founding Date&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that have complete accuracy even if not complete coverage (e.g. a task that covers only 40% of firms but, where it reports an onsite mentor, never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a session in which a group of people discuss or learn about specific subjects&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/21): G/Y: Founding date issues&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/21): Green: much progress has been made&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/21): Hannah working on this&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not same as we are&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared to the 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/20)''': Gunny has created a tool to do this process&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Address&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''': Code written.  Difficulties occur with very large companies (e.g. Impact Hub).  Will require Veeral's program, expected time for each assignment is 10-20 seconds - pay rate, therefore, recommended $.05&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Using Veeral's code, crossproduct allintext: (Group A) and site: (Group B), where '''Group A'''=Contact (high coverage), About Us, Find Us, Locations, Address, '''Group B'''= Company URLs.&lt;br /&gt;
#*#Click on first result.  If addresses exist, record in ADDRESS, STATE, and ZIP.&lt;br /&gt;
#*#If not, go to company's URL. If addresses exist, record in ADDRESS, STATE, and ZIP.&lt;br /&gt;
#*#If address exists, but ZIP does not, plug in address into search engine and record ZIP.&lt;br /&gt;
#*#Otherwise, record DNE.&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Given that most companies include their specialty in their mission statement and that this variable is difficult to turk, we will manually check each mission statement and mark it accordingly. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#NONE&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds due to a potential list of a lot of sponsors/partners - pay rate, therefore, recommended $.12. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose first result from Search Text 1 and Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If there does not exist a list or the link was for only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#*#Record all Partners from Search Text 2 into PARTNERS. If there does not exist a list or the link was for only 1 partner, record DNE.&lt;br /&gt;
#*#If any Partners from Search Text 2 include a University or College (will be listed in name), record them into UNIVERSITY PARTNERS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''':  Code 1 written, code 2 needs more work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company’s URL&lt;br /&gt;
#*#On the homepage, look for the section related to pricing. If pricing is not found on the homepage, look for the links ‘coworking’, ‘work space’, ‘membership’, ‘pricing’, ‘join’ or ‘Apply for membership’, and look for the pricing information under those links. If there is no price related section, record DNE for both ‘Flexible Desk’ and ‘Dedicated Desk’.&lt;br /&gt;
#*#If there is pricing information, look for the price of a shared space per month, often denoted as Shared/Flexible desk/non-dedicated desk, and record the price at ‘Flexible Desk’. If the price is not found, record DNE.&lt;br /&gt;
#*#Look for the price of a dedicated desk per month, often denoted as Reserved/dedicated desk/private desk, and record the price at ‘Dedicated Desk’.  If the price is not found, record DNE.&lt;br /&gt;
#*#If price information is not found and there is a ‘locations’ link, click on it and choose the first location in the list. Repeat steps 3-4. &lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Keywords: 24/7 access, dedicated desk, pricing&lt;br /&gt;
#*#Google allintext:&amp;quot;keywords&amp;quot; site:URL&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': To Be Discussed Further&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Difficulties observed when figuring out how to Turk this &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
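The cross product of search operators described in the Address step above can be sketched in Python as follows. This is a minimal illustration and not Veeral's program, which is not shown on this page: the function name cross_product_queries, the exact query format, and the two placeholder URLs are assumptions.&lt;br /&gt;

```python
from itertools import product

# Group A terms, copied from the Address step.
GROUP_A = ["Contact", "About Us", "Find Us", "Locations", "Address"]


def cross_product_queries(group_a, company_urls):
    """Pair every Group A term with every company URL as one search query."""
    return ['allintext:"{}" site:{}'.format(term, url)
            for term, url in product(group_a, company_urls)]


# Placeholder company URLs (Group B) for illustration only.
address_queries = cross_product_queries(GROUP_A, ["www.example.us", "www.example.org"])
for query in address_queries:
    print(query)
```

With 5 terms and 2 URLs this yields 10 queries, ordered term by term, so the high-coverage 'Contact' queries come first.&lt;br /&gt;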
&lt;br /&gt;
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click first link in which result search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the keywords are found, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record urlhome in PAGE.&lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring.&lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record the name of the link clicked in PAGE.&lt;br /&gt;
#*#If not, go to links related to membership, such as 'benefits' or 'perks', and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written - 2nd part, while more manual, appears to have greater range.  2nd code would only require Veeral's code.  1st code expected completion time is 30 seconds.&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company's URL&lt;br /&gt;
#*#Look for the link 'Accelerators' or 'Accelerating/Accelerator/Acceleration/Accelerate Programs'&lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number. **or also copy the names of the accelerators?&lt;br /&gt;
#*#If accelerators are not found in step 1, go to the links 'Services' , 'Benefit', 'Resources', 'For Entrepreneurs', 'Startups' and look for the section of 'Accelerator/Accelerating Programs' &lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number.&lt;br /&gt;
#*#If accelerators are not found, record 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search [allintitle:&amp;quot;accelerator&amp;quot; site:URL] in Google&lt;br /&gt;
#*#Copy the titles of the results. **We have to scrutinize the titles ourselves to determine whether they are distinct onsite accelerators and record the number manually.&lt;br /&gt;
#*#If no result appears, record 0.&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivates leadership in entrepreneurs&lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no results are returned.&lt;br /&gt;
#If there is a result, click the first result in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**The target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no results are returned.&lt;br /&gt;
#If there is a result, click the first result in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page or section must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is an investor, maybe by checking whether they are from a VC firm? But even if they are from a VC firm, that doesn't mean they are going to invest in the startups of the Hubs they mentor at.  Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot.&lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark ''office hours'' as 1 if there is a result, otherwise mark as 0.&lt;br /&gt;
#Click on the first five results&lt;br /&gt;
#On each of the five pages, search for two items:&lt;br /&gt;
##Search for 'mentor' (Ctrl + F). If 'mentor' appears in the description paragraph of office hours on any of the five pages, mark ''mentor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours from all five pages.&lt;br /&gt;
##Search for 'fund' (Ctrl + F). If 'fund' appears in the description paragraph of office hours on any of the five pages, mark ''investor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours from all five pages.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: a business issue (e.g. online marketing) or a technical skill (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
**Do we care about what particular workshops (e.g. coding, leadership, etc.)?&lt;br /&gt;
**Summits/major events&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet fellow entrepreneurs and experts and network with them&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 1 of 2''':&lt;br /&gt;
*#Search using Search Text 1 (allintext: events site: URL) and choose the link to &amp;quot;Events&amp;quot;, &amp;quot;Calendar&amp;quot;, or related.  Record 'url' in SOURCE.  If this does not exist, go to Step 7&lt;br /&gt;
*#For all events that have dates, copy the events from today's date to the following month into ALL EVENTS&lt;br /&gt;
*#For all events that have Office Hours in the name, record the events in OFFICE HOURS.  For all events that have Summit in the name, record the events in SUMMITS.&lt;br /&gt;
*#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), copy the name of the events into WORKSHOPS&lt;br /&gt;
*#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), copy the name of the events into NETWORKING.  For all events that are unclear or did not fit into these descriptions&lt;br /&gt;
*#If a message explicitly says there are no events, mark as 0 for ALL EVENTS, OFFICE HOURS, SUMMITS, WORKSHOPS, and NETWORKING&lt;br /&gt;
*#If this does not exist, search Search Text 2 (allintext: Company Name site: meetup.com) and click on the meetup.com for the company if it exists.  If it does exist, record meetup on SOURCE.  If not, go to step 9.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
*#If this does not exist, search Search Text 3 (allintext: Company Name site: eventbrite.com) and click on the eventbrite.com for the company if it exists.  If it does exist, record eventbrite on SOURCE.  If not, mark DNE for all variables.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 2 of 2''':&lt;br /&gt;
#Go to Company URL&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events from today's date through the following month and record it in ALL EVENTS. If there is no information on events or the dates of the events on the website, record DNE for all variables.&lt;br /&gt;
#For all events that have Office Hours in the name, record the count in OFFICE HOURS.  For all events that have Summit in the name, record the count in SUMMITS.&lt;br /&gt;
#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), record the count in WORKSHOPS&lt;br /&gt;
#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), record the count in NETWORKING&lt;br /&gt;
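The keyword sorting in the steps above can be sketched in code, which may help when auditing turk output against the rules. This is only an illustration: the keyword lists are copied from the steps, while the function names and the simple case-insensitive substring matching are assumptions (substring matching will, for example, also match 'class' inside 'classic').&lt;br /&gt;

```python
# Keyword lists copied from the steps above; matching is case-insensitive.
WORKSHOP_WORDS = ['training', 'seminar', 'class', 'learn', 'bootcamp',
                  'workshop', 'pitch event']
NETWORKING_WORDS = ['social', 'meet up', 'breakfast', 'lunch', 'happy hour',
                    'movie night', 'bowling']


def classify_event(name):
    # Returns the output columns an event name would be recorded in.
    lowered = name.lower()
    columns = ['ALL EVENTS']
    if 'office hours' in lowered:
        columns.append('OFFICE HOURS')
    if 'summit' in lowered:
        columns.append('SUMMITS')
    if any(word in lowered for word in WORKSHOP_WORDS):
        columns.append('WORKSHOPS')
    if any(word in lowered for word in NETWORKING_WORDS):
        columns.append('NETWORKING')
    return columns


def count_events(names):
    # Tallies events per column, as in the count-based version of the turk.
    counts = {'ALL EVENTS': 0, 'OFFICE HOURS': 0, 'SUMMITS': 0,
              'WORKSHOPS': 0, 'NETWORKING': 0}
    for name in names:
        for column in classify_event(name):
            counts[column] += 1
    return counts


if __name__ == '__main__':
    # Made-up event names, for illustration only.
    print(count_events(['Intro to Python Workshop', 'Mentor Office Hours',
                        'Founders Happy Hour']))
```

An event that matches no keyword list is counted only in ALL EVENTS, which leaves unclear cases for manual review.&lt;br /&gt;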
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Group 5===&lt;br /&gt;
#Multiple Locations&lt;br /&gt;
#*Addresses are included in Group 1, but still need to be discussed&lt;br /&gt;
#*Getting&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
See Section 3 of [[Hubs (Academic Paper)]]&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7384</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7384"/>
		<updated>2016-07-22T16:53:42Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Group 1 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#URL&lt;br /&gt;
#Address&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Founding Date&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but never misspecifies the existence of an onsite mentor), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a group discussion or learning session on a specific subject&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a procedure for automating Google searches over different lists&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/21): G/Y: Founding date issues&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/21): Green: much progress has been made&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/21): Hannah working on this&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not in the same way as we are&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/20)''': Gunny has created a tool to do this process&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Address&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''': Code written.  Difficulties occur with very large companies (e.g. Impact Hub).  Will require Veeral's program, expected time for each assignment is 10-20 seconds - pay rate, therefore, recommended $.05&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Using Veeral's code, crossproduct allintext: (Group A) and site: (Group B), where '''Group A'''=Contact (high coverage), About Us, Find Us, Locations, Address, '''Group B'''= Company URLs.&lt;br /&gt;
#*#Click on the first result.  If addresses exist, record in ADDRESS, STATE, and ZIP.&lt;br /&gt;
#*#If not, go to company's URL. If addresses exist, record in ADDRESS, STATE, and ZIP.&lt;br /&gt;
#*#If address exists, but ZIP does not, plug in address into search engine and record ZIP.&lt;br /&gt;
#*#Otherwise, record DNE.&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Given that most companies include their specialty in their mission statement and that this variable is difficult to turk, we will manually check each mission statement and mark it accordingly.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#NONE&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the keywords is found, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds due to a potential list of a lot of sponsors/partners - pay rate, therefore, recommended $.12. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose the first result from Search Text 1 and Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If there does not exist a list or the link was for only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#*#Record all Partners from Search Text 2 into PARTNERS. If there does not exist a list or the link was for only 1 partner, record DNE.&lt;br /&gt;
#*#If any Partners from Search Text 2 include a University or College (will be listed in name), record them into UNIVERSITY PARTNERS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/22)''':  Code 1 written, code 2 needs more work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company’s URL&lt;br /&gt;
#*#On the homepage, look for the section related to pricing. If pricing is not found in the homepage, look for the links ‘coworking’, ‘work space’, ‘membership’, ‘pricing’ ,‘join’ or ‘Apply for membership’ , and look for the pricing information under those links. If there is no price related section, record DNE for both ‘Flexible Desk’ and ‘Dedicated Desk’.&lt;br /&gt;
#*#If there is pricing information, look for the monthly price of a shared space, often denoted as Shared/Flexible desk/non-dedicated desk, and record the price in ‘Flexible Desk’. If the price is not found, record DNE.&lt;br /&gt;
#*#Look for the monthly price of a dedicated desk, often denoted as Reserved/dedicated desk/private desk, and record the price in ‘Dedicated Desk’.  If the price is not found, record DNE.&lt;br /&gt;
#*#If price information is not found and there is a ‘locations’ link, click on it and choose the first location in the list. Repeat steps 3-4.&lt;br /&gt;
&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Keywords: 24/7 access, dedicated desk, pricing&lt;br /&gt;
#*#Google allintext:&amp;quot;keywords&amp;quot; site:URL&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': To Be Discussed Further&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Difficulties observed when figuring out how to Turk this &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
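Several Group 1 steps rely on a search text of the form allintext:"keywords" site:URL. The following is a minimal sketch of how such search texts could be generated, assuming one query per keyword; the function name build_search_text and its parameters are hypothetical and not part of the existing tooling.&lt;br /&gt;

```python
def build_search_text(keywords, url, operator="allintext"):
    """Return one search string per keyword, e.g. allintext:"pricing" site:example.com.

    Assumption: each keyword is searched separately and quoted as an exact phrase.
    """
    # Drop surrounding whitespace and any trailing slash so site: gets a bare address.
    site = url.strip().rstrip("/")
    return ['{}:"{}" site:{}'.format(operator, keyword.strip(), site) for keyword in keywords]


if __name__ == "__main__":
    for text in build_search_text(["24/7 access", "dedicated desk", "pricing"], "example.com/"):
        print(text)
```

Passing operator="allintitle" would give the form used in the Onsite Accelerator search.&lt;br /&gt;
&lt;br /&gt;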
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
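The Size steps above (record the sentence in which the search text appears, or DNE) could be approximated in code once the text of the result page is available. This is only a sketch: the term list is taken from the brainstorm note, while the function name first_size_sentence and the naive sentence splitting are assumptions.&lt;br /&gt;

```python
import re

# Size terms from the brainstorm note; "sq ft" is an added assumption.
SIZE_TERMS = ("sqft", "sq ft", "square foot", "square feet")


def first_size_sentence(page_text):
    """Return the first sentence that mentions a size term, or "DNE" if none does."""
    # Naive split: a sentence ends at ., ! or ? followed by whitespace.
    sentences = re.split(r"[.!?]\s+", page_text.strip())
    for sentence in sentences:
        lowered = sentence.lower()
        if any(term in lowered for term in SIZE_TERMS):
            return sentence.strip()
    return "DNE"


if __name__ == "__main__":
    sample = "We opened in 2012. Our campus spans 50,000 square feet downtown. Join us!"
    print(first_size_sentence(sample))
```

The split drops the punctuation that ends each sentence, which is acceptable for a field that is reviewed by hand afterwards.&lt;br /&gt;
&lt;br /&gt;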
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership, such as 'benefits,' 'perks,' or similar, and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written - 2nd part, while more manual, appears to have greater range.  2nd code would only require Veeral's code.  1st code expected completion time is 30 seconds.&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company's URL&lt;br /&gt;
#*#Look for the link 'Accelerators' or 'Accelerating/Accelerator/Acceleration/Accelerate Programs'&lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number. **or also copy the names of the accelerators?&lt;br /&gt;
#*#If accelerators are not found in step 1, go to the links 'Services' , 'Benefit', 'Resources', 'For Entrepreneurs', 'Startups' and look for the section of 'Accelerator/Accelerating Programs' &lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number.&lt;br /&gt;
#*#If accelerators are not found, record 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search [allintitle:&amp;quot;accelerator&amp;quot; site:URL] in Google&lt;br /&gt;
#*#Copy the titles of the results. **We have to scrutinize the titles ourselves to determine whether they are distinct onsite accelerators and record the number manually.&lt;br /&gt;
#*#If no result appears, record 0.&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page/section must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is also an investor, maybe by checking whether they are from a VC firm? But even if they are from a VC firm, that doesn't mean they will invest in the startups of the Hubs they mentor at.  Another way to think about it is differentiating between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early stage or growth stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark ''office hours'' as 1 if there is a result, otherwise mark as 0.&lt;br /&gt;
#Click on the first five results&lt;br /&gt;
#On each of the five pages, search for two items:&lt;br /&gt;
##search for 'mentor'. (Ctrl + F) If 'mentor' appears in the description paragraph of office hours on any of the five pages, mark ''mentor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
##search for 'fund'. (Ctrl + F) If 'fund' appears in the description paragraph of office hours on any of the five pages, mark ''investor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
&lt;br /&gt;
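The office-hours coding above can be sketched as a small function over the description paragraphs copied from the result pages. The function name code_office_hours and the dictionary layout are hypothetical; returning DNE for the mentor and investor fields when there is no search result at all is an assumption, since the steps do not cover that case.&lt;br /&gt;

```python
def code_office_hours(descriptions):
    """Code office hours from the description paragraphs of up to five result pages.

    descriptions: list of strings, one per result page; empty if the search returned nothing.
    """
    if not descriptions:
        # Assumption: with no result, the two follow-up fields are recorded as DNE.
        return {"office hours": 0, "mentor OH": "DNE", "investor OH": "DNE"}
    # Only the first five results are inspected, matching the manual steps.
    text = " ".join(descriptions[:5]).lower()
    return {
        "office hours": 1,
        # Substring match, like Ctrl + F: 'fund' also matches 'funding' or 'funds'.
        "mentor OH": 1 if "mentor" in text else "DNE",
        "investor OH": 1 if "fund" in text else "DNE",
    }


if __name__ == "__main__":
    pages = ["Weekly office hours with our mentors.", "Meet partners from a seed fund."]
    print(code_office_hours(pages))
```

When a field comes back as DNE, the manual step of copying the description paragraphs for review still applies.&lt;br /&gt;
&lt;br /&gt;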
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: a business issue (e.g. online marketing) or a technical skill (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
**Do we care about what particular workshops (e.g. coding, leadership, etc.)?&lt;br /&gt;
**Summits/major events&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet fellow entrepreneurs and experts and network with them&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 1 of 2''':&lt;br /&gt;
*#Search the Search Text 1 (allintext: events site: URL) and choose link to &amp;quot;Events&amp;quot;, &amp;quot;Calendar&amp;quot;, or related.  Record 'url' on SOURCE.  If this does not exist, go to Step 7.&lt;br /&gt;
*#For all events that have dates, copy the events from today's date to the following month into ALL EVENTS&lt;br /&gt;
*#For all events that have Office Hours in the name, record the events in OFFICE HOURS.  For all events that have summit, record the events in SUMMITS.&lt;br /&gt;
*#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), copy the name of the events into WORKSHOPS&lt;br /&gt;
*#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), copy the name of the events into NETWORKING.  For all events that are unclear or did not fit into these descriptions&lt;br /&gt;
*#If a message explicitly says there are no events, mark as 0 for ALL EVENTS, OFFICE HOURS, SUMMITS, WORKSHOPS, and NETWORKING &lt;br /&gt;
*#If this does not exist, search Search Text 2 (allintext: Company Name site: meetup.com) and click on the meetup.com for the company if it exists.  If it does exist, record meetup on SOURCE.  If not, go to step 9.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
*#If this does not exist, search Search Text 3 (allintext: Company Name site: eventbrite.com) and click on the eventbrite.com for the company if it exists.  If it does exist, record eventbrite on SOURCE.  If not, mark DNE for all variables.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 2 of 2''':&lt;br /&gt;
#Go to Company URL&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events from today's date through the following month and record it in ALL EVENTS. If there is no information on events or event dates on the website, record DNE for all variables.&lt;br /&gt;
#For all events that have Office Hours in the name, count the number of events in OFFICE HOURS.  For all events that have summit, count the number of the events in SUMMITS.&lt;br /&gt;
#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), record the number of events in WORKSHOPS&lt;br /&gt;
#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), record the number of events in NETWORKING&lt;br /&gt;
&lt;br /&gt;
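The counting in Turk for Both 2 of 2 can be sketched as keyword matching over the event names. The keyword lists below come from the steps above; the function name count_events is hypothetical, and letting one event count toward more than one category is an assumption.&lt;br /&gt;

```python
# Keywords taken from the teaching/learning and social/networking examples in the steps.
WORKSHOP_WORDS = ("training", "seminar", "class", "learn", "bootcamp", "workshop", "pitch event")
NETWORKING_WORDS = ("social", "meet up", "breakfast", "lunch", "happy hour", "movie night", "bowling")


def count_events(event_names):
    """Count events per category from a list of event names in the date window."""
    counts = {"ALL EVENTS": 0, "OFFICE HOURS": 0, "SUMMITS": 0, "WORKSHOPS": 0, "NETWORKING": 0}
    for name in event_names:
        lowered = name.lower()
        counts["ALL EVENTS"] += 1
        if "office hours" in lowered:
            counts["OFFICE HOURS"] += 1
        if "summit" in lowered:
            counts["SUMMITS"] += 1
        # Assumption: categories are not exclusive, so an event may be counted twice.
        if any(word in lowered for word in WORKSHOP_WORDS):
            counts["WORKSHOPS"] += 1
        if any(word in lowered for word in NETWORKING_WORDS):
            counts["NETWORKING"] += 1
    return counts


if __name__ == "__main__":
    events = ["Mentor Office Hours", "Intro to Java Workshop", "Founders Happy Hour"]
    print(count_events(events))
```

Events that match no keyword only add to ALL EVENTS, so the gap between ALL EVENTS and the other counts shows how many names still need a manual look.&lt;br /&gt;
&lt;br /&gt;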
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===Group 5===&lt;br /&gt;
#Multiple Locations&lt;br /&gt;
#*Addresses are included in Group 1, but still needs to be discussed&lt;br /&gt;
#*Getting&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
See Section 3 of [[Hubs (Academic Paper)]]&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7361</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7361"/>
		<updated>2016-07-21T21:10:20Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Group 3 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these is as follows: determine the description; develop the characteristics after looking over examples; create possible mechanical turk tasks that are fully accurate even if not comprehensive (e.g. a task that identifies an onsite mentor for only 40% of firms but never misspecifies the existence of mentors); and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a group discussion or learning session on a specific subject&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/21): G/Y: Founding date issues&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/21): Green: much progress has been made&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/21): Hannah working on this&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not the same measure as ours&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared to the 30 done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/20)''': Gunny has created a tool to do this process&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Given that most companies include their specialty in mission statement and difficulty to turk, we will manually check each mission statement and mark it accordingly. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#NONE&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds due to a potential list of a lot of sponsors/partners - pay rate, therefore, recommended $.12. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose first result from Search Text 1 and Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If there does not exist a list or the link was for only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#*#Record all Partners from Search Text 2 into PARTNERS. If there does not exist a list or the link was for only 1 partner, record DNE.&lt;br /&gt;
#*#If any Partners from Search Text 2 include a University or College (will be listed in name), record them into UNIVERSITY PARTNERS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
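The keyword check in CODE 1 of 2 for Nonprofit can be sketched in Python. This is a minimal sketch, not the procedure actually run: the function name code_nonprofit, the sentence-splitting rule, and the sample text are assumptions made for illustration. Every listed key word contains 'profit', so one case-insensitive match on that string covers all of them; it will also match 'for-profit', which is one reason the results still need to be double checked.&lt;br /&gt;
&lt;br /&gt;
```python
import re

# All listed key words ('profit', 'nonprofit', 'non-profit', 'not-for-profit')
# contain the string 'profit', so one case-insensitive pattern is enough.
KEYWORD = re.compile(r"profit", re.IGNORECASE)


def code_nonprofit(page_text):
    """Return EXISTS (1/0) and the SENTENCES that contain a key word.

    Sentence splitting on '.', '!' and '?' is a simplifying assumption.
    """
    sentences = [s.strip() for s in re.findall(r"[^.!?]+[.!?]?", page_text)]
    hits = [s for s in sentences if KEYWORD.search(s)]
    return {"EXISTS": 1 if hits else 0, "SENTENCES": hits}


# Hypothetical 'About' page text, for illustration only.
sample = "We are a non-profit organization. Join us today!"
print(code_nonprofit(sample))
```
&lt;br /&gt;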
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence containing it in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record the results in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence containing the mention into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company's URL&lt;br /&gt;
#*#Look for the link 'Accelerators' or 'Accelerating/Accelerator/Acceleration/Accelerate Programs'&lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number. **or also copy the names of the accelerators?&lt;br /&gt;
#*#If accelerators are not found in step 1, go to the links 'Services' , 'Benefit', 'Resources', 'For Entrepreneurs', 'Startups' and look for the section of 'Accelerator/Accelerating Programs' &lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number.&lt;br /&gt;
#*#If accelerators are not found, record 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search [allintitle:&amp;quot;accelerator&amp;quot; site:URL] in Google&lt;br /&gt;
#*#Copy the titles of the results. **We have to scrutinize the titles ourselves to determine whether they are distinct onsite accelerators and record the number manually.&lt;br /&gt;
#*#If no result appears, record 0.&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**The target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
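The Google step shared by the Curriculum and Code School turks can be sketched in Python. This is a rough sketch under stated assumptions: the helper name build_site_queries is hypothetical, example.com stands in for a Company URL, and treating each comma-separated entry of the Search Text as its own quoted query is one reading of the steps above, not a confirmed part of the procedure.&lt;br /&gt;
&lt;br /&gt;
```python
def build_site_queries(search_terms, site_url):
    """Return one quoted, site-restricted query string per search term."""
    # Drop the scheme and any trailing slash so that the site: operator
    # receives a bare host (assumption about how URL is stored).
    site = site_url.strip()
    for prefix in ("https://", "http://"):
        if site.startswith(prefix):
            site = site[len(prefix):]
    site = site.rstrip("/")
    return ['"{}" site:{}'.format(term.strip(), site) for term in search_terms]


# Search Text 1 for Code School, as listed above.
SEARCH_TEXT_1 = ["website design", "coding", "web development", "software", "bootcamp"]

for query in build_site_queries(SEARCH_TEXT_1, "http://www.example.com/"):
    print(query)
```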
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page/section must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is also an investor, perhaps by checking whether they are from a VC firm. But even if they are from a VC firm, that does not mean they will invest in the startups of the Hubs where they mentor.  Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark ''office hours'' as 1 if there is a result, otherwise mark as 0.&lt;br /&gt;
#Click on the first five results&lt;br /&gt;
#On each of the five pages, search for two items:&lt;br /&gt;
##Search for 'mentor' (Ctrl + F). If 'mentor' appears in the description paragraph of office hours on any of the five pages, mark ''mentor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours from all five pages.&lt;br /&gt;
##Search for 'fund' (Ctrl + F). If 'fund' appears in the description paragraph of office hours on any of the five pages, mark ''investor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours from all five pages.&lt;br /&gt;
&lt;br /&gt;
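The office-hours coding above can be sketched in Python. This is an illustrative sketch only: the function name code_office_hours and the sample paragraphs are assumptions, and the steps do not say what to record for ''mentor OH'' and ''investor OH'' when the search returns nothing, so None is used there as a placeholder. Note that a plain search for 'fund' also matches words such as 'funding' and 'fundamental'.&lt;br /&gt;
&lt;br /&gt;
```python
def code_office_hours(descriptions):
    """Code the office-hours variables from description paragraphs.

    descriptions: the office-hours description paragraph from each of the
    (up to five) result pages; an empty list means the search had no result.
    """
    if not descriptions:
        # The steps only define 'office hours' = 0 for this case; leaving
        # the other two fields as None is an assumption.
        return {"office hours": 0, "mentor OH": None, "investor OH": None}
    lowered = [text.lower() for text in descriptions]
    has_mentor = any("mentor" in text for text in lowered)
    has_fund = any("fund" in text for text in lowered)
    return {
        "office hours": 1,
        "mentor OH": 1 if has_mentor else "DNE",
        "investor OH": 1 if has_fund else "DNE",
    }


# Hypothetical description paragraphs, for illustration only.
sample = [
    "Book office hours with a mentor every Tuesday.",
    "Open office hours for members.",
]
print(code_office_hours(sample))
```
&lt;br /&gt;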
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: a business issue (e.g. online marketing) or learning a technique (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
**Do we care about what particular workshops (e.g. coding, leadership, etc.)?&lt;br /&gt;
**Summits/major events&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet fellow entrepreneurs and experts and network with them&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 1 of 2''':&lt;br /&gt;
*#Search the Search Text 1 (allintext: events site: URL) and choose the link to &amp;quot;Events&amp;quot;, &amp;quot;Calendar&amp;quot;, or related.  Record 'url' in SOURCE.  If this does not exist, go to Step 7.&lt;br /&gt;
*#For all events that have dates, copy the events from today's date to the following month into ALL EVENTS&lt;br /&gt;
*#For all events that have Office Hours in the name, record the events in OFFICE HOURS.  For all events that have Summit in the name, record the events in SUMMITS.&lt;br /&gt;
*#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), copy the name of the events into WORKSHOPS&lt;br /&gt;
*#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), copy the name of the events into NETWORKING.  For all events that are unclear or did not fit into these descriptions&lt;br /&gt;
*#If a message explicitly says there are no events, mark as 0 for ALL EVENTS, OFFICE HOURS, SUMMITS, WORKSHOPS, and NETWORKING &lt;br /&gt;
*#If this does not exist, search Search Text 2 (allintext: Company Name site: meetup.com) and click on the meetup.com for the company if it exists.  If it does exist, record meetup on SOURCE.  If not, go to step 9.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
*#If this does not exist, search Search Text 3 (allintext: Company Name site: eventbrite.com) and click on the eventbrite.com for the company if it exists.  If it does exist, record eventbrite on SOURCE.  If not, mark DNE for all variables.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 2 of 2''':&lt;br /&gt;
#Go to Company URL&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events from today's date to the following month and record it in ALL EVENTS. If there is no information about events or their dates on the website, record DNE for all variables.&lt;br /&gt;
#For all events that have Office Hours in the name, count the number of events in OFFICE HOURS.  For all events that have Summit in the name, count the number of the events in SUMMITS.&lt;br /&gt;
#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), count the number of the events into WORKSHOPS&lt;br /&gt;
#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), count the number of the events into NETWORKING&lt;br /&gt;
&lt;br /&gt;
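The keyword rules used by both event turks can be sketched in Python. This is a minimal sketch under assumptions: the names classify_event and count_events, the UNCLEAR bucket, and the order in which the rules are tried (office hours, then summit, then workshop words, then networking words) are illustrative choices that the steps above do not fix. Matching is by substring, so a word such as 'class' would also match 'classic'.&lt;br /&gt;
&lt;br /&gt;
```python
# Keyword lists taken from the steps above, lower-cased for matching.
WORKSHOP_WORDS = ("training", "seminar", "class", "learn", "bootcamp",
                  "workshop", "pitch event")
NETWORKING_WORDS = ("social", "meet up", "meetup", "breakfast", "lunch",
                    "happy hour", "movie night", "bowling")


def classify_event(name):
    """Assign one event name to a single bucket (rule order is an assumption)."""
    lowered = name.lower()
    if "office hours" in lowered:
        return "OFFICE HOURS"
    if "summit" in lowered:
        return "SUMMITS"
    if any(word in lowered for word in WORKSHOP_WORDS):
        return "WORKSHOPS"
    if any(word in lowered for word in NETWORKING_WORDS):
        return "NETWORKING"
    return "UNCLEAR"


def count_events(names):
    """Count events per bucket, as in Turk for Both 2 of 2."""
    counts = {"ALL EVENTS": len(names), "OFFICE HOURS": 0, "SUMMITS": 0,
              "WORKSHOPS": 0, "NETWORKING": 0, "UNCLEAR": 0}
    for name in names:
        counts[classify_event(name)] += 1
    return counts


# Hypothetical event names, for illustration only.
events = ["Mentor Office Hours", "Intro to JavaScript Workshop",
          "Founder Happy Hour", "Demo Day"]
print(count_events(events))
```
&lt;br /&gt;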
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7360</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7360"/>
		<updated>2016-07-21T21:09:24Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Group 3 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that have complete accuracy even if not comprehensive coverage (e.g. a task that covers only 40% of firms but never misspecifies the existence of onsite mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a group discussion or learning session on a specific subject&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a procedure that automates Google searches for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/21): G/Y: Founding date issues&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/21): Green: much progress has been made&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/21): Hannah working on this&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status (7/21): NS&lt;br /&gt;
#*Begin Date: TBD&lt;br /&gt;
#*Reach Goal: TBD&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not in the same way as we do&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared against 30 companies that were coded manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/20)''': Gunny has created a tool to do this process&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the observations when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Given that most companies include their specialty in their mission statement and that this variable is difficult to turk, we will manually check each mission statement and mark it accordingly. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#NONE&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to the links (sometimes these will be sections of the URL page) that describe the company; they are usually labelled 'About', 'Our Story', or 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is found, record 1 for EXISTS; otherwise record 0.&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds because the list of sponsors/partners may be long; the recommended pay rate is therefore $.12.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose the first result for Search Text 1 and for Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If no list exists or the link covers only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#*#Record all Partners from Search Text 2 into PARTNERS. If no list exists or the link covers only 1 partner, record DNE.&lt;br /&gt;
#*#If any Partners from Search Text 2 include a University or College (will be listed in name), record them into UNIVERSITY PARTNERS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click first link in which result search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
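The three Search Texts for Size can be generated from a company URL rather than typed by hand. The following is a minimal sketch, assuming the terms from the brainstorm above; the function name size_search_texts and the quoting of each term are assumptions, not the procedure Veeral's code uses.&lt;br /&gt;

```python
# Terms taken from the brainstorm above: sqft / square foot / square feet.
SIZE_TERMS = ["sqft", "square foot", "square feet"]


def size_search_texts(company_url):
    """Build one search string per size term (hypothetical helper).

    Quoting the term so that multi-word terms are matched as a phrase
    is an assumption; the brainstorm does not specify it.
    """
    site = company_url.strip().rstrip("/")
    return ['allintext:"{}" site:{}'.format(term, site) for term in SIZE_TERMS]


if __name__ == "__main__":
    # www.example.us is a placeholder URL, not one of the hubs.
    for text in size_search_texts("www.example.us/"):
        print(text)
```

Each returned string corresponds to Search Text 1, 2, and 3 in the steps above, in that order.&lt;br /&gt;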
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record  1  in BINARY, copy the sentence it is included in SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1  in BINARY, copy the sentence it is included in SENTENCE, and record the link name clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to company's URL&lt;br /&gt;
#*#Look for the link 'Accelerators' or 'Accelerating/Accelerator/Acceleration/Accelerate Programs'&lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number. **or also copy the names of the accelerator?&lt;br /&gt;
#*#If accelerators are not found in step 1, go to the links 'Services' , 'Benefit', 'Resources', 'For Entrepreneurs', 'Startups' and look for the section of 'Accelerator/Accelerating Programs' &lt;br /&gt;
#*#If accelerators are found, count the number of accelerators/accelerating programs and record the number.&lt;br /&gt;
#*#If accelerators are not found, record 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Search allintitle:&amp;quot;accelerator&amp;quot; site:URL in Google&lt;br /&gt;
#*#Copy the titles of the results. **We have to scrutinize the titles ourselves to determine whether they are distinct onsite accelerators and record the number manually.&lt;br /&gt;
#*#If no result appears, record 0.&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page/section must all be mentors, and the same applies for investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is an investor, maybe by checking whether they are from a VC firm?? But even if they are from a VC firm, that doesn't mean they are going to invest in the startups of the Hubs they are mentoring at.  Another way to think about it is differentiating between mentors and OH mentors. Mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early stage or growth stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark ''office hours'' as 1 if there is a result, otherwise mark as 0.&lt;br /&gt;
#Click on the first five results&lt;br /&gt;
#On each of the five pages, search for two items:&lt;br /&gt;
##search for 'mentor'. (Ctrl + F) If 'mentor' appears in the description paragraph of office hours on any of the five pages, mark ''mentor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
##search for 'fund'. (Ctrl + F) If 'fund' appears in the description paragraph of office hours on any of the five pages, mark ''investor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
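The Ctrl + F checks above can be sketched as a small function. This is an illustration only, assuming the office-hours description paragraphs have already been copied from the result pages; the function name office_hours_flags and the choice to return DNE for both variables when there are no results are assumptions, since the steps do not cover that case.&lt;br /&gt;

```python
def office_hours_flags(descriptions):
    """Apply the 'mentor' / 'fund' checks to office-hours descriptions.

    descriptions: list of description paragraphs, one per result page.
    Only the first five are used, matching the steps above.
    """
    pages = descriptions[:5]
    if not pages:
        # Assumption: with no search result, both OH variables are DNE.
        return {"office hours": 0, "mentor OH": "DNE", "investor OH": "DNE"}
    text = " ".join(pages).lower()
    return {
        "office hours": 1,
        "mentor OH": 1 if "mentor" in text else "DNE",
        "investor OH": 1 if "fund" in text else "DNE",
    }


if __name__ == "__main__":
    # Made-up description paragraph, for illustration only.
    print(office_hours_flags(["Sign up for weekly mentor office hours."]))
```

Because the check is a plain substring match, 'fund' also matches words such as 'funding' or 'fundamentals', so the copied paragraphs still need a manual look.&lt;br /&gt;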
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: business issue (e.g. online marketing) or learning a technique (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: tech meetup is usually a workshop(e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
**Do we care about what particular workshops (e.g. coding, leadership, etc.)?&lt;br /&gt;
**Summits/major events&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet and network with fellow entrepreneurs and experts&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
**See Turk for Both Below&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 1 of 2''':&lt;br /&gt;
*#Search the Search Text 1 (allintext: events site: URL) and choose link to &amp;quot;Events&amp;quot;, &amp;quot;Calendar&amp;quot;, or related.  Record 'url' on SOURCE.  If this does not exist, go to Step 7&lt;br /&gt;
*#For all events that have dates, copy the events from today's date to the following month into ALL EVENTS&lt;br /&gt;
*#For all events that have Office Hours in the name, record the events in OFFICE HOURS.  For all events that have summit, record the events in SUMMITS.&lt;br /&gt;
*#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), copy the name of the events into WORKSHOPS&lt;br /&gt;
*#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), copy the name of the events into NETWORKING.  For all events that are unclear or did not fit into these descriptions&lt;br /&gt;
*#If a message explicitly says there are no events, mark as 0 for ALL EVENTS, OFFICE HOURS, SUMMITS, WORKSHOPS, and NETWORKING &lt;br /&gt;
*#If this does not exist, search Search Text 2 (allintext: Company Name site: meetup.com) and click on the meetup.com for the company if it exists.  If it does exist, record meetup on SOURCE.  If not, go to step 9.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
*#If this does not exist, search Search Text 3 (allintext: Company Name site: eventbrite.com) and click on the eventbrite.com for the company if it exists.  If it does exist, record eventbrite on SOURCE.  If not, mark DNE for all variables.&lt;br /&gt;
*#Repeat Steps 2-6.&lt;br /&gt;
&lt;br /&gt;
*'''Turk for Both 2 of 2''':&lt;br /&gt;
#Go to Company URL&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events from today's date to the following month and record it in ALL EVENTS. If there is no information on events or their dates on the website, record DNE for all variables.&lt;br /&gt;
#For all events that have Office Hours in the name, count the number of events in OFFICE HOURS.  For all events that have summit, count the number of the events in SUMMITS.&lt;br /&gt;
#For all events that are related to teaching or learning (e.g. contain &amp;quot;Training,&amp;quot; &amp;quot;Seminar,&amp;quot; &amp;quot;Class,&amp;quot; &amp;quot;Learn,&amp;quot; &amp;quot;Bootcamp,&amp;quot; &amp;quot;Workshop,&amp;quot; &amp;quot;Pitch Event&amp;quot;), count the number of the events into WORKSHOPS&lt;br /&gt;
#For all events that are related to social activities and networking (e.g. &amp;quot;Social,&amp;quot; &amp;quot;Meet Up,&amp;quot; &amp;quot;Breakfast&amp;quot;/&amp;quot;Lunch&amp;quot;/&amp;quot;Happy Hour&amp;quot;, &amp;quot;Movie Night&amp;quot;/&amp;quot;Bowling&amp;quot;), count the number of the events into NETWORKING&lt;br /&gt;
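The keyword rules in the two turks above can be sketched as a simple classifier, which may help when auditing turker output. This is a rough sketch under stated assumptions: the keyword lists come from the examples in the steps, 'meetup' as one word is added as an assumption, the order in which the rules are tried is an assumption, and UNCLEAR is a hypothetical bucket for events that match nothing.&lt;br /&gt;

```python
# Keywords taken from the examples in the steps above.
WORKSHOP_WORDS = ["training", "seminar", "class", "learn", "bootcamp",
                  "workshop", "pitch event"]
# "meetup" (one word) is an added assumption; the steps list "Meet Up".
NETWORKING_WORDS = ["social", "meet up", "meetup", "breakfast", "lunch",
                    "happy hour", "movie night", "bowling"]


def classify_event(name):
    """Return the output variable an event name falls under."""
    lowered = name.lower()
    if "office hours" in lowered:
        return "OFFICE HOURS"
    if "summit" in lowered:
        return "SUMMITS"
    if any(word in lowered for word in WORKSHOP_WORDS):
        return "WORKSHOPS"
    if any(word in lowered for word in NETWORKING_WORDS):
        return "NETWORKING"
    return "UNCLEAR"  # hypothetical bucket, not a variable in the turk


def count_events(names):
    """Count events per variable, as in Turk for Both 2 of 2."""
    counts = {"ALL EVENTS": len(names), "OFFICE HOURS": 0, "SUMMITS": 0,
              "WORKSHOPS": 0, "NETWORKING": 0, "UNCLEAR": 0}
    for name in names:
        counts[classify_event(name)] += 1
    return counts


if __name__ == "__main__":
    # Made-up event names, for illustration only.
    print(count_events(["Intro to Java Workshop", "Founder Happy Hour"]))
```

Substring matching is deliberately loose (for example, 'lunch' also matches 'luncheon'), so anything landing in UNCLEAR, and a sample of the rest, would still need a manual check.&lt;br /&gt;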
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7335</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7335"/>
		<updated>2016-07-21T17:09:14Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD parts. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that have complete accuracy even if they are not comprehensive (e.g. a task that covers only 40% of firms but always guarantees that there is an onsite mentor, and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
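The accuracy-versus-coverage idea in the paragraph above can be checked during auditing with a short calculation. The following is a minimal sketch, assuming turk results and manually verified results are both recorded as 1/0 per hub; the function name and the hub labels A to E are hypothetical, and reading 'covers 40% of firms' as the share of true cases the turk finds is an interpretation.&lt;br /&gt;

```python
def precision_and_coverage(turk_marks, true_marks):
    """Compare turk marks with manually verified marks.

    Both arguments map a hub name to 1 (feature present) or 0.
    precision: share of hubs the turk marked 1 that really are 1.
    coverage:  share of hubs that really are 1 that the turk marked 1.
    """
    flagged = [hub for hub, mark in turk_marks.items() if mark == 1]
    actual = [hub for hub, mark in true_marks.items() if mark == 1]
    correct = [hub for hub in flagged if true_marks.get(hub) == 1]
    precision = len(correct) / len(flagged) if flagged else None
    coverage = len(correct) / len(actual) if actual else None
    return precision, coverage


if __name__ == "__main__":
    # Hypothetical audit: five hubs all have mentors, the turk finds two.
    turk = {"A": 1, "B": 0, "C": 0, "D": 0, "E": 1}
    truth = {"A": 1, "B": 1, "C": 1, "D": 1, "E": 1}
    print(precision_and_coverage(turk, truth))
```

In this made-up audit the turk never marks a hub incorrectly (precision 1.0) but finds only two of the five hubs with mentors (coverage 0.4), which is the kind of task the paragraph describes as acceptable.&lt;br /&gt;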
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''':a discussion/learning of a group of people on specific subjects&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not same as we are&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared to 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, but at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost to either record the number of tweets in a month or look up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in data observations between using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Given that most companies include their specialty in their mission statement and that this variable is difficult to turk, we will manually check each mission statement and mark it accordingly. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#NONE&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is found, record 1 for EXISTS (1/0); otherwise record 0.&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds due to a potential list of a lot of sponsors/partners - pay rate, therefore, recommended $.12. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose first result from Search Text 1 and Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If there does not exist a list or the link was for only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#*#Record all Partners from Search Text 2 into PARTNERS. If there does not exist a list or the link was for only 1 partner, record DNE.&lt;br /&gt;
#*#If any Partners from Search Text 2 include a University or College (will be listed in name), record them into UNIVERSITY PARTNERS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
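The three Search Texts for Size could be generated per company along the following lines. This is only a sketch: the helper name build_size_queries, the quoting of each phrase, and the example URL are assumptions for illustration, not the wording used in the actual task.&lt;br /&gt;

```python
def build_size_queries(company_url):
    """Return three site-restricted queries for the Size variable.

    The phrases come from the BRAINSTORM note; the function name and the
    exact query layout are assumptions made for illustration.
    """
    # Drop the scheme and any trailing slash so site: gets a clean value.
    site = company_url.split("://")[-1].strip("/")
    phrases = ["sqft", "square foot", "square feet"]
    return ['allintext:"{}" site:{}'.format(p, site) for p in phrases]


# Hypothetical company URL, used only to show the output shape.
for query in build_size_queries("http://www.example.us/"):
    print(query)
```

Running the queries in bulk before publishing the task would allow companies with 0 results to be marked DNE without paying for a turk assignment.&lt;br /&gt;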
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record the link name clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page/section must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether the mentor is an investor, maybe by checking whether they are from a VC firm?? But even if they are from a VC firm, that doesn't mean they are going to invest in the startups of the Hubs where they mentor.  Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing on early stage or growth stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark ''office hours'' as 1 if there is a result, otherwise mark as 0.&lt;br /&gt;
#Click on the first five results&lt;br /&gt;
#On each of the five pages, search for two items:&lt;br /&gt;
##search for 'mentor'. (Ctrl + F) If 'mentor' appears in the description paragraph of office hours on any of the five pages, mark ''mentor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
##search for 'fund'. (Ctrl + F) If 'fund' appears in the description paragraph of office hours on any of the five pages, mark ''investor OH'' as 1. Otherwise mark as DNE and copy the description paragraph of office hours of all five pages.&lt;br /&gt;
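The keyword checks in the steps above can be sketched as follows. This is a hypothetical helper, not part of the published task: the function name, the dictionary keys, and the convention that an empty list stands for a search with no results are all assumptions.&lt;br /&gt;

```python
def mark_office_hours(descriptions):
    """Apply the 'mentor' and 'fund' checks to office-hours descriptions.

    descriptions: office-hours description paragraphs copied from up to
    five result pages; an empty list stands for a search with no results.
    The keys of the returned dict mirror the output names in the steps.
    """
    text = " ".join(descriptions).lower()
    result = {"office hours": 1 if descriptions else 0}
    for column, keyword in (("mentor OH", "mentor"), ("investor OH", "fund")):
        # Substring match, like Ctrl + F: 'fund' also matches 'funding'.
        result[column] = 1 if keyword in text else "DNE"
    return result


print(mark_office_hours(["Meet a mentor every Friday.", "Open desk time."]))
```

Because the match is a plain substring test, 'fund' also hits words such as 'refund', so rows marked 1 for investor OH may still need a manual check.&lt;br /&gt;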
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: a business issue (e.g. online marketing) or learning a technique (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet fellow entrepreneurs and experts and network with them&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7326</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7326"/>
		<updated>2016-07-21T16:48:39Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD parts. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but never misspecifies the existence of an onsite mentor), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a group discussion or learning session on a specific subject&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not the same measure we are using&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared to the 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Given that most companies include their specialty in their mission statement and that this variable is difficult to turk, we will manually check each mission statement and mark it accordingly. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#NONE&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/21)''': Code written, but may require additional manual work. Expected time to complete is 45 seconds because the list of sponsors/partners can be long; the recommended pay rate is therefore $.12. &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Choose first result from Search Text 1 and Search Text 2 (allintext: Sponsors/Partners site:URL)&lt;br /&gt;
#*#Record all Sponsors from Search Text 1 into SPONSORS.  If there does not exist a list or the link was for only 1 sponsor, record DNE.&lt;br /&gt;
#*#If any Sponsors from Search Text 1 include a University or College (will be listed in name), record them into UNIVERSITY SPONSORS&lt;br /&gt;
#*#Record all Partners from Search Text 2 into PARTNERS. If there does not exist a list or the link was for only 1 partner, record DNE.&lt;br /&gt;
#*#If any Partners from Search Text 2 include a University or College (will be listed in name), record them into UNIVERSITY PARTNERS&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
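The www.___.__/ format asked for in the URL task above could be checked or produced automatically with something like the following. It is a rough sketch that assumes Python's standard urllib.parse; the helper name normalize_url is hypothetical.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def normalize_url(raw_url):
    """Reduce a search-result URL to the www.___.__/ format of the URL task."""
    # urlsplit only fills in netloc when a scheme is present, so add one.
    if "://" not in raw_url:
        raw_url = "http://" + raw_url
    host = urlsplit(raw_url).netloc.lower()
    # Assumption: every recorded URL should carry the www. prefix.
    if not host.startswith("www."):
        host = "www." + host
    return host + "/"


print(normalize_url("example.us/other"))  # prints www.example.us/
```

Note that a result hosted on a subdomain (e.g. blog.example.us) would come out as www.blog.example.us/, so such rows would still need a manual look.&lt;br /&gt;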
&lt;br /&gt;
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record the link name clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a regularly occurring session linked to others&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence containing it.&lt;br /&gt;
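A minimal sketch of how the Google step above could be generated for each potential hub. It assumes each comma-separated term in Search Text is searched separately and that only the host part of the URL is passed to site:. The function name build_site_queries and the example URL are hypothetical, not part of the existing process.&lt;br /&gt;

```python
def build_site_queries(search_text, url):
    """Build one quoted, site-restricted query per comma-separated term."""
    # Assumption: keep only the host, dropping the scheme and any path
    host = url.split("://")[-1].split("/")[0]
    # Assumption: every comma-separated term becomes its own query
    terms = [term.strip() for term in search_text.split(",") if term.strip()]
    return ['"{}" site:{}'.format(term, host) for term in terms]


# Hypothetical example URL; the terms come from the Search Text above
queries = build_site_queries(
    "Fullbridge, leadership program, business academy",
    "http://www.example.com/programs",
)
for query in queries:
    print(query)
```

Each printed line can be pasted into the search engine as-is; a query that returns no result would be recorded as 0.&lt;br /&gt;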
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence containing it.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on 'mentor' pages/sections must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is also an investor, perhaps by checking whether they are from a VC firm. But even if they are from a VC firm, that doesn't mean they will invest in the startups of the Hubs where they mentor. Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot.&lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
#Search allintext:&amp;quot;office hours&amp;quot; site:URL&lt;br /&gt;
#Mark office hours as 1 if&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: a business issue (e.g. online marketing) or learning a technique (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet and network with fellow entrepreneurs and experts&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7300</id>
		<title>Ariel Sun (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7300"/>
		<updated>2016-07-20T21:55:18Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Work Log]]&lt;br /&gt;
[[Ariel Sun]] [[Work Logs]] [[Ariel Sun (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
06/01/2016 - Introduction/Wiki Building&lt;br /&gt;
&lt;br /&gt;
06/02/2016 - Refined Wiki Organization and Content&lt;br /&gt;
&lt;br /&gt;
06/03/2016 - Organized Topic Areas and Worked on Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/06/2016 - Continued Organizing Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/07/2016 - Draft of Women in Entrepreneurship blog post&lt;br /&gt;
&lt;br /&gt;
06/08/2016 - Work on Challenges Women Entrepreneurs Face wiki page&lt;br /&gt;
&lt;br /&gt;
06/09/2016 - Clean up content of patent trolls and put on the public page&lt;br /&gt;
&lt;br /&gt;
06/10/2016 - Put up resources for business dynamism in high tech issue brief page&lt;br /&gt;
&lt;br /&gt;
06/13/2016 - Clean up venture one data and LBO data&lt;br /&gt;
&lt;br /&gt;
06/14/2016 - Match venture one and LBO data to patent data&lt;br /&gt;
&lt;br /&gt;
06/15/2016 - Create tables that match patent information to each LBO/venture company&lt;br /&gt;
&lt;br /&gt;
06/16/2016 - Create and Finalize LBO/venture company and patent summary table&lt;br /&gt;
&lt;br /&gt;
06/17/2016 - Familiarize with Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/20/2016 - Analyze existing SQL code of Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/21/2016 - Clean up and rebuild Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/22/2016 - First draft of complete SQL script for Hubs datasets&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
07/15/2016 - Help Ed send out VentureOne data, add grouping and considerations of Hubs scorecard variables&lt;br /&gt;
&lt;br /&gt;
07/18/2016 - Work on differentiating curriculum v. code school, redo Matching VentureOne&lt;br /&gt;
&lt;br /&gt;
07/19/2016 - Finish Matching VentureOne, update on Wiki, work on differentiating curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
07/20/2016 - Work on differentiating OH investor v. mentor, temporary workshop v. networking meetup&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7299</id>
		<title>Ariel Sun (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7299"/>
		<updated>2016-07-20T21:54:10Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Work Log]]&lt;br /&gt;
[[Ariel Sun]] [[Work Logs]] [[Ariel Sun (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
06/01/2016 - Introduction/Wiki Building&lt;br /&gt;
&lt;br /&gt;
06/02/2016 - Refined Wiki Organization and Content&lt;br /&gt;
&lt;br /&gt;
06/03/2016 - Organized Topic Areas and Worked on Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/06/2016 - Continued Organizing Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/07/2016 - Draft of Women in Entrepreneurship blog post&lt;br /&gt;
&lt;br /&gt;
06/08/2016 - Work on Challenges Women Entrepreneurs Face wiki page&lt;br /&gt;
&lt;br /&gt;
06/09/2016 - Clean up content of patent trolls and put on the public page&lt;br /&gt;
&lt;br /&gt;
06/10/2016 - Put up resources for business dynamism in high tech issue brief page&lt;br /&gt;
&lt;br /&gt;
06/13/2016 - Clean up venture one data and LBO data&lt;br /&gt;
&lt;br /&gt;
06/14/2016 - Match venture one and LBO data to patent data&lt;br /&gt;
&lt;br /&gt;
06/15/2016 - Create tables that match patent information to each LBO/venture company&lt;br /&gt;
&lt;br /&gt;
06/16/2016 - Create and Finalize LBO/venture company and patent summary table&lt;br /&gt;
&lt;br /&gt;
06/17/2016 - Familiarize with Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/20/2016 - Analyze existing SQL code of Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/21/2016 - Clean up and rebuild Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/22/2016 - First draft of complete SQL script for Hubs datasets&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
07/15/2016 - Help Ed send out VentureOne data, add grouping, considerations of Hubs scorecard variables&lt;br /&gt;
&lt;br /&gt;
07/18/2016 - Work on differentiating curriculum v. code school, redo Matching VentureOne&lt;br /&gt;
&lt;br /&gt;
07/19/2016 - Finish Matching VentureOne, update on Wiki, work on differentiating curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
07/20/2016 - Work on differentiating OH investor v. mentor, temporary workshop v. networking meetup&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7298</id>
		<title>Ariel Sun (Work Log)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Ariel_Sun_(Work_Log)&amp;diff=7298"/>
		<updated>2016-07-20T21:53:53Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Category:Work Log]]&lt;br /&gt;
[[Ariel Sun]] [[Work Logs]] [[Ariel Sun (Work Log)|(log page)]]&lt;br /&gt;
&lt;br /&gt;
06/01/2016 - Introduction/Wiki Building&lt;br /&gt;
&lt;br /&gt;
06/02/2016 - Refined Wiki Organization and Content&lt;br /&gt;
&lt;br /&gt;
06/03/2016 - Organized Topic Areas and Worked on Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/06/2016 - Continued Organizing Public Wiki Page&lt;br /&gt;
&lt;br /&gt;
06/07/2016 - Draft of Women in Entrepreneurship blog post&lt;br /&gt;
&lt;br /&gt;
06/08/2016 - Work on Challenges Women Entrepreneurs Face wiki page&lt;br /&gt;
&lt;br /&gt;
06/09/2016 - Clean up content of patent trolls and put on the public page&lt;br /&gt;
&lt;br /&gt;
06/10/2016 - Put up resources for business dynamism in high tech issue brief page&lt;br /&gt;
&lt;br /&gt;
06/13/2016 - Clean up venture one data and LBO data&lt;br /&gt;
&lt;br /&gt;
06/14/2016 - Match venture one and LBO data to patent data&lt;br /&gt;
&lt;br /&gt;
06/15/2016 - Create tables that match patent information to each LBO/venture company&lt;br /&gt;
&lt;br /&gt;
06/16/2016 - Create and Finalize LBO/venture company and patent summary table&lt;br /&gt;
&lt;br /&gt;
06/17/2016 - Familiarize with Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/20/2016 - Analyze existing SQL code of Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/21/2016 - Clean up and rebuild Hubs datasets&lt;br /&gt;
&lt;br /&gt;
06/22/2016 - First draft of complete SQL script for Hubs datasets&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
07/15/2016 - Help Ed send out VentureOne data, add grouping, considerations of Hubs scorecard variables&lt;br /&gt;
07/18/2016 - Work on differentiating curriculum v. code school, redo Matching VentureOne&lt;br /&gt;
07/19/2016 - Finish Matching VentureOne, update on Wiki, work on differentiating curriculum v. code school&lt;br /&gt;
07/20/2016 - Work on differentiating OH investor v. mentor, temporary workshop v. networking meetup&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7297</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7297"/>
		<updated>2016-07-20T21:49:02Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite temporary workshops v. networking events */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, is always right and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for startup founders whose human capital deficits might otherwise keep them from adequately implementing their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates, with the program lasting XXX long&lt;br /&gt;
**Is a regularly occurring session linked to others&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''':a discussion/learning of a group of people on specific subjects&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not the same measure we are recording&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared against the 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location). Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site. Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet. Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1 for EXISTS (1/0), otherwise 0.&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
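The format required in the URL step of the list above (www.___.__/) can be checked or applied with a small helper. This is only a sketch: the function name normalize_url is hypothetical, and it assumes that any host lacking a www. prefix should simply have one added, which may not be right for sites served from a subdomain.&lt;br /&gt;

```python
def normalize_url(raw):
    """Reduce a URL to the www.host/ form described in the URL step."""
    # Drop the scheme, then anything after the host (path or query string)
    host = raw.strip().split("://")[-1].split("/")[0].split("?")[0].lower()
    # Assumption: always prefix www. when it is missing
    if not host.startswith("www."):
        host = "www." + host
    return host + "/"


print(normalize_url("example.us/other"))
print(normalize_url("https://www.example.us/about/team"))
```

Both calls print www.example.us/, matching the example given in the URL step.&lt;br /&gt;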
&lt;br /&gt;
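The agreement rates reported in the Twitter Activity audit results (all 3 turkers agreeing, and at least 2 agreeing, with the manually collected value) could be computed along these lines. This is a sketch under assumptions: the function name agreement_rates and the sample handles are made up for illustration and are not the audit data.&lt;br /&gt;

```python
def agreement_rates(reference, turker_answers):
    """Return (share where all turkers match, share where at least two match)."""
    if not reference:
        return (0.0, 0.0)
    all_match = 0
    two_match = 0
    for ref, answers in zip(reference, turker_answers):
        # Count how many turkers gave exactly the manually collected value
        hits = sum(1 for answer in answers if answer == ref)
        if hits == len(answers):
            all_match += 1
        if hits >= 2:
            two_match += 1
    total = len(reference)
    return (all_match / total, two_match / total)


# Hypothetical sample: two companies, three turkers each
manual = ["@hub_a", "@hub_b"]
turks = [["@hub_a", "@hub_a", "@hub_a"], ["@hub_b", "@hub_b", "@hub_b_nyc"]]
print(agreement_rates(manual, turks))
```

The comparison is an exact string match, so handles would need to be put into a consistent format (case, leading @) before being compared.&lt;br /&gt;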
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet site: company URL. 4) Search Company Name, city, square feet and then choose the first result. The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence containing it in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3, recording in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
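The "record the sentence in which the text appears" step for Size could be pre-screened in code before it is sent to turkers. The sketch below assumes the page text has already been retrieved as a plain string; the function name first_sentence_with_term is hypothetical, and the sentence splitting is deliberately naive (it would also break on decimals such as 1.5).&lt;br /&gt;

```python
import re

# Search terms taken from the Size brainstorm
SIZE_TERMS = ("sqft", "square foot", "square feet")


def first_sentence_with_term(page_text, terms=SIZE_TERMS):
    """Return the first sentence containing any term, or "DNE" if none does."""
    # Naive split: a run of non-terminators plus an optional ., ! or ?
    for sentence in re.findall(r"[^.!?]+[.!?]?", page_text):
        lowered = sentence.lower()
        if any(term in lowered for term in terms):
            return sentence.strip()
    return "DNE"


sample = "Welcome to the hub. We offer 50,000 square feet of space. Join us!"
print(first_sentence_with_term(sample))
```

Returning DNE when nothing matches mirrors the rule for searches with 0 results, so those companies could be dropped before publishing the task.&lt;br /&gt;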
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record urlhome in PAGE.&lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence containing the mention into SENTENCE, and record the name of the link clicked in PAGE.&lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a regularly occurring session that is linked to others&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no results are returned.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
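A minimal Python sketch of how the query strings for this task could be generated, assuming the Search Text is a comma-separated list and each term is searched separately. The function name build_site_queries and the URL www.example.com are hypothetical placeholders, not part of the project.&lt;br /&gt;

```python
def build_site_queries(search_terms, site_url):
    """Return one quoted query per term, restricted to site_url with site:."""
    # Drop the scheme and trailing slash so site: receives a bare host/path.
    host = site_url.split("://")[-1].rstrip("/")
    return ['"{}" site:{}'.format(term.strip(), host) for term in search_terms]


# Search Text for Curriculum, split into individual terms (assumed handling).
curriculum_terms = (
    "Fullbridge, leadership program, business academy, "
    "business course, aspiring entrepreneurs"
).split(",")

# www.example.com stands in for a potential hub's URL.
queries = build_site_queries(curriculum_terms, "http://www.example.com/")
for query in queries:
    print(query)
```

Each printed line can be pasted into the search engine as-is.&lt;br /&gt;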
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no results are returned.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on 'mentor' pages/sections must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is an investor, maybe by checking whether they are from a VC firm? But even if they are from VC firms, that doesn't mean they are going to invest in the startups of the hubs where they are mentoring. Another way to think about it is differentiating between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: a business issue (e.g. online marketing) or a technical skill (e.g. intro to JavaScript)&lt;br /&gt;
**Take the form of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet fellow entrepreneurs and experts and to network with them&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
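The date rule in the last step can be sketched in Python as follows, assuming the tweet dates have already been collected as datetime.date values. The function name twitter_activity and the sample dates are made up for illustration.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates, n=10):
    """Return the date (MM/DD/YY) of the n-th most recent tweet, or 'DNE'."""
    # Fewer than n tweets: the task says to record DNE.
    if n > len(tweet_dates):
        return "DNE"
    newest_first = sorted(tweet_dates, reverse=True)
    return newest_first[n - 1].strftime("%m/%d/%y")


# Hypothetical account with 12 tweets, one per day from July 1 to July 12, 2016.
sample = [date(2016, 7, day) for day in range(1, 13)]
result = twitter_activity(sample)
short = twitter_activity(sample[:4])
print(result, short)
```

An older date in the output indicates a less active account, which is the reason the date is recorded instead of a monthly tweet count.&lt;br /&gt;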
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information about events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even when we do so, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Repeat the same process as at the end of steps 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
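A rough Python sketch of the keyword check in the last two steps, assuming the text of the 'About'-type page has already been copied into a string. The function name nonprofit_flag and the exact pattern are assumptions; returning the matching sentences is an addition that makes the result easier to double check.&lt;br /&gt;

```python
import re

# Matches 'nonprofit', 'non-profit', 'non profit' and 'not-for-profit'
# (also 'not for profit'), ignoring case.
NONPROFIT = re.compile(
    r"\b(?:non[- ]?profit|not[- ]for[- ]profit)\b", re.IGNORECASE
)


def nonprofit_flag(page_text):
    """Return (1, matching sentences) if a keyword appears, else (0, [])."""
    # Rough sentence split: break after ., ! or ? when whitespace follows.
    sentences = re.split(r"[.!?]\s+", page_text)
    hits = [s.strip() for s in sentences if NONPROFIT.search(s)]
    return (1 if hits else 0), hits


# Made-up page text for a hypothetical hub.
flag, hits = nonprofit_flag(
    "Welcome to Example Hub. Example Hub is a not-for-profit organization. Join us"
)
print(flag, hits)
```

The pattern deliberately does not match the bare word 'profit', so a phrase such as 'for profit' is not marked as 1.&lt;br /&gt;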
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7296</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7296"/>
		<updated>2016-07-20T21:48:16Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite temporary workshops v. networking events */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but never misspecifies the existence of an onsite mentor), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a regularly occurring session that is linked to others&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on specific subjects&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not in the same way as we are recording it&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1, otherwise 0, for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
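The www.___.__/ format requested in the URL task above can be produced with a small Python helper. This is a sketch under assumptions: normalize_url is a hypothetical name, and a www. prefix is added to every host, which may be wrong for sites served from another subdomain.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def normalize_url(raw_url):
    """Reduce a URL to the www.host/ form described in the URL task."""
    # urlsplit only fills netloc when a scheme or a leading // is present.
    candidate = raw_url if "://" in raw_url else "//" + raw_url
    host = urlsplit(candidate).netloc.lower()
    # Assumption: every recorded URL should carry a www. prefix.
    if not host.startswith("www."):
        host = "www." + host
    return host + "/"


# The example from the task description: example.us/other
normalized = normalize_url("example.us/other")
print(normalized)
```

Normalizing every recorded URL the same way makes it easier to compare the answers of different turkers during an audit.&lt;br /&gt;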
&lt;br /&gt;
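The two agreement rates quoted in the Twitter Activity audit (all turkers agree with the manual result, and at least two agree) could be computed along these lines. The function name agreement_rates and the sample handles are invented for illustration and are not the audit data.&lt;br /&gt;

```python
def agreement_rates(reference, turker_answers):
    """Return (share where all turkers match, share where at least two match)."""
    all_match = 0
    two_plus = 0
    for truth, answers in zip(reference, turker_answers):
        matches = sum(1 for answer in answers if answer == truth)
        if matches == len(answers):
            all_match += 1
        if matches >= 2:
            two_plus += 1
    total = len(reference)
    return all_match / total, two_plus / total


# Hypothetical manual results and three turker answers per company.
manual = ["@a", "@b", "@c", "@d"]
turkers = [
    ["@a", "@a", "@a"],  # all three agree with the manual result
    ["@b", "@b", "@x"],  # two of three agree
    ["@c", "@y", "@z"],  # only one agrees
    ["@d", "@d", "@d"],  # all three agree
]
all_rate, majority_rate = agreement_rates(manual, turkers)
print(all_rate, majority_rate)
```

A large gap between the two rates suggests that taking the majority answer of the three turkers recovers most single-turker errors.&lt;br /&gt;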
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3, recording the results in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains the mention into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or similar and repeat Step 5.&lt;br /&gt;
#*#If none of these steps results in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that would leave them unable to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates, lasting XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
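The Google step above can be sketched in Python as follows. This is only a sketch: the function name is hypothetical, www.example.us stands in for a Company URL, and it assumes that each comma-separated term in a Search Text is searched on its own, which the notes above do not state.&lt;br /&gt;

```python
def build_site_queries(search_text, site_url):
    """Return one query per comma-separated term, in the form "term" site:URL."""
    # Assumption: every comma-separated term becomes its own quoted query.
    terms = [t.strip() for t in search_text.split(",") if t.strip()]
    return ['"{}" site:{}'.format(term, site_url) for term in terms]


# Hypothetical example; www.example.us stands in for a Company URL.
for query in build_site_queries("coding, web development, bootcamp", "www.example.us"):
    print(query)
```

A hub would then be recorded as 0 only if none of its queries returns a result.&lt;br /&gt;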
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on 'mentor' pages/sections must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is also an investor, maybe by checking whether they are from a VC firm? But even if they are from a VC firm, that does not mean they will invest in the startups of the Hubs where they mentor.  Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is learning and discussing &lt;br /&gt;
**Often have a specific topic: a business issue (e.g. online marketing) or learning a technique (e.g. intro to JavaScript)&lt;br /&gt;
**In the forms of: workshop, class, panel, project, XX session, seminar, series, intro to XX&lt;br /&gt;
**Exception: a tech meetup is usually a workshop (e.g. C++ programmer meetup, http://techranchaustin.com/events/)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**The purpose is to meet fellow entrepreneurs and experts and network with them&lt;br /&gt;
**Focus on experience sharing or communication as opposed to discussing a specific topic or technical subject&lt;br /&gt;
**In the forms of: meetup, networking, happy hour, info session?, luncheon, XX night, socials, talks??, community XX&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data between using 10, 20, and 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
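The last step can be sketched in Python as follows, assuming the tweet to record is the 10th most recent one (as described in the 7/11 update). The function name and the sample dates are made up for illustration.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates, n=10):
    """Return the date of the n-th most recent tweet as MM/DD/YY, or "DNE"."""
    # Newest first, then keep at most the n most recent dates.
    recent = sorted(tweet_dates, reverse=True)[:n]
    if len(recent) != n:
        # Fewer than n tweets exist, so there is no n-th tweet to record.
        return "DNE"
    return recent[-1].strftime("%m/%d/%y")


# Made-up dates for illustration: twelve tweets, one per day in July 2016.
sample = [date(2016, 7, day) for day in range(1, 13)]
print(twitter_activity(sample))
print(twitter_activity(sample[:4]))
```

An older date for the 10th tweet indicates a less active account, which is why a single date can stand in for a tweet count.&lt;br /&gt;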
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information about events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We would have to look into the details of each event to find out its type, and even then some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#As in Steps 4 and 5, look for a mention of mentor/mentorship/mentoring and mark as 1 if found&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, which are usually labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
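The key word check in the last two steps can be sketched in Python as follows. The function name is hypothetical, and the page text is assumed to have already been copied from the 'About'-type pages.&lt;br /&gt;

```python
import re

# Matches 'nonprofit' and 'non-profit', ignoring case.
NONPROFIT = re.compile(r"\bnon-?profit\b", re.IGNORECASE)


def nonprofit_flag(page_text):
    """Return 1 if the key word appears in the page text, otherwise 0."""
    return 1 if NONPROFIT.search(page_text) else 0


print(nonprofit_flag("We are a Non-Profit hub for local founders."))
print(nonprofit_flag("A for-profit coworking space."))
```

A match only shows that the word appears on the page, so a result of 1 may still need a manual check.&lt;br /&gt;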
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', which is usually under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7295</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7295"/>
		<updated>2016-07-20T21:12:06Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite temporary workshops v. networking events */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables for which the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD parts. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, is always right and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that would leave them unable to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates, lasting XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a group of people discussing or learning about specific subjects&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not in the same way we are&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared to the 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data between using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes these will be sections of the URL page) that describe the company; they are usually labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words exist is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
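For the Nonprofit variable in Group 1 above, steps 4 to 6 of CODE 1 of 2 (look for the key words, record EXISTS, record SENTENCES) can be sketched as follows. This is a minimal illustration that assumes the page text has already been copied into a string; the function name find_profit_sentences, the regular expression, and the naive sentence split are assumptions and are not part of the turk task itself.&lt;br /&gt;

```python
import re

# Assumed pattern for the key words in step 4: 'profit', 'nonprofit',
# 'non-profit', 'not-for-profit' (hyphen or space optional).
KEYWORD = re.compile(r"\b(?:not[- ]?for[- ]?|non[- ]?)?profit\b", re.IGNORECASE)


def find_profit_sentences(page_text):
    """Return (EXISTS, SENTENCES) for one page, mirroring steps 5 and 6."""
    # Naive sentence split: runs of text ending in '.', '!' or '?'.
    sentences = [s.strip() for s in re.findall(r"[^.!?]+[.!?]?", page_text)]
    hits = [s for s in sentences if s and KEYWORD.search(s)]
    exists = 1 if hits else 0
    return exists, hits


if __name__ == "__main__":
    sample = "We are a non-profit hub. Members get desks. Join us today!"
    print(find_profit_sentences(sample))
```

The sketch only flags candidate sentences; as noted under REQUIRES ADDITIONAL STEPS, the results would still need to be double checked.&lt;br /&gt;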
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record the results in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
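For the Size variable, the four search texts described in the 7/19 BRAINSTORM above could be generated per company instead of being typed by hand. The sketch below is illustrative only: the function name size_search_texts and the example URL, company name and city are hypothetical, and whether a given search engine honors allintext: together with site: is not verified here.&lt;br /&gt;

```python
def size_search_texts(company_url, company_name, city):
    """Build the four Size search strings from the 7/19 brainstorm."""
    terms = ["sqft", "square foot", "square feet"]
    # Searches 1-3: restrict the search to the company's own site.
    queries = ["allintext: {} site:{}".format(t, company_url) for t in terms]
    # Search 4: open web search on company name, city and 'square feet'.
    queries.append("{} {} square feet".format(company_name, city))
    return queries


if __name__ == "__main__":
    # Placeholder inputs, not taken from the hubs list.
    for q in size_search_texts("www.example.com", "Example Hub", "Chicago"):
        print(q)
```

Running the first three strings in code before publishing the task is one way to drop the companies that return 0 results, as suggested in the brainstorm.&lt;br /&gt;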
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page or section must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is also an investor, perhaps by checking whether they are from a VC firm. But even if they are from a VC firm, that does not mean they will invest in the startups at the Hubs where they mentor. Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early stage or growth stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Networking Events'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: The previously collected data included the number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this, and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''helps separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost to either record the number of tweets in a month or look up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in data observations among using 10, 20, and 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
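The Twitter Activity output described above (date of the 10th most recent tweet in MM/DD/YY format, or DNE when there are fewer than 10 tweets) can be sketched as follows. The function name twitter_activity and the assumption that tweet dates are already available as a list of date objects are illustrative; the turk task itself is done by hand on the Twitter page.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates, n=10):
    """Return the date of the n-th most recent tweet as MM/DD/YY, or 'DNE'."""
    # Keep only the n most recent tweets, newest first.
    newest_first = sorted(tweet_dates, reverse=True)[:n]
    if len(newest_first) != n:
        # Fewer than n tweets exist, so the variable is recorded as DNE.
        return "DNE"
    return newest_first[-1].strftime("%m/%d/%y")


if __name__ == "__main__":
    # Twelve made-up tweets, one per day from July 1 to July 12, 2016.
    sample = [date(2016, 7, day) for day in range(1, 13)]
    print(twitter_activity(sample))
    print(twitter_activity(sample[:9]))
```

Changing n to 20 or 30 gives the alternative cut-offs that were compared in the 7/11 update.&lt;br /&gt;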
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions, etc. We do not want to count them separately since it is difficult to do so. Most companies put all events in the same section and do not put event types in the event titles. We would have to look into the details of each event to find out its type, and even then some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Repeat the same check: look for a mention of mentor/mentorship/mentoring and, if found, mark as 1&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If no listing exists on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7294</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7294"/>
		<updated>2016-07-20T21:09:27Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD parts. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that have complete accuracy even if they are not comprehensive (e.g. a task that covers only 40% of firms, but always guarantees that there is an onsite mentor when it reports one and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
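The trade-off in the General Approach above (a task that is always right when it reports a mentor, but only finds some of the firms) can be checked during an audit with two simple ratios. The sketch below is a hedged illustration: the function name accuracy_and_coverage and the sample data are made up, and reading 'covers only 40% of firms' as the share of firms with mentors that the task finds is an assumption.&lt;br /&gt;

```python
def accuracy_and_coverage(flagged, truth):
    """Compare turk flags (1/0) with audited values (1/0), firm by firm.

    accuracy: share of firms flagged 1 that really are 1.
    coverage: share of firms that really are 1 and were flagged 1.
    """
    pairs = list(zip(flagged, truth))
    true_pos = sum(1 for f, t in pairs if f == 1 and t == 1)
    flagged_pos = sum(1 for f, _ in pairs if f == 1)
    actual_pos = sum(1 for _, t in pairs if t == 1)
    accuracy = true_pos / flagged_pos if flagged_pos else None
    coverage = true_pos / actual_pos if actual_pos else None
    return accuracy, coverage


if __name__ == "__main__":
    # Made-up audit: 5 of 7 firms have mentors, the task finds 2 of them
    # and never flags a firm without mentors.
    audited = [1, 1, 1, 1, 1, 0, 0]
    turk = [1, 1, 0, 0, 0, 0, 0]
    print(accuracy_and_coverage(turk, audited))
```

A filter of the kind described would show an accuracy of 1.0 with a coverage below 1.0; the remaining firms are then left for manual collection.&lt;br /&gt;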
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on specific subjects&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a procedure for automating Google searches for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not in the same way as we are recording it&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 that we did manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost to either record the number of tweets in a month or look up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in data observations among using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on the first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) of the company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If it is unclear which text is the main text in the prior step, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to the links (sometimes these will be sections of the URL page) that describe the company; they are usually labelled 'About', 'Our Story,' or 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1, otherwise 0, for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences in which the word is found under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
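A minimal sketch of the recording format in the Group 1 URL code above (www.___.__/), assuming Python's standard urllib.parse. The function name normalize_url and the choice to return None for an empty input are hypothetical and not part of the turk instructions.

```python
from urllib.parse import urlsplit


def normalize_url(raw):
    # Hypothetical helper: reduce any URL to the www.___.__/ format.
    raw = raw.strip()
    # urlsplit only finds the host when a scheme is present, so add one.
    if '://' not in raw:
        raw = 'http://' + raw
    host = urlsplit(raw).hostname or ''
    # Drop an existing www. prefix so it is not doubled below.
    if host.startswith('www.'):
        host = host[4:]
    if not host:
        # Assumption: nothing usable was found in the input.
        return None
    return 'www.' + host + '/'


print(normalize_url('example.us/other'))
```

Applied to the example in the instructions, example.us/other is recorded as www.example.us/.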
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
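A minimal sketch of how the Google: "Search Text" site:URL step above could be turned into query strings, one per search term. The function name site_queries is hypothetical, and searching each comma-separated term separately is an assumption; the notes do not say whether terms are searched one at a time or together.

```python
def site_queries(search_text, site_url):
    # Hypothetical helper: split the comma-separated Search Text and
    # build one quoted, site-restricted query per term.
    terms = [t.strip() for t in search_text.split(',') if t.strip()]
    return ['"' + term + '" site:' + site_url for term in terms]


# Example with two of the Search Text 1 terms and a placeholder URL.
for query in site_queries('coding, bootcamp', 'www.example.us'):
    print(query)
```

Each resulting string can be pasted into the search engine as in step 1 of the procedure.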
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page/section must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is an investor, maybe by checking whether they are from a VC firm? But even if they are from a VC firm, that doesn't mean they are going to invest in the startups of the Hubs where they mentor.  Another way to think about it is differentiating between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early stage or growth stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size as getting physical sqfootage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories, just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions, etc. We do not count them separately since it is difficult to do so. Most companies put all events in the same section and do not put event types in the event titles. We would have to look into the details of each event to find its type, and even then some event descriptions do not allow us to determine the type easily. Differentiating event types demands more time and effort and is therefore not suitable for a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
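A minimal sketch of the key word check in steps 4 and 5 of the Nonprofit procedure above, assuming the text of the 'About'/'Our Story'/'Mission' page has already been copied into a string. The function name nonprofit_flag is hypothetical; the pattern covers only the two spellings named in the procedure ('nonprofit' and 'non-profit').

```python
import re

# Matches 'nonprofit' or 'non-profit' as a whole word, in any letter case.
NONPROFIT = re.compile(r'\bnon-?profit\b', re.IGNORECASE)


def nonprofit_flag(page_text):
    # Hypothetical helper: 1 if the key word is identified, otherwise 0.
    return 1 if NONPROFIT.search(page_text) else 0


print(nonprofit_flag('We are a non-profit organization.'))
```

A page that only mentions the bare word 'profit' is marked 0 under this pattern, which matches the narrower key word list in this version of the procedure.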
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7293</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7293"/>
		<updated>2016-07-20T21:08:54Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Group 4 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD parts. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that have complete accuracy even if not comprehensiveness (e.g. a task that covers only 40% of firms, but always correctly identifies an onsite mentor when it reports one and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion/learning session for a group of people on specific subjects&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: Java script 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not same as we are&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared against 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format: www.___.__/ (e.g. if the URL is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on the first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) of the company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page, up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If it is unclear where the main text is in the prior step, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes these will be sections of the URL page) that describe the company; they are usually labelled 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is found, record 1 for EXISTS (1/0); otherwise record 0.&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
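The www.___.__/ format asked for in the URL task above can be sketched in Python as follows. This is a minimal illustration, not the procedure the turkers used: the function name normalize_url is hypothetical, returning DNE for an empty input is an assumption, and hosts with other subdomains (e.g. blog.example.us) would simply get www. prepended.&lt;br /&gt;

```python
from urllib.parse import urlparse


def normalize_url(raw_url):
    """Reduce a URL to the www.___.__/ form described in the URL task."""
    text = raw_url.strip()
    # urlparse only fills in the host when a scheme or a leading // is present.
    if "://" not in text:
        text = "//" + text
    host = urlparse(text).netloc.lower()
    # Drop credentials and port, which the task format has no place for (assumption).
    host = host.split("@")[-1].split(":")[0]
    if host.startswith("www."):
        host = host[len("www."):]
    if not host:
        return "DNE"  # assumption: mirror the DNE convention used in other tasks
    return "www." + host + "/"


print(normalize_url("example.us/other"))  # www.example.us/
```

Applying the same function to both the turker output and the manually collected URLs would make the two directly comparable during an audit.&lt;br /&gt;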
&lt;br /&gt;
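The two agreement figures reported in the Twitter Activity audit (all 3 turkers agree with the manual result, at least 2 turkers agree) could be computed along these lines. The data below is invented for illustration, and the function name and input layout are assumptions rather than the format of the actual Mechanical Turk export.&lt;br /&gt;

```python
def agreement_rates(manual, turker_answers):
    """Return (share where every turker matches, share where at least 2 match).

    manual: dict mapping company to the manually coded value.
    turker_answers: dict mapping company to the list of turker values.
    Assumes manual is non-empty and every company has turker answers.
    """
    all_match = 0
    two_match = 0
    for company, truth in manual.items():
        answers = turker_answers[company]
        hits = sum(1 for answer in answers if answer == truth)
        if hits == len(answers):
            all_match += 1
        if hits >= 2:
            two_match += 1
    total = len(manual)
    return all_match / total, two_match / total


# Hypothetical audit sample of four companies with three turkers each.
manual = {"A": "@a", "B": "@b", "C": "@c", "D": "@d"}
turkers = {
    "A": ["@a", "@a", "@a"],
    "B": ["@b", "@b", "@x"],
    "C": ["@c", "@x", "@y"],
    "D": ["@d", "@d", "@d"],
}
print(agreement_rates(manual, turkers))  # (0.5, 0.75)
```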
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence containing the text in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record the results in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
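The search texts described in the Size brainstorm could be generated per company with a small helper like the one below. This is only a sketch: the function name is hypothetical, quoting the two-word terms as phrases is an assumption, and how a given search engine treats allintext: combined with site: is not verified here.&lt;br /&gt;

```python
SIZE_TERMS = ["sqft", "square foot", "square feet"]


def size_search_texts(company_url, company_name, city):
    """Build the three site-restricted searches plus the general fallback search."""
    queries = []
    for term in SIZE_TERMS:
        # Quote multi-word terms so they are searched as a phrase (assumption).
        phrase = '"' + term + '"' if " " in term else term
        queries.append("allintext: " + phrase + " site:" + company_url)
    # Fourth search from the brainstorm: company name, city, square feet.
    queries.append(company_name + ", " + city + ", square feet")
    return queries


for query in size_search_texts("www.example.us", "Example Hub", "Austin"):
    print(query)
```

Generating the texts up front would also let an automated pre-pass drop the searches that return 0 results before anything is sent to Mechanical Turk.&lt;br /&gt;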
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE.&lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains the mention into SENTENCE, and record the name of the link clicked in PAGE.&lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
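The last step above (record the sentence in which the search text appears) can be sketched as follows, assuming the text of the first result page has already been copied into a string. The function name is hypothetical and the sentence splitting is deliberately naive, so abbreviations such as 'e.g.' would be split incorrectly.&lt;br /&gt;

```python
import re


def sentences_with_terms(page_text, search_terms):
    """Return the sentences of page_text that mention any of the search terms."""
    # Naive split: a run of non-terminator characters plus an optional . ! or ?
    pieces = re.findall(r"[^.!?]+[.!?]?", page_text)
    sentences = [piece.strip() for piece in pieces if piece.strip()]
    lowered_terms = [term.lower() for term in search_terms]
    return [
        sentence
        for sentence in sentences
        if any(term in sentence.lower() for term in lowered_terms)
    ]


page = "Our hub hosts events. We run a coding bootcamp every fall. Join us!"
matches = sentences_with_terms(page, ["bootcamp", "web development"])
# Following the steps above: record 0 when nothing matches, else the sentence.
record = matches[0] if matches else 0
print(record)
```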
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page/section must all be mentors, and the same applies for investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is an investor, maybe by checking whether they are from a VC firm?? But even if they are from a VC firm, that doesn't mean they are going to invest in the startups of the Hubs they are mentoring at.  Another way to think about it is differentiating between mentors and OH mentors. Mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early stage or growth stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected data included the number of events but did not separate them into categories (e.g. networking events, workshops, etc.).  We view this separation as important, BUT very difficult to collect.&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this, and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories, just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''':&lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
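The Twitter Activity rule (date of the 10th most recent tweet, or DNE when there are fewer than 10 tweets) can be written down as a short function. This sketch assumes the tweet dates are already available as Python date objects; it does not call any Twitter API, and the function name is hypothetical.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates):
    """Return the date of the 10th most recent tweet as MM/DD/YY, or DNE."""
    if len(tweet_dates) >= 10:
        newest_first = sorted(tweet_dates, reverse=True)
        # Index 9 is the 10th most recent tweet.
        return newest_first[9].strftime("%m/%d/%y")
    return "DNE"


# Hypothetical account that tweeted once a day from July 1 to July 12, 2016.
sample = [date(2016, 7, day) for day in range(1, 13)]
print(twitter_activity(sample))      # 07/03/16
print(twitter_activity(sample[:9]))  # DNE
```

An older date means a less active account, so the recorded dates can be compared directly across companies.&lt;br /&gt;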
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
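The counting step above can be sketched as a small function, assuming the event dates have already been read off the company's events page. Treating a missing events page (None) as DNE and an empty list as a count of 0 is an assumption about how the two cases should be told apart; the function name is hypothetical.&lt;br /&gt;

```python
from datetime import date


def count_events(event_dates, year=2016, month=7):
    """Count events in the given month; None means the site lists no events."""
    if event_dates is None:
        return "DNE"
    return sum(1 for day in event_dates if (day.year, day.month) == (year, month))


# Hypothetical event listing that straddles the target month.
listing = [date(2016, 6, 30), date(2016, 7, 1), date(2016, 7, 15), date(2016, 8, 1)]
print(count_events(listing))  # 2
print(count_events(None))     # DNE
```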
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions, etc. We do not want to count them separately because it is difficult to do so. Most companies list all events in the same section and do not put event types in the event titles. We would have to look into the details of each event to find its type, and even then some event descriptions do not make the type easy to determine. Differentiating event types demands more time and effort and is therefore not suitable for a Mechanical Turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Follow the same process as at the end of steps 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company; they are usually labelled 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
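The keyword check in the last two steps could be automated once the text of the 'About' or 'Mission' page has been copied into a string. The pattern below is a sketch covering only the two spellings named above (it also matches plurals such as 'nonprofits'); the function name is hypothetical.&lt;br /&gt;

```python
import re

NONPROFIT_PATTERN = re.compile(r"\bnon-?profit", re.IGNORECASE)


def nonprofit_flag(page_text):
    """Return 1 if 'nonprofit' or 'non-profit' appears in the text, else 0."""
    return 1 if NONPROFIT_PATTERN.search(page_text) else 0


print(nonprofit_flag("We are a Non-Profit organization."))  # 1
print(nonprofit_flag("A for-profit coworking space."))      # 0
```

A flag of 1 only shows that the word appears on the page, so a result could still need a manual double check (for example, a page that says the space hosts nonprofits).&lt;br /&gt;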
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7292</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7292"/>
		<updated>2016-07-20T21:06:06Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page documents the Mechanical Turk work for the paper [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories that the majority of the variables fall under:&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, is always right and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
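The accurate-but-not-comprehensive goal in the paragraph above can be checked during an audit by comparing what a filter flags against manually verified values. The sketch below uses invented data and hypothetical names; 'precision' here means the share of flagged firms that really have the feature, and 'coverage' the share of firms with the feature that the filter catches.&lt;br /&gt;

```python
def precision_and_coverage(flagged, truth):
    """Compare filter output with audited values (dicts of firm name to 1/0)."""
    flagged_firms = [firm for firm in truth if flagged.get(firm) == 1]
    true_firms = [firm for firm in truth if truth[firm] == 1]
    correct = [firm for firm in flagged_firms if truth[firm] == 1]
    # None signals that a rate is undefined (nothing flagged, or nothing to find).
    precision = len(correct) / len(flagged_firms) if flagged_firms else None
    coverage = len(correct) / len(true_firms) if true_firms else None
    return precision, coverage


# Hypothetical audit: five firms all have onsite mentors, the filter finds two.
truth = {"A": 1, "B": 1, "C": 1, "D": 1, "E": 1}
flagged = {"A": 1, "B": 1, "C": 0, "D": 0, "E": 0}
print(precision_and_coverage(flagged, truth))  # (1.0, 0.4)
```

A filter like this one would be acceptable under the approach above: it never misreports a mentor, and the firms it misses are left for manual collection.&lt;br /&gt;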
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Activity level was recorded as 2/1/0, which is not the same measure we are collecting&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 that we collected manually, for '''Twitter handle''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several Twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data between using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written. Code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
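A minimal sketch of how agreement rates like those reported in the Twitter Activity audit could be computed. The function name, the input layout, and the case-insensitive comparison are assumptions for illustration, not the procedure actually used for the audit.&lt;br /&gt;

```python
def agreement_rates(manual_values, turk_answers, min_agree=2):
    """Return (share where all turkers agree, share where at least min_agree agree).

    manual_values[i] is the value we collected by hand for company i;
    turk_answers[i] is the list of values the turkers recorded for it.
    """
    if not manual_values or len(manual_values) != len(turk_answers):
        raise ValueError("need one non-empty list of turk answers per company")
    all_agree = 0
    enough_agree = 0
    for ours, answers in zip(manual_values, turk_answers):
        # Compare ignoring case and surrounding spaces (an assumption).
        target = ours.strip().lower()
        matches = sum(1 for answer in answers if answer.strip().lower() == target)
        if matches == len(answers):
            all_agree += 1
        if matches >= min_agree:
            enough_agree += 1
    total = len(manual_values)
    return all_agree / total, enough_agree / total
```

With three turkers per company, the first rate corresponds to 'all 3 turkers agreed' and the second to 'at least 2 turkers agreed'.&lt;br /&gt;
&lt;br /&gt;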
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE.&lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring.&lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record the link name clicked in PAGE.&lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
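The keyword check in CODE 1 of 2 for Mentors could be sketched as follows once the page text has been copied out. The function name, the keyword tuple, and the simple sentence splitting are assumptions for illustration; fetching the page and choosing which links to follow are left out.&lt;br /&gt;

```python
import re

# Keywords taken from the steps above; the tuple itself is an assumption.
MENTOR_KEYWORDS = ("mentor", "mentors", "mentorship", "mentoring")


def find_keyword_sentences(page_text, keywords=MENTOR_KEYWORDS):
    """Return (BINARY, SENTENCES) for one page of text.

    BINARY is 1 when any keyword appears as a whole word, 0 otherwise;
    SENTENCES lists every sentence that contains a keyword.
    """
    hits = []
    # Naive split: a sentence runs up to the next '.', '!' or '?'.
    for sentence in re.findall(r"[^.!?]+[.!?]?", page_text):
        words = set(re.findall(r"[a-z]+", sentence.lower()))
        if words.intersection(keywords):
            hits.append(sentence.strip())
    return (1 if hits else 0), hits
```

A page with no keyword returns 0 and an empty list, which matches the final step of marking the company as 0.&lt;br /&gt;
&lt;br /&gt;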
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
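A minimal sketch of building the Google queries for the Potential Turks steps above, assuming each comma-separated entry in a Search Text list is searched separately as a quoted phrase. The function name and the example URL are hypothetical.&lt;br /&gt;

```python
def build_site_queries(search_text, site_url):
    """Turn a comma-separated Search Text list into one quoted query per term."""
    terms = [term.strip() for term in search_text.split(",") if term.strip()]
    return ['"{}" site:{}'.format(term, site_url) for term in terms]


# Hypothetical example using two of the Curriculum search terms.
queries = build_site_queries("leadership program, business academy", "www.example.us")
for query in queries:
    print(query)
```

Each query restricts results to the company's own site, so a search with no results can be recorded as 0 directly.&lt;br /&gt;
&lt;br /&gt;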
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on a 'mentor' page/section must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is also an investor, perhaps by checking whether they are from a VC firm. But even if they are from a VC firm, that doesn't mean they will invest in the startups of the Hubs where they mentor.  Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot.&lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor office hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing on early stage or growth stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data between using 10, 20, and 30 tweets.&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even if we do so, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for the description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link or mention of 'Sponsors' or 'Partners', which is often under the 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7291</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7291"/>
		<updated>2016-07-20T21:05:36Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but never misspecifies the existence of an onsite mentor), and audit the results.&lt;br /&gt;
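&lt;br /&gt;
The trade-off described above (complete accuracy even if not comprehensive) can be checked during auditing with a short calculation. This is a sketch under assumptions: the function name and the 1/0 list layout are hypothetical, 'precision' is the share of task-flagged firms confirmed by the manual audit, and 'coverage' is the share of all firms that the task flags.&lt;br /&gt;

```python
def precision_and_coverage(turk_flags, audit_flags):
    """Return (precision, coverage) for a binary turk task.

    turk_flags[i] is 1 when the task marked firm i as having the variable;
    audit_flags[i] is 1 when the manual audit confirmed it for firm i.
    """
    if len(turk_flags) != len(audit_flags):
        raise ValueError("both lists must describe the same firms")
    # Audit outcomes for only those firms the task flagged.
    flagged = [audit for turk, audit in zip(turk_flags, audit_flags) if turk == 1]
    precision = sum(flagged) / len(flagged) if flagged else None
    coverage = len(flagged) / len(turk_flags) if turk_flags else None
    return precision, coverage


# Hypothetical audit of 10 firms: the task flags 4, and all 4 are confirmed.
result = precision_and_coverage(
    [1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1, 0, 0, 0, 0],
)
```

A task that fits the goal above would keep precision at 1.0 while coverage stays well below 1.0; the remaining firms would be left for other tasks or manual collection.&lt;br /&gt;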
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': A discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal&lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow the steps for Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded as 2/1/0 to represent activity level, which is not the measure we are collecting&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared against the 30 companies we coded manually, for '''Twitter handle''', all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several Twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data between using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#*#Record the company's Twitter handle in Twitter Handle.&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet in Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format: www.___.__/ (e.g. if the URL is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in Search Text 1 into a search engine (it will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on the first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) of the company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page, up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If it is unclear what the main text is in the prior step, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1, otherwise 0, under EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
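The recording format in the URL task above (www.___.__/) can be sketched in Python as follows. This is a minimal sketch, not the procedure the turkers follow: the function name record_format and the choice to drop any path, query, or port are assumptions made for illustration.&lt;br /&gt;

```python
from urllib.parse import urlparse


def record_format(url):
    """Reduce a URL to the www.___.__/ form used in the URL task."""
    # urlparse only finds the host when a scheme is present, so add one if missing.
    if '://' not in url:
        url = 'http://' + url
    host = urlparse(url).hostname  # lowercased host name, without any port
    if host.startswith('www.'):
        host = host[4:]
    return 'www.' + host + '/'


print(record_format('example.us/other'))
```

For the example given in the URL task, record_format('example.us/other') returns 'www.example.us/'.&lt;br /&gt;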
&lt;br /&gt;
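The two agreement figures reported in the Twitter Activity audit (all 3 turkers matching our result, and at least 2 of 3 matching) can be computed along these lines. This is a minimal sketch under stated assumptions: the function name agreement_rates, the dictionary layout, and the sample handles are hypothetical and are not the audit data.&lt;br /&gt;

```python
def agreement_rates(manual, turk_answers):
    """Return (share with all 3 turkers matching, share with 2 or 3 matching)."""
    match_counts = []
    for company, expected in manual.items():
        answers = turk_answers[company]  # assumed: exactly three answers per company
        match_counts.append(sum(1 for answer in answers if answer == expected))
    total = len(match_counts)
    all_three = sum(1 for m in match_counts if m == 3) / total
    two_or_more = sum(1 for m in match_counts if m in (2, 3)) / total
    return all_three, two_or_more


# Hypothetical sample: these handles are invented for illustration only.
manual = {'Hub A': '@hub_a', 'Hub B': '@hub_b', 'Hub C': '@hub_c', 'Hub D': '@hub_d'}
turk_answers = {
    'Hub A': ['@hub_a', '@hub_a', '@hub_a'],
    'Hub B': ['@hub_b', '@hub_b', '@hubb_nyc'],
    'Hub C': ['@hub_c', '@hub_c', '@hub_c'],
    'Hub D': ['@hubd_sf', '@hubd_la', '@hub_d'],
}
print(agreement_rates(manual, turk_answers))
```

In this invented sample, two of four companies have all three turkers matching and three of four have at least two matching, so the function returns (0.5, 0.75).&lt;br /&gt;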
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) Searches 1), 2), and 3): search allintext: sqft/square foot/square feet site: company URL.  Search 4): search Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence containing it in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record the results in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
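The four searches in the Size brainstorm above could be generated per company along these lines. This is a minimal sketch: the function name size_search_texts and the sample company, city, and URL are hypothetical, and the query strings simply mirror the wording of the brainstorm rather than a tested search syntax.&lt;br /&gt;

```python
def size_search_texts(company, city, url):
    """Build the four Size search strings for one potential hub."""
    terms = ['sqft', 'square foot', 'square feet']
    # Searches 1-3: one size term each, restricted to the company's own site.
    searches = ['allintext: {} site:{}'.format(term, url) for term in terms]
    # Search 4: company name, city, and 'square feet' on the open web.
    searches.append('{}, {}, square feet'.format(company, city))
    return searches


# Hypothetical input, for illustration only.
result = size_search_texts('Example Hub', 'Austin', 'www.example.com')
for text in result:
    print(text)
```

Generating the strings up front would also make it possible to drop searches that return 0 results before any task is published, as the brainstorm suggests.&lt;br /&gt;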
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record urlhome in PAGE.&lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence containing the mention into SENTENCE, and record the name of the link clicked in PAGE.&lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for startup founders whose human capital deficits might otherwise leave them unable to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates, with the course lasting XXX&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no results are returned.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence containing it.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**The target group is developers or other people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no results are returned.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence containing it.&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): &lt;br /&gt;
The names listed on 'mentor' pages/sections must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is also an investor, perhaps by checking whether they are from a VC firm. But even if they are from a VC firm, that does not mean they will invest in the startups of the hubs where they mentor. Another way to think about it is to differentiate between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot.&lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor hours&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this, and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''helps separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as the LinkedIn self-identifier, no categories, just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data between using 10, 20, and 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#Record the company's Twitter handle in Twitter Handle.&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet in Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on Amazon's Mechanical Turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the company's website. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions, etc. We do not want to count them separately since it is difficult to do so. Most companies list all their events in the same section and do not put event types in the event titles. We would have to look into the details of each event to find its type, and even then some event descriptions do not allow us to determine the type easily. Differentiating the event types demands more time and effort and is therefore not suitable for a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on Amazon's Mechanical Turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the company's website. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Repeat the same check there as at the end of steps 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on Amazon's Mechanical Turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the company's website. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
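The keyword check in steps 4 and 5 above can be sketched as a small text filter. This is a minimal sketch, assuming the text of the 'About'-style page has already been copied into a string: the function name nonprofit_flag, the sentence splitting rule, and the sample text are hypothetical, and returning the matching sentences is an addition meant to make the mark easier to double check.&lt;br /&gt;

```python
import re

# Matches 'nonprofit' and 'non-profit' in any capitalization.
KEYWORD = re.compile(r'non-?profit', re.IGNORECASE)


def nonprofit_flag(page_text):
    """Return (1 or 0, list of sentences that contain the key word)."""
    # Rough sentence split: runs of text ending in '.', '!' or '?'.
    sentences = [s.strip() for s in re.findall(r'[^.!?]+[.!?]?', page_text)]
    hits = [s for s in sentences if KEYWORD.search(s)]
    return (1 if hits else 0), hits


# Hypothetical page text, for illustration only.
sample = 'We are a non-profit hub. Members get desks. Our Nonprofit status began in 2014.'
flag, found = nonprofit_flag(sample)
print(flag, found)
```

A page with no match returns (0, []), which corresponds to marking the company as 0 in step 5.&lt;br /&gt;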
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on Amazon's Mechanical Turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the company's website. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the 'Members' link or section is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''': ''UPDATE: written, not published, on Amazon's Mechanical Turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the company's website. If no listing appears on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7290</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7290"/>
		<updated>2016-07-20T21:04:43Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page documents the work on mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics had been created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
The majority of the variables fall under a few categories:&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables for which the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that have complete accuracy even if not comprehensive coverage (e.g. a task that identifies an onsite mentor at only 40% of firms, but never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for startup founders whose human capital deficits might otherwise leave them unable to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow the steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not the same measure as we are using&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared to the 30 that we did manually, for '''Twitter handle''', all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several Twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost to either record the number of tweets in a month or look up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in data observations between using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
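The output format asked for in the URL task above (www.___.__/) could also be enforced after collection. The following is a minimal Python sketch; the function name is hypothetical, and it assumes that any leading www. should be dropped and re-added and that other subdomains are kept as they are.&lt;br /&gt;

```python
from urllib.parse import urlparse


def normalize_url(raw):
    """Reduce a recorded URL to the www.___.__/ format used in the URL task."""
    raw = raw.strip()
    # urlparse only fills in the host when a scheme or a leading // is present.
    if "://" not in raw:
        raw = "//" + raw
    host = urlparse(raw).netloc.lower()
    # Assumption: strip an existing www. so it is never doubled.
    if host.startswith("www."):
        host = host[len("www."):]
    return "www." + host + "/"


print(normalize_url("example.us/other"))
```

Running this on the example from the task, example.us/other, gives www.example.us/.&lt;br /&gt;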
&lt;br /&gt;
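The two agreement rates reported in the Twitter Activity audit above (all 3 turkers agree with our result, and at least 2 agree) can be computed along the following lines. This is a sketch only: the function name and the sample handles are hypothetical and are not the audit data.&lt;br /&gt;

```python
def agreement_rates(turker_answers, our_answers):
    """Return (share where all turkers match ours, share where at least 2 match)."""
    all_agree = 0
    two_agree = 0
    for answers, ours in zip(turker_answers, our_answers):
        matches = sum(1 for answer in answers if answer == ours)
        if matches == len(answers):
            all_agree += 1
        if matches >= 2:
            two_agree += 1
    total = len(our_answers)
    return all_agree / total, two_agree / total


# Hypothetical sample: 4 companies, 3 turker answers each.
turkers = [("@a", "@a", "@a"), ("@b", "@b", "@x"), ("@c", "@y", "@z"), ("@d", "@d", "@d")]
ours = ["@a", "@b", "@c", "@d"]
print(agreement_rates(turkers, ours))
```

&lt;br /&gt;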
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
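The search texts described in the Size brainstorm could be generated per company with a short Python sketch like the one below. The function name and the sample company are hypothetical, and the exact query format (including how a search engine treats allintext: together with site:) is an assumption that would need to be checked before use.&lt;br /&gt;

```python
def size_search_texts(company_url, company_name, city):
    """Build the three site-restricted searches plus the name/city fallback."""
    terms = ["sqft", "square foot", "square feet"]
    # Searches 1-3: one per term, restricted to the company's own site.
    searches = ["allintext: {} site:{}".format(term, company_url) for term in terms]
    # Search 4: company name, city, square feet (take the first result).
    searches.append("{}, {}, square feet".format(company_name, city))
    return searches


for text in size_search_texts("example.com", "Example Hub", "Austin"):
    print(text)
```
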
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
Thoughts (Ariel, 07/20): The names listed on 'mentor' pages/sections must all be mentors, and the same applies to investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether the mentor is an investor, maybe by checking whether they are from a VC firm? But even if they are from VC firms, that doesn't mean they are going to invest in the startups of the Hubs where they are mentoring.  Another way to think about it is differentiating between mentors and OH mentors. Mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
**Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
**Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
**Some companies offer mentor hours&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Focus on investing in early-stage or growth-stage startups&lt;br /&gt;
**Usually from VC firms&lt;br /&gt;
**Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost to either record the number of tweets in a month or look up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in data observations between using 10, 20, and 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even if we do so, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
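The keyword check in the last two steps can be expressed as a short Python sketch. The function name is hypothetical, and the sketch assumes the page text has already been copied from the 'About'-type page. It matches only the two spellings listed above ('nonprofit' and 'non-profit'), so 'non profit' or 'not-for-profit' would be marked 0.&lt;br /&gt;

```python
import re

# Matches 'nonprofit' and 'non-profit' as whole words, in any letter case.
NONPROFIT = re.compile(r"\bnon-?profit\b", re.IGNORECASE)


def nonprofit_flag(page_text):
    """Return 1 if the page text mentions nonprofit/non-profit, otherwise 0."""
    return 1 if NONPROFIT.search(page_text) else 0


print(nonprofit_flag("We are a non-profit organization."))
print(nonprofit_flag("A profitable coworking space."))
```
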
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7289</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7289"/>
		<updated>2016-07-20T21:03:32Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
There are a few categories the majority of the variables fall under&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
There are certain variables where the information is not readily available online or difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  To fix this, we will need to create filters akin to the DSM5 scorecard.  See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description; develop the characteristics after looking over examples; create possible mechanical turk tasks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, never misspecifies the existence of mentors); and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''':a discussion/learning of a group of people on specific subjects&lt;br /&gt;
*'''Characteristic''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, which is not the same measure as ours&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 companies that we coded manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is &amp;lt;15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written, expected time for each assignment is 20-30 seconds - pay rate, therefore, recommended $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written, code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds - pay rate, therefore, recommended $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1, otherwise 0, for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
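A minimal Python sketch of the URL format described in the URL task above (e.g. example.us/other recorded as www.example.us/). The function name normalize_url is hypothetical, always adding a www. prefix simply follows the task's stated format, and returning DNE for an unusable input is an assumption borrowed from the other tasks.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def normalize_url(raw):
    """Reduce a URL to the www.host/ form used in the URL task (illustrative only)."""
    raw = raw.strip()
    # urlsplit only recognises the host when a scheme separator is present
    if "://" not in raw:
        raw = "http://" + raw
    # hostname is lowercased and excludes any port; it is None when missing
    host = urlsplit(raw).hostname or ""
    if not host:
        return "DNE"  # assumption: reuse the DNE marker for unusable input
    if host.startswith("www."):
        host = host[4:]
    return "www." + host + "/"


print(normalize_url("example.us/other"))
```

Normalizing before comparison makes it easier to check whether different turkers recorded the same site.&lt;br /&gt;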
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click first link in which result search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
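The four search texts listed in the Size brainstorm above could be generated per company along these lines. This is a sketch only: the function name size_search_texts, the exact spacing of the operators, and the sample company are assumptions, not the texts actually uploaded.&lt;br /&gt;

```python
# The three size phrasings named in the brainstorm
SIZE_TERMS = ["sqft", "square foot", "square feet"]


def size_search_texts(company, city, url):
    """Build Search Text 1-3 (site-restricted) and Search Text 4 (general)."""
    site = url.strip().rstrip("/")
    texts = ["allintext: " + term + " site:" + site for term in SIZE_TERMS]
    # Search Text 4: company name, city, and the phrase square feet
    texts.append(company + ", " + city + ", square feet")
    return texts


# Hypothetical company used purely for illustration
for text in size_search_texts("Example Hub", "Austin", "www.example.us/"):
    print(text)
```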
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text''': Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**'''Search Text 1''': website design, coding, web development, software, bootcamp&lt;br /&gt;
**'''Search Text 2''': General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
**Thoughts (Ariel, 07/20): The names listed on a 'mentor' page/section must all be mentors, and the same applies for investors/OH investors, although few companies list their investors. So the only thing we are trying to differentiate here is whether a mentor is also an investor, maybe by checking whether they are from a VC firm? But even if they are from a VC firm, that doesn't mean they are going to invest in the startups of the Hubs they are mentoring at.  Another way to think about it is differentiating between mentors and OH mentors: mentors tend to give particular startups long-term support and are available when needed, while OH mentors only give advice on the spot. &lt;br /&gt;
&lt;br /&gt;
'''Mentors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
*Focus on improving entrepreneurial community through ongoing, recurring support&lt;br /&gt;
*Help and guide the startups on: business plans and models, management, development, execution, technology innovation, marketing, sales&lt;br /&gt;
*Common fields/occupations: founder/CEO of another company, business development, serial entrepreneur, marketing, sales, management consulting, technology and innovation, research professor etc.&lt;br /&gt;
*Some companies offer mentor hours&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Investors'''&lt;br /&gt;
&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
*Focus on investing on early stage or growth stage startups&lt;br /&gt;
*Usually from VC firms&lt;br /&gt;
*Common fields/ occupations: VC firm manager, VC firm partner, fund manager&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
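The audit comparison reported for Twitter Activity (how often all 3 turkers, or at least 2 of them, match the manually recorded value) can be sketched in Python as follows. The function name, the dictionary layout, and the sample handles are illustrative assumptions, not the data or procedure actually used in the audit.&lt;br /&gt;

```python
def agreement_rates(manual, turk_answers):
    """Return (share where every turker matches, share where at least two match)."""
    all_agree = 0
    two_or_more = 0
    for company, truth in manual.items():
        answers = turk_answers[company]
        matches = sum(1 for answer in answers if answer == truth)
        if matches == len(answers):
            all_agree += 1
        # min(matches, 2) == 2 is true exactly when two or more answers match
        if min(matches, 2) == 2:
            two_or_more += 1
    total = len(manual)
    return all_agree / total, two_or_more / total


# Hypothetical handles for four audited companies, three turkers each
manual = {"A": "@a", "B": "@b", "C": "@c", "D": "@d"}
turk_answers = {
    "A": ["@a", "@a", "@a"],
    "B": ["@b", "@b", "@x"],
    "C": ["@c", "@y", "@z"],
    "D": ["@d", "@d", "@d"],
}
print(agreement_rates(manual, turk_answers))
```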
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size as getting physical sqfootage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
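As an illustration of the Twitter Activity rule in the steps above, the following Python sketch assumes the recorded value is the date of the 10th most recent tweet, with DNE when an account has fewer than 10 tweets. The function name twitter_activity and the sample dates are hypothetical; the turkers read this date directly from the Twitter page rather than from any data feed.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates, n=10):
    """Return the date (MM/DD/YY) of the n-th most recent tweet, or DNE."""
    newest_first = sorted(tweet_dates, reverse=True)
    # Slicing yields an empty list when there are fewer than n tweets
    nth = newest_first[n - 1:n]
    if not nth:
        return "DNE"
    return nth[0].strftime("%m/%d/%y")


# Hypothetical account with one tweet per day from 7/1/16 to 7/12/16
sample = [date(2016, 7, day) for day in range(1, 13)]
print(twitter_activity(sample))
```

An older date means the account takes longer to produce 10 tweets, so the single date acts as a rough proxy for tweet frequency.&lt;br /&gt;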
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions, etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even when we do so, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
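The keyword check in the Nonprofit steps above can be sketched in Python as follows, assuming the text of the 'About'/'Mission' page has already been copied into a string. The function name nonprofit_flag and the simple punctuation-based sentence splitting are assumptions made for illustration; also returning the matching sentences is an addition that makes the 1/0 mark easier to double check.&lt;br /&gt;

```python
import re

# Matches 'nonprofit' and 'non-profit' in any letter case
NONPROFIT = re.compile(r"non-?profit", re.IGNORECASE)


def nonprofit_flag(page_text):
    """Return (1 or 0, list of sentences that contain the key word)."""
    # Rough sentence split on ., ! and ? (an assumption, not a full parser)
    sentences = re.findall(r"[^.!?]+[.!?]?", page_text)
    hits = [s.strip() for s in sentences if NONPROFIT.search(s)]
    return (1 if hits else 0), hits


print(nonprofit_flag("We are a non-profit hub. Join us today."))
```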
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for the description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link or mention of 'Sponsors' or 'Partners', which is often under the 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7209</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7209"/>
		<updated>2016-07-20T16:12:05Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
Most of the variables fall into one of the following groups:&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
For certain variables, the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither very easy nor very difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that it contains several similar variables, which would be difficult for a turk to differentiate.  To fix this, we will need to create filters akin to the DSM5 scorecard.  See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, is always right and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
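As a rough illustration of the accuracy-versus-coverage idea described here, the sketch below scores a turk task against manually checked answers. The function name, the use of 'DNE' for firms the task could not answer, and the sample values are all assumptions made for illustration; none of them come from the project's data.&lt;br /&gt;

```python
def accuracy_and_coverage(turk_answers, manual_answers):
    # turk_answers maps firm name to 1, 0, or 'DNE' (task gave no answer).
    # manual_answers maps firm name to the manually verified 1 or 0.
    answered = [firm for firm, answer in turk_answers.items() if answer != 'DNE']
    if not answered:
        return None, 0.0
    correct = sum(1 for firm in answered if turk_answers[firm] == manual_answers[firm])
    # Accuracy is measured only on answered firms; coverage is the share answered.
    return correct / len(answered), len(answered) / len(turk_answers)


# Hypothetical sample: the task answers for 2 of 5 firms and is right both times.
turk = {'A': 1, 'B': 'DNE', 'C': 'DNE', 'D': 0, 'E': 'DNE'}
manual = {'A': 1, 'B': 1, 'C': 0, 'D': 0, 'E': 1}
accuracy, coverage = accuracy_and_coverage(turk, manual)
```

A task of the kind preferred here would show accuracy at or near 1.0 even when coverage is low, leaving the uncovered firms for manual collection.&lt;br /&gt;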
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session, not a one-time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of set start and end dates that last XXX long&lt;br /&gt;
**Is a session that occurs regularly and is linked to others&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not in the same way we are recording it&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's mechanical turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds; the recommended pay rate is therefore $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds; the recommended pay rate is therefore $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on the first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from the company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds; the recommended pay rate is therefore $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1, otherwise 0, for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
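The audit figures reported for Twitter Activity in this group (how often all 3 turkers, or at least 2 turkers, matched our manual result) could be computed along the lines of the sketch below. The function name, the data layout, and the sample handles are hypothetical and are not taken from the actual audit.&lt;br /&gt;

```python
def agreement_rates(manual, turk_responses):
    # manual maps company to our manually recorded value.
    # turk_responses maps company to the list of values recorded by the turkers.
    all_agree = 0
    two_or_more_agree = 0
    for company, truth in manual.items():
        matches = sum(1 for answer in turk_responses[company] if answer == truth)
        if matches == len(turk_responses[company]):
            all_agree += 1
        if matches not in (0, 1):
            two_or_more_agree += 1
    total = len(manual)
    return all_agree / total, two_or_more_agree / total


# Hypothetical sample of four companies with three turkers each.
manual_handles = {'Hub A': '@hub_a', 'Hub B': '@hub_b', 'Hub C': '@hub_c', 'Hub D': '@hub_d'}
turk_handles = {
    'Hub A': ['@hub_a', '@hub_a', '@hub_a'],
    'Hub B': ['@hub_b', '@hub_b', '@hub_b_nyc'],
    'Hub C': ['@hub_c', 'DNE', '@other'],
    'Hub D': ['@hub_d', '@hub_d', '@hub_d'],
}
all_three, at_least_two = agreement_rates(manual_handles, turk_handles)
```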
&lt;br /&gt;
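The recording format used in the URL task (www.___.__/) can also be applied mechanically when auditing turk output. The following is a minimal sketch, assuming simple inputs with or without a scheme; the function name is hypothetical, and a subdomain such as blog.example.us would simply be kept after the www. prefix, which may not be the desired treatment.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def to_recorded_format(url):
    # Without a scheme, urlsplit treats the host as part of the path,
    # so prefix '//' to make it parse the host as a network location.
    if '://' not in url and not url.startswith('//'):
        url = '//' + url
    host = urlsplit(url).hostname  # lowercased host without any port
    if host.startswith('www.'):
        host = host[4:]
    return 'www.' + host + '/'


# The example from the URL task: example.us/other is recorded as www.example.us/
recorded = to_recorded_format('example.us/other')
```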
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence containing it in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3, recording the results in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
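The search texts described in the brainstorm note for Size could be generated per company with something like the sketch below. The function names and the sample company are hypothetical; the query wording simply follows the note (allintext: with a size term, restricted by site: to the company URL, plus a fallback of company name, city, and square feet).&lt;br /&gt;

```python
SIZE_TERMS = ['sqft', 'square foot', 'square feet']


def size_search_texts(company_url):
    # Search Text 1-3: one query per size term, limited to the company's own site.
    return ['allintext: {} site:{}'.format(term, company_url) for term in SIZE_TERMS]


def fallback_search_text(company_name, city):
    # Search 4 from the brainstorm: company name, city, and 'square feet'.
    return '{}, {}, square feet'.format(company_name, city)


# Hypothetical company used only to show the output shape.
queries = size_search_texts('www.example.us')
fallback = fallback_search_text('Example Hub', 'Houston')
```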
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence containing them into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session, not a one-time meeting but a full &amp;quot;course&amp;quot;, evidence of this could be&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of set start and end dates that last XXX long&lt;br /&gt;
**Cultivates leadership for entrepreneurs&lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session that occurs regularly and is linked to others&lt;br /&gt;
**Search Text: Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
**Search Text 1: website design, coding, web development, software, bootcamp&lt;br /&gt;
**Search Text 2: General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
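The last two Potential Turks steps (record 0 if nothing is found, otherwise record the sentence in which the search text appears) can be mimicked on already downloaded page text, for example when auditing turk output. This is only a sketch under assumptions: the function name and sample text are hypothetical, sentence splitting is naive, and plain substring matching means a term such as 'coding' would also match inside 'encoding'.&lt;br /&gt;

```python
import re

# Search Text 1 for Code School, as listed in the characteristics.
CODE_SCHOOL_TERMS = ['website design', 'coding', 'web development', 'software', 'bootcamp']


def first_matching_sentence(page_text, terms):
    # Split into rough sentences: runs of text ending at '.', '!' or '?'.
    sentences = re.findall(r'[^.!?]+[.!?]?', page_text)
    for sentence in sentences:
        lowered = sentence.lower()
        if any(term.lower() in lowered for term in terms):
            return sentence.strip()
    return 0  # mirrors 'Record 0 if no result returns'


# Hypothetical page text.
sample = 'We host founders. Our 12-week bootcamp teaches web development. Join us!'
found = first_matching_sentence(sample, CODE_SCHOOL_TERMS)
```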
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
*'''Desc''':&lt;br /&gt;
&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
&lt;br /&gt;
*'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
*'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: The previously collected data included the number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's mechanical turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
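The Twitter Activity rule (date of the 10th most recent tweet, DNE when there are fewer than 10) can be written down as a small sketch. The function name and the sample dates are hypothetical; how the tweet dates are obtained is left out, since the turkers read them off the page by hand.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates, n=10):
    # Newest first, so position n - 1 holds the n-th most recent tweet.
    ordered = sorted(tweet_dates, reverse=True)
    if len(ordered[:n]) != n:
        return 'DNE'  # fewer than n tweets on the account
    return ordered[n - 1].strftime('%m/%d/%y')


# Hypothetical account with one tweet per day from July 1 to July 12, 2016.
sample_dates = [date(2016, 7, day) for day in range(1, 13)]
activity = twitter_activity(sample_dates)
```

A more recent date indicates a more active account, which is what the variable is meant to capture.&lt;br /&gt;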
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions, etc. We do not want to count them separately since it is difficult to do so. Most companies put all events in the same section and do not put event types in the event titles. We have to look into the details of each event to find out its type, and even when we do, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable for a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
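The key word check in the Nonprofit steps could be applied to page text roughly as follows. This is a sketch under assumptions: the pattern and function name are hypothetical, and it deliberately does not match the bare word 'profit', since that would also flag for-profit spaces.&lt;br /&gt;

```python
import re

# Matches nonprofit, non-profit, non profit, and not-for-profit in any letter case.
NONPROFIT_PATTERN = re.compile(r'\bnon[- ]?profit\b|\bnot[- ]for[- ]profit\b', re.IGNORECASE)


def nonprofit_flag(page_text):
    # 1 if any nonprofit key word appears in the text, otherwise 0.
    return 1 if NONPROFIT_PATTERN.search(page_text) else 0


# Hypothetical 'About' page text.
flag = nonprofit_flag('We are a Non-Profit hub for local founders.')
```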
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7208</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7208"/>
		<updated>2016-07-20T16:11:14Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page documents the mechanical turk work for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics had been created. Many of these will not meet the eventual definition of a Hub. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
Most of the variables fall under one of a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
For certain variables, the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither very easy nor very difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description; develop the characteristics after looking over examples; create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but, whenever it reports an onsite mentor, is always right and never misspecifies the existence of mentors); and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates, with the course lasting XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Has a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a google automating procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Previously recorded as 2/1/0 to represent activity level, which is not the same measure we are collecting&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with 30 companies that were collected manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the observations when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds; the recommended pay rate is therefore $.04.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds; the recommended pay rate is therefore $.08.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds; the recommended pay rate is therefore $.04.&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
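&lt;br /&gt;
The two audit percentages reported for Twitter Activity above (the share of companies where all 3 turkers matched the manual result, and the share where at least 2 did) can be computed as in the following minimal sketch. Python, the function name agreement_rates, the dictionary layout, and the four-company sample are all assumptions made for illustration; they are not the project's actual data or results.&lt;br /&gt;

```python
def agreement_rates(manual, turker_answers):
    # manual: company name mapped to the manually collected value.
    # turker_answers: company name mapped to the list of turker values.
    # Both layouts are hypothetical, not the project's file format.
    all_match = 0
    two_or_more = 0
    for company, truth in manual.items():
        answers = turker_answers[company]
        # Count the turkers whose answer equals the manual value,
        # ignoring case and surrounding whitespace.
        hits = sum(1 for a in answers
                   if a.strip().lower() == truth.strip().lower())
        if hits == len(answers):
            all_match += 1
        if hits in range(2, len(answers) + 1):
            two_or_more += 1
    total = len(manual)
    return all_match / total, two_or_more / total


# Made-up sample: four companies, three turkers each.
manual = {'A': '@a', 'B': '@b', 'C': '@c', 'D': '@d'}
turkers = {
    'A': ['@a', '@a', '@a'],
    'B': ['@b', '@B', 'DNE'],
    'C': ['@c_nyc', '@c_sf', '@c'],
    'D': ['@d', '@d', '@d'],
}
print(agreement_rates(manual, turkers))
```

Company C in the sample mimics the exception noted in the audit, a company with several location-based handles, where only one turker matches the manual value.&lt;br /&gt;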
&lt;br /&gt;
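The recording format in the URL task above (www.___.__/) can be checked or reproduced with a short helper. This is a sketch only: Python and the function name normalize_url are assumptions, and always adding www. is a simplification that may be wrong for sites served from another subdomain.&lt;br /&gt;

```python
def normalize_url(url):
    # Drop the scheme (http://, https://), then everything after the host.
    host = url.split('://')[-1].split('/')[0].lower()
    # Add the www. prefix when it is missing (a simplifying assumption).
    if not host.startswith('www.'):
        host = 'www.' + host
    # Close with a slash, as in the task's example.
    return host + '/'


print(normalize_url('example.us/other'))
```

For the task's own example, example.us/other, the helper returns www.example.us/.&lt;br /&gt;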
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Search Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click first link in which result search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
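The search texts described in the Size brainstorm above could be generated per company along the following lines. This is a hedged sketch: Python, the function name size_search_texts, and the sample company are assumptions, and the exact query wording the project used is not recorded on this page.&lt;br /&gt;

```python
def size_search_texts(company, city, url):
    # Keep only the host so it can follow the site: operator.
    host = url.split('://')[-1].split('/')[0]
    terms = ['sqft', 'square foot', 'square feet']
    # Searches 1-3: one size term each, restricted to the company's site.
    queries = ['allintext: ' + t + ' site:' + host for t in terms]
    # Search 4: company name, city and the phrase, with no site restriction.
    queries.append(company + ', ' + city + ', square feet')
    return queries


# Hypothetical company, used only to show the output shape.
for q in size_search_texts('Example Hub', 'Austin', 'http://www.example.us/other'):
    print(q)
```

Generating the queries up front also makes it possible to run them in bulk first and send only the companies with non-empty results to turkers.&lt;br /&gt;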
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if reliable site is populated, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
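The keyword check in Mentors CODE 1 above (record BINARY, SENTENCE and PAGE) can be sketched for a page whose text has already been retrieved. Python, the function name mentor_check, the crude sentence splitting, and the sample text are assumptions for illustration; fetching the page and choosing which links to follow are left out.&lt;br /&gt;

```python
import re

# Matches mentor, mentors, mentorship and mentoring, ignoring case.
MENTOR_PATTERN = re.compile(r'\bmentor(s|ship|ing)?\b', re.IGNORECASE)


def mentor_check(page_text, page_name):
    # Rough sentence split on ., ! and ?; enough for a sketch.
    sentences = re.split(r'[.!?]+\s*', page_text)
    hits = [s.strip() for s in sentences if MENTOR_PATTERN.search(s)]
    if hits:
        # Record the first matching sentence and the page it came from.
        return {'BINARY': 1, 'SENTENCE': hits[0], 'PAGE': page_name}
    return {'BINARY': 0, 'SENTENCE': '', 'PAGE': ''}


# Made-up page text.
sample = 'We are a coworking space. Members get weekly mentoring sessions. Join us!'
print(mentor_check(sample, 'About'))
```

A check like this only finds explicit mentions, so a result of 0 would still need the later steps (membership benefits pages, or CODE 2) before being recorded.&lt;br /&gt;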
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates, with the course lasting XXX long&lt;br /&gt;
**Cultivates leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
**Search Text: Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
**Search Text 1: website design, coding, web development, software, bootcamp&lt;br /&gt;
**Search Text 2: General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
'''Desc''':&lt;br /&gt;
&lt;br /&gt;
'''Characteristics''':&lt;br /&gt;
&lt;br /&gt;
'''TBD Points''':&lt;br /&gt;
&lt;br /&gt;
'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this, and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size as getting physical sqfootage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the observations when using 10, 20, or 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions, etc. We do not count them separately because it is difficult to do so. Most companies list all events in the same section and do not put event types in the event titles. We would have to look into the details of each event to find its type, and even then some event descriptions do not allow us to determine the type easily. Differentiating event types demands more time and effort and is therefore not suitable for a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link or mention of 'Sponsors' or 'Partners', which is often under the 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7207</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7207"/>
		<updated>2016-07-20T16:10:50Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Onsite OH Investors v. mentors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
Most of the variables fall into one of a few categories:&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
For certain variables, the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither very easy nor very difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that identifies onsite mentors for only 40% of firms, but never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not the same measure as we are using&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 companies that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost to either record the number of tweets in a month or look up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data between using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#*#Record the company's Twitter handle in Twitter Handle.&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1, otherwise 0, for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
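The URL recording rule in the URL code above (record www.___.__/ regardless of the path) can be sketched in Python. This is only an illustrative sketch: the helper name normalize_url is hypothetical, and returning DNE for an empty input is an assumption that is not part of the turk instructions.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def normalize_url(raw):
    """Reduce any URL to the www.___.__/ format used for the URL variable."""
    raw = raw.strip()
    # urlsplit only finds the host when a scheme is present, so add one if missing.
    if "://" not in raw:
        raw = "http://" + raw
    host = urlsplit(raw).hostname or ""
    if host.startswith("www."):
        host = host[4:]
    # Assumption: an empty or unparseable input is recorded as DNE.
    return "www." + host + "/" if host else "DNE"


print(normalize_url("example.us/other"))
```

With the example from the instructions, example.us/other is recorded as www.example.us/.&lt;br /&gt;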
&lt;br /&gt;
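The two agreement rates quoted in the Twitter Activity audit (all 3 turkers agree with our value, and at least 2 turkers agree) could be computed along the following lines. This is a minimal Python sketch; the function name and the sample handles are hypothetical, and the tooling actually used for the audit is not recorded on this page.&lt;br /&gt;

```python
def agreement_rates(audit_rows):
    """Return (share where every turker matches ours, share where at least 2 match).

    audit_rows: non-empty list of (our_value, [turker answers]) pairs, one per company.
    """
    all_agree = 0
    two_agree = 0
    for ours, answers in audit_rows:
        # Compare case-insensitively and ignore surrounding spaces.
        target = ours.strip().lower()
        matches = sum(1 for a in answers if a.strip().lower() == target)
        if matches == len(answers):
            all_agree += 1
        if matches >= 2:
            two_agree += 1
    total = len(audit_rows)
    return all_agree / total, two_agree / total


# Hypothetical sample: our manually collected handle vs. three turker answers.
sample = [
    ("@hubone", ["@hubone", "@HubOne", "@hubone"]),
    ("@hubtwo", ["@hubtwo", "@hubtwo", "DNE"]),
    ("@hubthree", ["@other", "DNE", "@hubthree"]),
    ("@hubfour", ["@hubfour", "@hubfour", "@hubfour"]),
]
print(agreement_rates(sample))
```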
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click first link in which result search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
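The search texts described in the Size brainstorm could be generated per company rather than typed by hand. The Python sketch below is illustrative only: the function name, the SEARCH 4 label for the fallback query, and the example company are hypothetical, and the exact wording of the search texts given to turkers is not recorded here.&lt;br /&gt;

```python
SIZE_TERMS = ["sqft", "square foot", "square feet"]


def size_search_texts(company, city, url):
    """Build Search Text 1-3 (restricted to the company site) plus a fallback query."""
    texts = {}
    for number, term in enumerate(SIZE_TERMS, start=1):
        texts["SEARCH " + str(number)] = "allintext: " + term + " site:" + url
    # Fallback from the brainstorm: company name, city, square feet.
    # Assumption: it is labelled SEARCH 4 here purely for illustration.
    texts["SEARCH 4"] = company + " " + city + " square feet"
    return texts


for label, text in size_search_texts("Example Hub", "Austin", "www.example.us").items():
    print(label + ": " + text)
```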
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence they appear in into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence they appear in into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of a set fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
**Search Text: Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
**Search Text 1: website design, coding, web development, software, bootcamp&lt;br /&gt;
**Search Text 2: General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click first link in which result search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
'''Desc''':&lt;br /&gt;
'''Characteristics''':&lt;br /&gt;
'''TBD Points''':&lt;br /&gt;
'''Potential Turks''':&lt;br /&gt;
&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost to either record the number of tweets in a month or look up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data between using 10, 20, and 30 tweets.&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#Record the company's Twitter handle in Twitter Handle.&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
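The Twitter Activity rule above (date of the 10th most recent tweet, or DNE when the account has too few tweets) can be expressed as a short Python sketch. The function name and the sample dates are hypothetical; the turkers apply this rule by hand on the Twitter page.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates, n=10):
    """Return the date of the n-th most recent tweet as MM/DD/YY, or "DNE"."""
    # Fewer than n tweets means there is no n-th tweet to record.
    if len(tweet_dates) in range(n):
        return "DNE"
    newest_first = sorted(tweet_dates, reverse=True)
    return newest_first[n - 1].strftime("%m/%d/%y")


# Hypothetical account with one tweet per day on July 1-12, 2016.
sample_dates = [date(2016, 7, day) for day in range(1, 13)]
print(twitter_activity(sample_dates))
```

A more recent date means a more active account, since less time was needed to accumulate 10 tweets.&lt;br /&gt;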
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even when we do so, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and therefore is not suitable for a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
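The keyword step of the Nonprofit procedure above can be illustrated with a small Python sketch. It assumes the text of the 'About'/'Our Story'/'Mission' page has already been copied into a string; the function name and the sample text are hypothetical. Returning the matching sentences mirrors the SENTENCES output of the newer Group 1 code and is not part of this older procedure.&lt;br /&gt;

```python
import re

# Matches 'nonprofit' and 'non-profit' as whole words, in any letter case.
KEYWORD = re.compile(r"\bnon-?profit\b", re.IGNORECASE)


def nonprofit_check(page_text):
    """Return (1, matching sentences) if the key word appears, else (0, [])."""
    # Rough sentence split on ., ! and ? (good enough for a sketch).
    sentences = [s.strip() for s in re.findall(r"[^.!?]+[.!?]*", page_text)]
    hits = [s for s in sentences if KEYWORD.search(s)]
    return (1 if hits else 0), hits


sample_text = "Example Hub is a non-profit coworking space. We host events weekly."
print(nonprofit_check(sample_text))
```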
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7206</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7206"/>
		<updated>2016-07-20T16:08:49Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Curriculum and Code School */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
Most of the variables fall under a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
For certain variables, the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that aren't too easy or difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that it contains several similar variables, which would be difficult for a turk to differentiate.  To fix this, we will need to create filters akin to the DSM5 scorecard.  See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description; develop the characteristics after looking over examples; create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that identifies onsite mentors for only 40% of firms, but never misspecifies the existence of mentors); and audit the results.&lt;br /&gt;
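As a rough illustration of the accuracy-versus-comprehensiveness trade-off described above, the following Python sketch scores a turk task against manually collected values. The function name, the use of DNE for firms the task does not answer, and the sample data are all assumptions made for illustration, not part of the project's actual tooling.&lt;br /&gt;

```python
def coverage_and_accuracy(turk, manual):
    """Return (coverage, accuracy) for turk answers keyed by firm name.

    coverage: share of firms for which the turk returned an answer (not "DNE").
    accuracy: share of those answers that match the manual value.
    """
    answered = [firm for firm in manual if turk.get(firm, "DNE") != "DNE"]
    coverage = len(answered) / len(manual) if manual else 0.0
    correct = sum(1 for firm in answered if turk[firm] == manual[firm])
    accuracy = correct / len(answered) if answered else 0.0
    return coverage, accuracy


# Hypothetical data: the task answers for 2 of 5 firms and is right on both.
manual = {"A": 1, "B": 0, "C": 1, "D": 1, "E": 0}
turk = {"A": 1, "B": "DNE", "C": 1, "D": "DNE", "E": "DNE"}
print(coverage_and_accuracy(turk, manual))
```

With this made-up data the task covers 40% of firms and is correct on every firm it answers, which is the kind of turk the procedure is aiming for.&lt;br /&gt;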
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates, with the program lasting XXX long&lt;br /&gt;
**Is a regularly occurring session linked to others&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal&lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded as 2/1/0 to represent activity level, which is not the same measure we are collecting&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 companies we collected manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#*#Record the company's Twitter handle in Twitter Handle.&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet in Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on the first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) of the company's website (see Company's URL).&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2.&lt;br /&gt;
#*#If this does not exist, go to Company's URL.&lt;br /&gt;
#*#Record the main text on the page, up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If the main text in the prior step cannot be clearly located, record &amp;quot;Unclear&amp;quot;.&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04.&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1, otherwise 0, for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
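The output format for the URL task in Group 1 (www.___.__/) can be sketched in Python as follows. This is a minimal sketch, not the project's code: the function name is hypothetical, returning DNE for an empty input is an assumption, and any subdomain other than www is kept as part of the host.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def normalize_url(raw):
    """Reduce a URL to the www.___.__/ format used in the URL task."""
    raw = raw.strip()
    # urlsplit only fills in the host when a scheme separator is present.
    if "://" not in raw:
        raw = "http://" + raw
    host = urlsplit(raw).hostname or ""
    if host.startswith("www."):
        host = host[len("www."):]
    # Assumed convention: report DNE when no host can be extracted.
    return "www." + host + "/" if host else "DNE"


print(normalize_url("example.us/other"))
```

For the example given in the task, example.us/other becomes www.example.us/.&lt;br /&gt;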
&lt;br /&gt;
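The two agreement figures reported in the Twitter Activity audit (all turkers agree with our value, and at least 2 turkers agree) could be computed along these lines. The function name, the data layout, and the sample handles are assumptions for illustration; the actual audit data is not reproduced here.&lt;br /&gt;

```python
def agreement_rates(manual, turk_answers):
    """Return (share where all turkers match us, share where at least 2 do).

    manual: dict mapping company to our manually collected value.
    turk_answers: dict mapping company to the list of turker answers.
    """
    all_agree = 0
    two_agree = 0
    for company, truth in manual.items():
        answers = turk_answers[company]
        matches = sum(1 for answer in answers if answer == truth)
        if matches == len(answers):
            all_agree += 1
        if matches >= 2:
            two_agree += 1
    total = len(manual)
    return all_agree / total, two_agree / total


# Hypothetical handles for four companies, three turkers each.
manual = {"A": "@a", "B": "@b", "C": "@c", "D": "@d"}
turk_answers = {
    "A": ["@a", "@a", "@a"],
    "B": ["@b", "@b", "@x"],
    "C": ["@c", "@y", "@z"],
    "D": ["@d", "@d", "@d"],
}
print(agreement_rates(manual, turk_answers))
```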
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
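The four Size search texts from the 7/19 brainstorm could be generated per company with a small helper like the one below. This is only a sketch: the function name, the exact spacing of the search operators, and the example company and URL are assumptions.&lt;br /&gt;

```python
SIZE_TERMS = ["sqft", "square foot", "square feet"]


def size_search_texts(company, city, url):
    """Build the four Size search texts described in the brainstorm."""
    # Searches 1-3: one site-restricted allintext search per size term.
    texts = ["allintext: " + term + " site:" + url for term in SIZE_TERMS]
    # Search 4: company name, city, and the phrase square feet.
    texts.append(company + ", " + city + ", square feet")
    return texts


for text in size_search_texts("Example Hub", "Austin", "www.example.com/"):
    print(text)
```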
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record urlhome in PAGE.&lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring.&lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record the name of the link clicked in PAGE.&lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
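If page text were available programmatically, the keyword check in Mentors CODE 1 of 2 might look like the sketch below, which fills the BINARY, SENTENCE, and PAGE outputs. The function name, the naive sentence splitting, and the sample pages are assumptions; a turker following the steps by hand would exercise more judgment than this.&lt;br /&gt;

```python
import re

# Matches mentor, mentors, mentorship, and mentoring as whole words.
MENTOR_PATTERN = re.compile(r"\bmentor(s|ship|ing)?\b", re.IGNORECASE)


def find_mentor_mention(pages):
    """Return (BINARY, SENTENCE, PAGE) for the first mention of mentoring.

    pages: list of (page_name, page_text) tuples in the order to check them.
    """
    for page_name, text in pages:
        # Naive sentence split on ., ! or ? followed by whitespace.
        for sentence in re.split(r"[.!?]+\s+", text):
            if MENTOR_PATTERN.search(sentence):
                return 1, sentence.strip(), page_name
    return 0, "", ""


# Hypothetical page text, checked in the order given in the steps.
pages = [
    ("urlhome", "We are a coworking space. Join us today!"),
    ("About", "Our team is small. Members get weekly mentoring sessions with founders."),
]
print(find_mentor_mention(pages))
```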
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates, with the program lasting XXX long&lt;br /&gt;
**Cultivates leadership for entrepreneurs&lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a regularly occurring session linked to others&lt;br /&gt;
**Search Text: Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
**Search Text 1: website design, coding, web development, software, bootcamp&lt;br /&gt;
**Search Text 2: General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#Record the company's Twitter handle in Twitter Handle.&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet in Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
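The Twitter Activity output (the date of the 10th most recent tweet, or DNE when there are fewer than 10 tweets) can be sketched in Python as below. The function name and the input, a list of tweet dates already gathered by some other means, are assumptions for illustration.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates):
    """Return the date (MM/DD/YY) of the 10th most recent tweet, or "DNE"."""
    if len(tweet_dates) >= 10:
        # Sort newest first, then take the 10th entry.
        tenth = sorted(tweet_dates, reverse=True)[9]
        return tenth.strftime("%m/%d/%y")
    return "DNE"


# Hypothetical account with one tweet per day from July 1 to July 12, 2016.
dates = [date(2016, 7, day) for day in range(1, 13)]
print(twitter_activity(dates))
```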
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
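The final counting step above could be expressed as the following Python sketch, assuming the event dates have already been read off the website. The function name and the convention of passing None when the site has no event information are assumptions.&lt;br /&gt;

```python
from datetime import date


def count_events(event_dates, year=2016, month=7):
    """Count events that fall in the given month, or return "DNE".

    event_dates: list of datetime.date objects, or None when the website
    has no event information (an assumed convention).
    """
    if event_dates is None:
        return "DNE"
    return sum(1 for d in event_dates if (d.year, d.month) == (year, month))


# Hypothetical event calendar spanning June to August 2016.
events = [date(2016, 6, 30), date(2016, 7, 5), date(2016, 7, 19), date(2016, 8, 1)]
print(count_events(events))
```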
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions, etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We would have to look into the details of each event to find out its type, and even then some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable for a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7204</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7204"/>
		<updated>2016-07-20T15:55:42Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Curriculum and Code School */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
Most of the variables fall into one of a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
Variables for which the information is not readily available online or is otherwise difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither very easy nor very difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that it contains several similar variables, which would be difficult for a turk to differentiate.  To fix this, we will need to create filters akin to the DSM5 scorecard.  See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turk tasks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms but never misspecifies the existence of onsite mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates, with the course lasting XXX&lt;br /&gt;
**Is a regularly occurring session that is linked to others&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': A discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Has a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow the steps for Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, which is not the same measure we are collecting&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 that we did manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several Twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's mechanical turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
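The Twitter handle audit in Group 1 compares three turker answers per company against our manual answer. A minimal Python sketch of how the two agreement rates (all 3 turkers agree with us, at least 2 agree with us) could be computed is below. The function name and the sample records are hypothetical and are not the audit data.&lt;br /&gt;
&lt;pre&gt;
```python
def agreement_rates(records):
    """Return (share where all 3 turkers match, share where at least 2 match).

    Each record is (manual_answer, [turker_1, turker_2, turker_3]).
    Hypothetical helper for illustration; comparison ignores case and spaces.
    """
    def norm(value):
        return value.strip().lower()

    total = len(records)
    if total == 0:
        return 0.0, 0.0
    all_three = 0
    two_or_more = 0
    for manual, turkers in records:
        matches = sum(1 for answer in turkers if norm(answer) == norm(manual))
        if matches == 3:
            all_three += 1
        if matches in (2, 3):
            two_or_more += 1
    return all_three / total, two_or_more / total


# Made-up sample, not the real audit data.
sample = [
    ('@hub_one', ['@hub_one', '@Hub_One', '@hub_one']),
    ('@hub_two', ['@hub_two', '@hub_two', 'DNE']),
    ('@hub_three', ['@other', 'DNE', '@hub_three']),
    ('@hub_four', ['@hub_four', '@hub_four', '@hub_four']),
]
print(agreement_rates(sample))
```
&lt;/pre&gt;
&lt;br /&gt;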
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet, and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
&lt;br /&gt;
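The Size brainstorm above describes four searches per company. A short Python sketch of how those Search Text strings might be generated in bulk follows. The function name, the exact spacing of the query, and the placeholder URL are assumptions for illustration.&lt;br /&gt;
&lt;pre&gt;
```python
SIZE_TERMS = ['sqft', 'square foot', 'square feet']


def size_search_texts(company, city, url):
    """Build the four Search Text strings from the Size brainstorm.

    Searches 1-3 restrict an allintext query to the company's site;
    search 4 is a plain query on company name, city and 'square feet'.
    """
    # Drop the scheme and any leading or trailing slashes from the URL.
    site = url.replace('http://', '').replace('https://', '').strip('/')
    queries = ['allintext: {} site:{}'.format(term, site) for term in SIZE_TERMS]
    queries.append('{}, {}, square feet'.format(company, city))
    return queries


# Company and city come from the audit list; the URL is a placeholder.
for text in size_search_texts('Capital Factory', 'Austin', 'www.example.com/'):
    print(text)
```
&lt;/pre&gt;
&lt;br /&gt;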
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership, such as 'benefits,' 'perks,' or similar, and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a link to a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot;The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates, with the course lasting XXX&lt;br /&gt;
**Cultivates leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a regularly occurring session that is linked to others&lt;br /&gt;
**Search Text: Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
**Search Text 1: website design, coding, web development, software&lt;br /&gt;
**Search Text 2: General Assembly, Anyone Can Learn to Code, Umbraco, Designation, Boise CodeWorks, Grand Circus, DevMountain, Silicon Valley Data Academy, Academy Pittsburgh&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result returns.&lt;br /&gt;
#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears.&lt;br /&gt;
&lt;br /&gt;
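The first Potential Turks step for Curriculum and Code School is a quoted, site-restricted search. The Python sketch below shows one way the queries could be prepared ahead of time. It assumes each comma-separated term in a Search Text is searched separately, which the steps above do not confirm; the function name and the URL are hypothetical.&lt;br /&gt;
&lt;pre&gt;
```python
def site_queries(search_text, url):
    """Turn a comma-separated Search Text into one quoted site query per term."""
    terms = [term.strip() for term in search_text.split(',') if term.strip()]
    return ['"{}" site:{}'.format(term, url) for term in terms]


# Search Text 1 for Code School, with a placeholder URL.
for query in site_queries('website design, coding, web development, software',
                          'www.example.com'):
    print(query)
```
&lt;/pre&gt;
&lt;br /&gt;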
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected data included the number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this, and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''':&lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's mechanical turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, and 30 tweets.&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even when we do so, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
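The key word check in the Nonprofit steps above could also be sketched in code once the text of the 'About' or 'Mission' page has been copied. The Python below is a rough illustration, not the turk task itself. The function name is hypothetical, the pattern also accepts 'not-for-profit', and it does not flag the bare word 'profit'.&lt;br /&gt;
&lt;pre&gt;
```python
import re

# Matches nonprofit, non-profit, non profit and not-for-profit, in any case.
NONPROFIT = re.compile(r'\bnon[- ]?profit\b|\bnot[- ]for[- ]profit\b',
                       re.IGNORECASE)


def nonprofit_check(page_text):
    """Return (1 or 0, list of sentences that contain a nonprofit key word)."""
    # Rough sentence split on ., ! or ? followed by whitespace.
    sentences = re.split(r'[.!?]\s+', page_text)
    hits = [s.strip() for s in sentences if NONPROFIT.search(s)]
    return (1 if hits else 0), hits


# Made-up page text for illustration.
print(nonprofit_check('Example Hub is a non-profit organization. It hosts weekly events.'))
```
&lt;/pre&gt;
&lt;br /&gt;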
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7198</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7198"/>
		<updated>2016-07-20T15:43:09Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Curriculum and Code School */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
The majority of the variables fall under a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
Variables in this group have information that is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither very easy nor very difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that it contains several similar variables, which would be difficult for a turk to differentiate.  To fix this, we will need to create filters akin to the DSM5 scorecard.  See the section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms, but that always correctly identifies an onsite mentor where it reports one and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
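&lt;br /&gt;
The trade-off described above, where a task may cover few firms as long as it is never wrong about the ones it flags, can be sketched in Python. This is only an illustration under assumptions: the function name turk_precision_and_coverage, the use of None for firms the task could not classify, and the sample data are hypothetical and do not come from the project.&lt;br /&gt;

```python
def turk_precision_and_coverage(turk_flags, truth_flags):
    # turk_flags: list of 1, 0, or None (None = the task gave no answer).
    # truth_flags: list of 1 or 0 from a manual audit, in the same order.
    answered = [(t, y) for t, y in zip(turk_flags, truth_flags) if t is not None]
    positives = [(t, y) for t, y in answered if t == 1]
    # Precision: of the firms the task marked 1, the share that really are 1.
    precision = sum(y for _, y in positives) / len(positives) if positives else None
    # Coverage: share of all firms for which the task produced any answer.
    coverage = len(answered) / len(turk_flags) if turk_flags else None
    return precision, coverage


# Hypothetical audit of 10 firms: the task answers for only 4 of them,
# but every mentor it reports is confirmed by the manual check.
turk = [1, 1, 1, 1, None, None, None, None, None, None]
truth = [1, 1, 1, 1, 1, 0, 1, 0, 0, 1]
precision, coverage = turk_precision_and_coverage(turk, truth)
print(precision, coverage)
```

A task with this profile would be acceptable under the approach above: the firms it leaves unanswered would still need to be collected some other way.&lt;br /&gt;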
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on specific subjects&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Have a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded 2/1/0 to represent activity level, but not the same measure as we are recording&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, and 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written; expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in  the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record as 1, otherwise 0 for EXISTS (1/0).&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
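&lt;br /&gt;
The two agreement rates reported under Audit Results for Twitter Activity (all 3 turkers matching our result, and at least 2 of 3 matching) can be computed along the following lines. This is a sketch, not the procedure actually used: the function name agreement_rates, the data layout, and the sample handles are hypothetical.&lt;br /&gt;

```python
def agreement_rates(manual, turk_answers):
    # manual: dict mapping company to our manually recorded value.
    # turk_answers: dict mapping company to the list of 3 turker answers.
    # Returns (share where all 3 match, share where at least 2 match).
    all_three = 0
    at_least_two = 0
    for company, expected in manual.items():
        matches = sum(1 for answer in turk_answers[company] if answer == expected)
        if matches == 3:
            all_three += 1
        if matches in (2, 3):
            at_least_two += 1
    total = len(manual)
    return all_three / total, at_least_two / total


# Hypothetical sample of 4 companies (the handles are made up).
manual = {'A': '@a_hub', 'B': '@b_hub', 'C': '@c_hub', 'D': '@d_hub'}
turk_answers = {
    'A': ['@a_hub', '@a_hub', '@a_hub'],
    'B': ['@b_hub', '@b_hub', 'DNE'],
    'C': ['@c_hub', '@c_hub', '@c_hub'],
    'D': ['@d_nyc', '@d_sf', '@d_hub'],
}
all_three_rate, two_plus_rate = agreement_rates(manual, turk_answers)
print(all_three_rate, two_plus_rate)
```

Company D in the sample mimics the exception noted in the audit, where a company with several location-based handles produces disagreement among turkers.&lt;br /&gt;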
&lt;br /&gt;
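The URL format requested in the URL code above (www.___.__/) could also be applied automatically when cleaning turk output. The sketch below is one possible way to do it with Python's standard library; the function name normalize_url is hypothetical, and it assumes that subdomains other than www should be kept as part of the host.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def normalize_url(raw):
    # Add a scheme if it is missing so urlsplit can locate the host.
    if '://' not in raw:
        raw = 'http://' + raw
    host = urlsplit(raw).netloc.lower()
    # Drop an existing www. prefix so it is not doubled.
    if host.startswith('www.'):
        host = host[4:]
    return 'www.' + host + '/'


print(normalize_url('example.us/other'))
print(normalize_url('https://www.example.us/about/team'))
```

Both calls give www.example.us/, which matches the example in the turk instructions.&lt;br /&gt;
&lt;br /&gt;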
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose first result.  Process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
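&lt;br /&gt;
The search texts described in the Size brainstorm (three site-restricted allintext queries plus one general query) could be generated per company rather than typed by hand. A minimal sketch, assuming a hypothetical helper named size_search_texts and made-up company details:&lt;br /&gt;

```python
def size_search_texts(company, city, url):
    terms = ['sqft', 'square foot', 'square feet']
    # Search Text 1-3: one site-restricted query per spelling.
    queries = ['allintext: ' + term + ' site:' + url for term in terms]
    # Search Text 4: a general query by company name and city.
    queries.append(company + ', ' + city + ', square feet')
    return queries


# Example Hub and its URL are placeholders, not a real entry in the list.
for query in size_search_texts('Example Hub', 'Austin', 'www.example.us'):
    print(query)
```

Generating the queries up front would also make it easier to run them in bulk first and drop the ones that return 0 results before publishing the task.&lt;br /&gt;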
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session, not a one time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivate leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
**Search Text: Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
#Google: &amp;quot;Search Text&amp;quot; site:URL&lt;br /&gt;
#Record 0 if no result appears&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group are the developers or people who want to join the startups but not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
**Key words: website design, coding, web development, software&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**Google &amp;quot;General Assembly&amp;quot; site:URL&lt;br /&gt;
***Anyone Can Learn to Code&lt;br /&gt;
***Umbraco&lt;br /&gt;
***Designation&lt;br /&gt;
***Boise CodeWorks&lt;br /&gt;
***Grand Circus&lt;br /&gt;
***DevMountain&lt;br /&gt;
***Silicon Valley Data Academy&lt;br /&gt;
***Academy Pittsburgh&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, and 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
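&lt;br /&gt;
The Twitter Activity rule above (record the date of the 10th tweet counting back from the most recent, or DNE when the account has too few tweets) can be written as a short Python function. This is a sketch for checking turk output, assuming the tweet dates have already been collected some other way; the function name twitter_activity and the sample account are hypothetical.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates):
    # tweet_dates: list of datetime.date objects, in any order.
    newest_first = sorted(tweet_dates, reverse=True)
    try:
        # Index 9 is the 10th most recent tweet.
        tenth = newest_first[9]
    except IndexError:
        # Too few tweets: follow the instructions and record DNE.
        return 'DNE'
    # Format as MM/DD/YY, as the turk instructions request.
    return tenth.strftime('%m/%d/%y')


# Hypothetical account that tweeted once a day from July 1 to July 12, 2016.
dates = [date(2016, 7, day) for day in range(1, 13)]
print(twitter_activity(dates))
print(twitter_activity(dates[:5]))
```

An older date means a less active account, so the recorded date works as a rough activity measure without counting tweets per month.&lt;br /&gt;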
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even when we do so, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do the same process as at the end of steps 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
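&lt;br /&gt;
The key word check in the last two steps is mechanical enough to express as a regular expression. The sketch below is an assumption about how such a check could be automated once the page text is available; it is not something the turk task itself runs. The name nonprofit_flag is hypothetical, and the pattern also accepts not-for-profit, which appears in the key word list of the newer Nonprofit code.&lt;br /&gt;

```python
import re

# Matches nonprofit, non-profit, non profit, and not-for-profit, in any case.
NONPROFIT_PATTERN = re.compile(
    r'\bnon[- ]?profit\b|\bnot[- ]for[- ]profit\b', re.IGNORECASE
)


def nonprofit_flag(page_text):
    # Returns 1 if any key word appears in the text, otherwise 0.
    return 1 if NONPROFIT_PATTERN.search(page_text) else 0


print(nonprofit_flag('We are a 501(c)(3) non-profit serving founders.'))
print(nonprofit_flag('A coworking space for profitable startups.'))
```

A match only shows that the word appears on the page, so a result of 1 would still need the double check noted for this variable.&lt;br /&gt;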
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for the description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7196</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7196"/>
		<updated>2016-07-20T15:39:03Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Curriculum and Code School */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
The majority of the variables fall under a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
For certain variables, the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither very easy nor very difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
The key property of this group is that there are several similar variables, which would be difficult for a turk to differentiate.  In order to fix this,  we will need to create filters akin to the DSM5 scorecard.  See the below section.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turks that are completely accurate even if not comprehensive (e.g. a task that covers only 40% of firms, but that always correctly identifies an onsite mentor where it reports one and never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session that occurs regularly and is linked to others&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on specific subjects&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Has a topic/subject/goal &lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Previously recorded as 2/1/0 to represent activity level, which is not the same measure we are recording&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared with the 30 that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds, so the recommended pay rate is $.08 &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on the first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) from the company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written. Code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds, so the recommended pay rate is $.04 &lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1 for EXISTS (1/0); otherwise record 0.&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
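The two agreement rates reported in the Twitter Activity audit above (all 3 turkers match our result, and at least 2 of 3 match) could be computed along these lines. This is a sketch only: the function name and the sample handles are made up for illustration and are not the audited data.&lt;br /&gt;

```python
def agreement_rates(manual, turker_answers):
    """Return (share where all turkers match, share where at least 2 match).

    manual maps company -> our manually recorded value.
    turker_answers maps company -> list of the values the turkers recorded.
    """
    all_agree = 0
    two_agree = 0
    for company, truth in manual.items():
        answers = turker_answers[company]
        matches = sum(1 for answer in answers if answer == truth)
        if matches == len(answers):
            all_agree += 1
        if matches >= 2:
            two_agree += 1
    total = len(manual)
    return all_agree / total, two_agree / total


# Hypothetical audit sample of 4 companies with 3 turkers each.
manual = {"c1": "@one", "c2": "@two", "c3": "@three", "c4": "@four"}
turkers = {
    "c1": ["@one", "@one", "@one"],
    "c2": ["@two", "@two", "@2"],
    "c3": ["@three", "@3", "DNE"],
    "c4": ["@four", "@four", "@four"],
}
rates = agreement_rates(manual, turkers)
print(rates)
```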
&lt;br /&gt;
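The www.___.__/ format requested in the URL task above could also be enforced after collection with a small normalizer. This is a minimal sketch, assuming only the host name is kept; normalize_url is a hypothetical helper, and urlsplit comes from the Python standard library.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def normalize_url(raw):
    """Reduce a URL to the www.___.__/ form: host only, www. prefix, trailing slash."""
    raw = raw.strip()
    # urlsplit only fills in the host when a scheme separator is present.
    if "://" not in raw:
        raw = "http://" + raw
    host = urlsplit(raw).hostname or ""
    if host.startswith("www."):
        host = host[len("www."):]
    return "www." + host + "/"


print(normalize_url("example.us/other"))  # www.example.us/
```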
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm and code updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click first link in which result search text appears and record the sentence in which the text appears in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3, recording the results in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
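The search strings from the Size brainstorm above could be generated per company rather than typed by hand. This is a sketch under assumptions: the exact spacing of the allintext: and site: operators, the function name, and the sample company are illustrative and not taken from the project's files.&lt;br /&gt;

```python
SIZE_TERMS = ["sqft", "square foot", "square feet"]


def size_search_texts(company_url, company_name, city):
    """Build the four search strings described in the Size brainstorm."""
    # Searches 1-3: one allintext query per size term, restricted to the company site.
    queries = ["allintext: {} site:{}".format(term, company_url) for term in SIZE_TERMS]
    # Search 4: company name, city, and the phrase square feet.
    queries.append("{}, {}, square feet".format(company_name, city))
    return queries


for query in size_search_texts("www.example.us", "Example Hub", "Austin"):
    print(query)
```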
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence that contains them into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence that contains the mention into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership, such as 'benefits' or 'perks', and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
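The key word check in Mentors CODE 1 of 2 above could be pre-screened in code before any task is published. The sketch below assumes the page text has already been saved as plain text; the function name, the naive sentence splitting, and the sample text are illustrative assumptions, while BINARY, SENTENCE, and PAGE are the output names used in the steps.&lt;br /&gt;

```python
import re

# Matches mentor, mentors, mentorship, and mentoring, in any letter case.
MENTOR_PATTERN = re.compile(r"\bmentor(s|ship|ing)?\b", re.IGNORECASE)


def mentor_record(page_name, page_text):
    """Return the BINARY, SENTENCE, and PAGE outputs for one page of text."""
    # Naive sentence split: runs of text ending at ., ! or ?
    sentences = [s.strip() for s in re.findall(r"[^.!?]+[.!?]?", page_text)]
    for sentence in sentences:
        if MENTOR_PATTERN.search(sentence):
            return {"BINARY": 1, "SENTENCE": sentence, "PAGE": page_name}
    return {"BINARY": 0, "SENTENCE": "", "PAGE": ""}


text = "We are a coworking space. Members get weekly mentoring from founders. Join us!"
print(mentor_record("About", text))
```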
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence &lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivates leadership for entrepreneurs &lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session that occurs regularly and is linked to others&lt;br /&gt;
**Search Text: Fullbridge, leadership program, business academy, business course, aspiring entrepreneurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*&amp;quot;Potential Turks&amp;quot;&lt;br /&gt;
**Google &amp;quot;Fullbridge&amp;quot; site:URL&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
**Key words: website design, coding, web development, software&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*&amp;quot;Potential Turks&amp;quot;&lt;br /&gt;
**Google &amp;quot;General Assembly&amp;quot; site:URL&lt;br /&gt;
***Anyone Can Learn to Code&lt;br /&gt;
***Umbraco&lt;br /&gt;
***Designation&lt;br /&gt;
***Boise CodeWorks&lt;br /&gt;
***Grand Circus&lt;br /&gt;
***DevMountain&lt;br /&gt;
***Silicon Valley Data Academy&lt;br /&gt;
***Academy Pittsburgh&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: The previously collected data included the number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''help separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
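The Twitter Activity rule above (date of the 10th most recent tweet, DNE when there are fewer than 10) can be written as a short function for checking turk output. This is a sketch, assuming tweet dates are already available as Python date objects; the function name and the sample account are hypothetical.&lt;br /&gt;

```python
from datetime import date


def twitter_activity(tweet_dates, n=10):
    """Return the date of the n-th most recent tweet as MM/DD/YY, or DNE."""
    if len(tweet_dates) >= n:
        nth = sorted(tweet_dates, reverse=True)[n - 1]
        return nth.strftime("%m/%d/%y")
    return "DNE"


# Hypothetical account that tweeted once a day from 7/1/16 to 7/12/16.
dates = [date(2016, 7, day) for day in range(1, 13)]
print(twitter_activity(dates))  # 07/03/16
```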
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even when we do so, some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable for a mechanical turk project.''&lt;br /&gt;
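The count in the last step of the procedure above treats all event types alike, so it reduces to counting dates in July 2016. A minimal sketch, assuming the event dates have already been read off the site; the function name and the sample dates are hypothetical, and None stands for a site with no event information.&lt;br /&gt;

```python
from datetime import date


def count_events(event_dates, year=2016, month=7):
    """Count events in the given month; DNE when no event information exists."""
    if event_dates is None:
        return "DNE"
    return sum(1 for d in event_dates if d.year == year and d.month == month)


# Hypothetical calendar: one event in June, two in July, one in August.
events = [date(2016, 6, 30), date(2016, 7, 1), date(2016, 7, 15), date(2016, 8, 2)]
print(count_events(events))  # 2
```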
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
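The key word check in the Nonprofit steps above could be approximated with a regular expression. This is a sketch only: the pattern covers nonprofit, non-profit, and not-for-profit, and deliberately skips the bare word profit so that for-profit text is not flagged. The function name and the sample sentences are hypothetical.&lt;br /&gt;

```python
import re

# Matches nonprofit, non-profit, and not-for-profit, in any letter case.
NONPROFIT_PATTERN = re.compile(r"\b(non-?profit|not-for-profit)\b", re.IGNORECASE)


def nonprofit_flag(page_text):
    """Return 1 if a nonprofit key word appears in the text, otherwise 0."""
    return 1 if NONPROFIT_PATTERN.search(page_text) else 0


print(nonprofit_flag("We are a 501(c)(3) non-profit organization."))  # 1
print(nonprofit_flag("A for-profit coworking space."))  # 0
```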
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7147</id>
		<title>Hubs: Hubs Data</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Hubs:_Hubs_Data&amp;diff=7147"/>
		<updated>2016-07-19T20:41:05Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Actual WIP */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Background=&lt;br /&gt;
This page represents the work used for mechanical turks for the paper: [[Hubs (Academic Paper)]].  As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics. &lt;br /&gt;
&lt;br /&gt;
For more information on Mechanical Turks in general, see [[Mechanical Turk (Tool)]].&lt;br /&gt;
&lt;br /&gt;
The main goal of the mechanical turk is to automate the collection of variables for potential hubs as much as possible.  The key steps for the project are:&lt;br /&gt;
#Creating a '''comprehensive''' list of potential hubs&lt;br /&gt;
#Determining the best variables for the scorecard&lt;br /&gt;
#Building '''&amp;quot;filters&amp;quot;''' for automating the collection&lt;br /&gt;
#'''Running''' and '''auditing''' of the automation&lt;br /&gt;
#Collecting the remaining manual data&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=Variables to be Used=&lt;br /&gt;
==Current Complete List==&lt;br /&gt;
'''As of Week of 7/11'''&lt;br /&gt;
#Onsite Venture Capital&lt;br /&gt;
#*Assets Under Management&lt;br /&gt;
#*Number&lt;br /&gt;
#Onsite Angel Investors&lt;br /&gt;
#Onsite Mentors&lt;br /&gt;
#Founding Date&lt;br /&gt;
#Site URL&lt;br /&gt;
#Office hours investors&lt;br /&gt;
#Office hours mentor/advisors&lt;br /&gt;
#Onsite temporary workshops&lt;br /&gt;
#Onsite mentors&lt;br /&gt;
#Networking Meetups&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*University&lt;br /&gt;
#*Corporate&lt;br /&gt;
#Curriculum&lt;br /&gt;
#Onsite code school&lt;br /&gt;
#Alumni Network&lt;br /&gt;
#Nonprofit status &lt;br /&gt;
#Mission statement&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#Price for a space&lt;br /&gt;
#Price for office&lt;br /&gt;
#Twitter activity&lt;br /&gt;
#Size (sqft)&lt;br /&gt;
#Size (# companies)&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Community membership??&lt;br /&gt;
#Franchise &lt;br /&gt;
#Multiple locations within city&lt;br /&gt;
&lt;br /&gt;
==Grouping of Variables==&lt;br /&gt;
The majority of the variables fall under a few categories.&lt;br /&gt;
&lt;br /&gt;
'''Group 1: Low Hanging Fruit'''&lt;br /&gt;
Variables in this group are very easy to find and automate.&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#Founding Date&lt;br /&gt;
#URL&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#Specific Industry&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 2: The Difficult to Find'''&lt;br /&gt;
For certain variables, the information is not readily available online or is difficult to find.&lt;br /&gt;
#Size (can try to find press releases)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 3: In Between 1 and 2'''&lt;br /&gt;
Variables that are neither very easy nor very difficult to find and automate.&lt;br /&gt;
#Onsite accelerator&lt;br /&gt;
#Alumni mentor---vs. other mentors???&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 4: The Hard to Differentiate'''&lt;br /&gt;
This group contains several similar variables that would be difficult for a turk to differentiate.  To address this, we will need to create filters akin to the DSM5 scorecard.  See the Filters/Scorecard section below.&lt;br /&gt;
#Onsite VC v. Angel Investors&lt;br /&gt;
#Onsite OH Investors v. mentors&lt;br /&gt;
#Onsite temporary workshops v. networking events&lt;br /&gt;
#Curriculum v. code school&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''Group 5: The Need Further Discussion Before Collection'''&lt;br /&gt;
Variables that need to be developed more prior to collection.&lt;br /&gt;
#Franchise and multiple locations within a city&lt;br /&gt;
#Community Membership&lt;br /&gt;
&lt;br /&gt;
==Filters/Scorecard==&lt;br /&gt;
===General Approach===&lt;br /&gt;
The Scorecard will be broken down into three main parts: description, characteristics, and TBD points. The procedure for creating these will be as follows: determine the description, develop the characteristics after looking over examples, create possible mechanical turk tasks that are completely accurate even if not comprehensive (e.g. a task whose positive results always guarantee that there is an onsite mentor, even if it covers only 40% of firms, and that never misspecifies the existence of mentors), and audit the results.&lt;br /&gt;
&lt;br /&gt;
===Example===&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups that might have human capital deficits that will lead to them not being able to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turk'''&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
&lt;br /&gt;
'''Temporary Workshops'''&lt;br /&gt;
*'''Desc''': a discussion or learning session for a group of people on a specific subject&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**One time&lt;br /&gt;
**Has a topic/subject/goal&lt;br /&gt;
***e.g. learn to code workshop: JavaScript 101&lt;br /&gt;
&lt;br /&gt;
=Additional Resources=&lt;br /&gt;
#[[Mechanical Turk (Tool)]]&lt;br /&gt;
#Veeral has created a Google automation procedure for different lists&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Work in Progress=&lt;br /&gt;
==Goals for WIP==&lt;br /&gt;
#For GROUP 1, creation of mechanical turk steps:&lt;br /&gt;
#*'''EXAMPLE:'''&lt;br /&gt;
#*'''Twitter Activity'''&lt;br /&gt;
#**'''STATUS''': Complete/In Progress/Not Started&lt;br /&gt;
#**'''Previously Collected''': Yes/No&lt;br /&gt;
#**'''Published on Mechanical Turk''': Yes/No&lt;br /&gt;
#**'''Audited''': Yes/No&lt;br /&gt;
#**'''Updates''':&lt;br /&gt;
#**'''Code''':&lt;br /&gt;
#For GROUP 4:&lt;br /&gt;
##Scorecard Example&lt;br /&gt;
##Potential Mechanical Turk Steps (e.g. if specific organization is on website)&lt;br /&gt;
##Mechanical Turk Example (GROUP 1)&lt;br /&gt;
##Add Comments on:&lt;br /&gt;
###How much manual work remains/What is missing&lt;br /&gt;
###Any remaining difficulties&lt;br /&gt;
#For GROUPS 2 and 3:&lt;br /&gt;
##Brainstorm potential ways to find data&lt;br /&gt;
##Follow Steps in Group 1&lt;br /&gt;
&lt;br /&gt;
==Steps Needed to Complete==&lt;br /&gt;
#Establish automation process for Groups 1-3&lt;br /&gt;
#*Status (7/19):&lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete By Friday 7/22&lt;br /&gt;
#Differentiate variables in Group 4 &lt;br /&gt;
#*Status (7/19): &lt;br /&gt;
#*Begin Date: Started&lt;br /&gt;
#*Reach Goal: Complete by Wednesday 7/27&lt;br /&gt;
#Test processes and audit &lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/28&lt;br /&gt;
#Have a comprehensive list of potential hubs&lt;br /&gt;
#*Status (7/19): NS&lt;br /&gt;
#*Begin Date: Thursday 7/21&lt;br /&gt;
#*Reach Goal: Complete by Tuesday 7/26&lt;br /&gt;
#Fill in Remaining Data Manually&lt;br /&gt;
#*Status: NS&lt;br /&gt;
#*Begin Date: Monday 7/25&lt;br /&gt;
#*Reach Goal: Complete by Friday 7/29&lt;br /&gt;
&lt;br /&gt;
==Actual WIP==&lt;br /&gt;
===Group 1===&lt;br /&gt;
#Twitter Activity&lt;br /&gt;
#*'''STATUS''': Complete&lt;br /&gt;
#*'''Previously Collected''': YES/NO - Recorded as 2/1/0 to represent activity level, which is not the same measure we are collecting&lt;br /&gt;
#*'''Published on Mechanical Turk''': Yes&lt;br /&gt;
#*'''AUDITED''': Yes&lt;br /&gt;
#**'''Audit Results''': Compared against 30 companies that were done manually, for '''twitter handle,''' all 3 turkers agreed with our results 81% of the time, and at least 2 turkers agreed with our results 98% of the time (the exception was a company that had several twitter handles based on location).  Results were 52% and 89% respectively.&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
#**'''UPDATE (7/12)''': Audited&lt;br /&gt;
#**'''UPDATE (7/11)''': Uploaded and published on Amazon's mechanical turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#*#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#*#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
#URL&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is &amp;lt;15 seconds; recommended pay rate is therefore $.04&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#Record the URL of the first result in the following format www.___.__/ (e.g. if url is example.us/other, record www.example.us/)&lt;br /&gt;
#Mission Statement&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/18)''': Code written. Expected time for each assignment is 20-30 seconds; recommended pay rate is therefore $.08&lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine (will include site:__ from Company's URL).&lt;br /&gt;
#*#Click on the first link that is a subsection (e.g. &amp;quot;Mission&amp;quot;, &amp;quot;About&amp;quot;) of the company's website (see Company's URL)&lt;br /&gt;
#*#If this does not exist, repeat steps 1 and 2 with Search Text 2&lt;br /&gt;
#*#If this does not exist, go to Company's URL&lt;br /&gt;
#*#Record the main text on the page up to five paragraphs (some of these will be a single line).  Do NOT record subsections.&lt;br /&gt;
#*#If locating the main text in the prior step is unclear, record &amp;quot;Unclear&amp;quot;&lt;br /&gt;
#*#If no text exists, record &amp;quot;DNE&amp;quot;&lt;br /&gt;
#Nonprofit&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#**'''REQUIRES ADDITIONAL STEPS''': YES (need to double check results)&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Code written; code 2 of 2 is believed to be more accurate and efficient.  Expected time to complete is 15 seconds; recommended pay rate is therefore $.04&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company's URL.&lt;br /&gt;
#*#Go to links (sometimes will be sections of the URL page) that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.&lt;br /&gt;
#*#If none of these exist, record DNE for PAGES&lt;br /&gt;
#*#Look for the word 'profit'/'nonprofit'/'non-profit'/'not-for-profit'  (with or without -)&lt;br /&gt;
#*#If any of the key words is identified, record 1 for EXISTS (1/0), otherwise 0.&lt;br /&gt;
#*#If it is marked as 1, record all sentences that the word is found in under SENTENCES. &lt;br /&gt;
#*#If the links do exist, record the name of the link under PAGES&lt;br /&gt;
#*#Repeat steps 4, 5, and 6 on the pages that were linked.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy the text from Search Text into the search bar at http://www.guidestar.org/.&lt;br /&gt;
#*#Record all Organization Names that appear&lt;br /&gt;
#*#If no results appear, record DNE&lt;br /&gt;
#Price for a space + office&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Founding Date&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES, but only year&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Sponsors/Partners&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
#Specific Industry&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, based on LinkedIn identifier&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
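The URL task in Group 1 asks for the first result in the format www.___.__/. A minimal Python sketch of that normalization, which could be used to check turk output against a raw URL, follows. The function name normalize_url and the way a missing scheme or an existing www. prefix is handled are assumptions, not part of the original task.&lt;br /&gt;

```python
from urllib.parse import urlsplit


def normalize_url(raw):
    """Reduce a raw URL to the www.___.__/ format used in the URL task.

    Hypothetical helper: scheme and "www." handling are assumptions.
    """
    raw = raw.strip()
    # urlsplit only fills netloc when a scheme is present, so add one if missing.
    if "://" not in raw:
        raw = "http://" + raw
    host = urlsplit(raw).netloc.lower()
    if host.startswith("www."):
        host = host[len("www."):]
    return "www." + host + "/"


print(normalize_url("example.us/other"))  # prints www.example.us/
```

The example input is the one given in the task description (example.us/other).&lt;br /&gt;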
&lt;br /&gt;
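The Twitter audit in Group 1 reports two agreement rates: how often all 3 turkers matched the manual result and how often at least 2 did. A small sketch of how such rates could be computed is below. The function name, the data layout, and the sample handles are hypothetical and are not the audit data.&lt;br /&gt;

```python
def agreement_rates(manual, turk_answers):
    """Return (share where all turkers match, share where at least two match).

    manual: dict mapping company to the manually recorded value.
    turk_answers: dict mapping company to the list of turker answers.
    """
    all_agree = 0
    two_agree = 0
    for company, truth in manual.items():
        answers = turk_answers[company]
        matches = sum(1 for answer in answers if answer == truth)
        if matches == len(answers):
            all_agree += 1
        if matches >= 2:
            two_agree += 1
    total = len(manual)
    return all_agree / total, two_agree / total


# Made-up sample: four companies, three turkers each.
manual = {"A": "@a", "B": "@b", "C": "@c", "D": "@d"}
turk = {
    "A": ["@a", "@a", "@a"],
    "B": ["@b", "@b", "@x"],
    "C": ["@c", "@c", "@c"],
    "D": ["@x", "@y", "@d"],
}
print(agreement_rates(manual, turk))  # prints (0.5, 0.75)
```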
===Group 2===&lt;br /&gt;
#Size&lt;br /&gt;
#*'''BRAINSTORM''': (7/19) 1), 2), 3): search allintext: sqft/square foot/square feet  site: company URL.  4) Company Name, city, square feet and then choose the first result.  The process might be easier (and cheaper) if Veeral runs code first to eliminate the many searches that return 0 results.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': YES/NO, many missing&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Brainstorm updated &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text 1 into a search engine.&lt;br /&gt;
#*#Record DNE if 0 results returned in SEARCH 1&lt;br /&gt;
#*#If there is a result, click the first link in which the search text appears and record the sentence containing the text in SEARCH 1&lt;br /&gt;
#*#Repeat Steps 1-3 for Search Text 2 and 3 and record the results in SEARCH 2 and SEARCH 3 respectively&lt;br /&gt;
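The search texts described in the Size brainstorm could be generated in bulk before the task is published. The sketch below is one way to do that in Python. The function name build_size_queries, the exact query wording, and the sample company and URL are assumptions for illustration.&lt;br /&gt;

```python
def build_size_queries(company, city, url):
    """Build the four search texts described in the Size brainstorm.

    Hypothetical helper; the exact query wording is an assumption.
    """
    terms = ["sqft", "square foot", "square feet"]
    # Searches 1-3: restrict each size term to the company's own site.
    queries = [f"allintext: {term} site:{url}" for term in terms]
    # Search 4: a general search by company name, city, and "square feet".
    queries.append(f"{company} {city} square feet")
    return queries


for query in build_size_queries("Example Hub", "Austin", "example.com"):
    print(query)
```

Running the first three queries by script first would also show which companies return 0 results, which is the cost saving the brainstorm mentions.&lt;br /&gt;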
&lt;br /&gt;
===Group 3===&lt;br /&gt;
#Mentors&lt;br /&gt;
#*'''BRAINSTORM''': Current form of this variable seems to be too general.&lt;br /&gt;
#*'''STATUS''': In Progress&lt;br /&gt;
#*'''Previously Collected''': NO&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/19)''': Two possible codes written. First one requires more manual work&lt;br /&gt;
#*'''CODE 1 of 2'''&lt;br /&gt;
#*#Go to Company URL&lt;br /&gt;
#*#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'. &lt;br /&gt;
#*#If the key words can be identified, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record urlhome in PAGE. &lt;br /&gt;
#*#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring. &lt;br /&gt;
#*#If these exist, record 1 in BINARY, copy the sentence in which they appear into SENTENCE, and record the name of the link clicked in PAGE. &lt;br /&gt;
#*#If not, go to links related to membership 'benefits,' 'perks,' or related and repeat Step 5.&lt;br /&gt;
#*#If none of these steps result in a mark of 1, mark as 0.&lt;br /&gt;
#*'''CODE 2 of 2'''&lt;br /&gt;
#*#Copy Search Text into search engine&lt;br /&gt;
#*#Mark as 1 if a reliable site appears in the results, 0 otherwise&lt;br /&gt;
#Onsite Accelerator&lt;br /&gt;
#*'''BRAINSTORM''': Need a count.&lt;br /&gt;
#*'''STATUS''': Not Started&lt;br /&gt;
#*'''Previously Collected''': YES/NO, only a binary variable&lt;br /&gt;
#*'''Published on Mechanical Turk''': NO&lt;br /&gt;
#*'''AUDITED''': NO&lt;br /&gt;
#**'''Audit Results''': TBD&lt;br /&gt;
#*'''UPDATES''':&lt;br /&gt;
#**'''UPDATE (7/_)''': TBD &lt;br /&gt;
#*'''CODE'''&lt;br /&gt;
#*#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#*#TBD&lt;br /&gt;
&lt;br /&gt;
===Group 4===&lt;br /&gt;
====Curriculum and Code School====&lt;br /&gt;
'''Curriculum'''&lt;br /&gt;
*'''Desc''': The potential hub provides training programs for the founders of startups who might have human capital deficits that would leave them unable to adequately implement their ideas.&lt;br /&gt;
*'''Characteristics''': &lt;br /&gt;
**Education that is for a founder (as opposed to code schools which can be for people who just want to join a startup)&lt;br /&gt;
***Code schools are for startup labor supply&lt;br /&gt;
**Active input into a current entrepreneurial endeavor&lt;br /&gt;
***e.g. &amp;quot; The program is designed to augment and support the real-life business experiences that the students are facing every day in their entrepreneurial endeavors&amp;quot;	&lt;br /&gt;
**Not an ad hoc session or a one-time meeting but a full &amp;quot;course&amp;quot;; evidence of this could be:&lt;br /&gt;
**Has evidence of an integrated curriculum leading to a new competence&lt;br /&gt;
**Has evidence of fixed start and end dates that last XXX long&lt;br /&gt;
**Cultivates leadership for entrepreneurs&lt;br /&gt;
**Tagged &amp;quot;Business&amp;quot; as opposed to 'Tech' or 'Design'&lt;br /&gt;
**Is a session linked to others that regularly occurs&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
**Do we care about outsourcing?&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**Google &amp;quot;Fullbridge&amp;quot; site:URL&lt;br /&gt;
***&amp;quot;leadership programs&amp;quot;&lt;br /&gt;
***&amp;quot;aspiring entrepreneurs&amp;quot;&lt;br /&gt;
***&amp;quot;business academy&amp;quot;&lt;br /&gt;
&lt;br /&gt;
'''Code School'''&lt;br /&gt;
*'''Desc''': training programs that teach coding, data processing, webpage building and other technical skills.&lt;br /&gt;
*'''Characteristics''':&lt;br /&gt;
**Bootcamps&lt;br /&gt;
**Target group is developers or people who want to join startups, not the founders themselves&lt;br /&gt;
**Scheduled classes, not a one time meeting (as opposed to workshops)&lt;br /&gt;
**Key words: website design, coding, web development, software&lt;br /&gt;
*'''TBD points'''&lt;br /&gt;
*'''Potential Turks'''&lt;br /&gt;
**Google &amp;quot;General Assembly&amp;quot; site:URL&lt;br /&gt;
***Anyone Can Learn to Code&lt;br /&gt;
***Umbraco&lt;br /&gt;
***Designation&lt;br /&gt;
***Boise CodeWorks&lt;br /&gt;
***Grand Circus&lt;br /&gt;
***DevMountain&lt;br /&gt;
***Silicon Valley Data Academy&lt;br /&gt;
***Academy Pittsburgh&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
====Onsite VC v. Angel Investors====&lt;br /&gt;
====Onsite OH Investors v. mentors====&lt;br /&gt;
====Onsite temporary workshops v. networking events====&lt;br /&gt;
&lt;br /&gt;
==Companies Used for Auditing/etc.==&lt;br /&gt;
 Capital Factory, Austin&lt;br /&gt;
 1871, Chicago&lt;br /&gt;
 Rocket Space, San Francisco&lt;br /&gt;
 1776, Washington D.C.&lt;br /&gt;
 Betamore, Baltimore&lt;br /&gt;
 Packard Place, Charlotte&lt;br /&gt;
 The Venture Center, Little Rock&lt;br /&gt;
 GSV Labs, San Francisco&lt;br /&gt;
 The Hive, Palo Alto&lt;br /&gt;
 Innovation Pavilion, Denver&lt;br /&gt;
 OSC Tech Lab, Akron&lt;br /&gt;
 Speakeasy, Indianapolis&lt;br /&gt;
 Riverside.io, Riverside&lt;br /&gt;
 The Salt Mines, Columbus&lt;br /&gt;
 InNEVation, Las Vegas&lt;br /&gt;
 804 RVA&lt;br /&gt;
 Impact Hub, Salt Lake&lt;br /&gt;
 Awesome Inc, Louisville&lt;br /&gt;
 Geekdom, San Antonio&lt;br /&gt;
 Alloy26, Pittsburgh&lt;br /&gt;
 ReSET, Hartford&lt;br /&gt;
 Ansir Innovation Center, San Diego&lt;br /&gt;
 Domistation, Tallahassee&lt;br /&gt;
 Atlanta Tech Village, Atlanta&lt;br /&gt;
 Spark Labs, New York&lt;br /&gt;
&lt;br /&gt;
=Completed Work=&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=OLD1=&lt;br /&gt;
We will be creating a &amp;quot;Hubs scorecard&amp;quot; to determine how hub-like potential spaces are.  In order to do so, we will evaluate the places based on certain variables.  Previous variables for potential hubs were collected.  Below, we list those as well as other variables we think might be helpful to build out the scorecard.&lt;br /&gt;
&lt;br /&gt;
Ideally, we would have the following variables (not collected previously):&lt;br /&gt;
#Onsite VC/Angel/Investors (Count or binary)&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Onsite Mentors (binary) --- ''Are these the same as advisers?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#&amp;quot;Office hours&amp;quot; with investors or mentors (binary)&lt;br /&gt;
##Comments: Previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.).   We view this separation as important, BUT very difficult to collect&lt;br /&gt;
##Mechanical Turk Comments: &lt;br /&gt;
#Onsite temporary workshops (binary or count)  *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Networking Meetups (Binary or count) *** '''see mechanical turk'''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Sponsors and Partners (binary and list) --- ''are these the same?''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Alumni Network (binary)  --- ''not all potential hubs list this, and the fact that some do might indicate its importance''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Num of Companies --- ''to help determine size, as getting physical square footage is difficult''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Nonprofit (binary)  --- ''helpful in determining goals of potential hubs''&lt;br /&gt;
##Comments:&lt;br /&gt;
##Mechanical Turk Comments:&lt;br /&gt;
#Mission Includes Key Buzzwords (e.g. &amp;quot;ecosystem&amp;quot;, &amp;quot;community&amp;quot;)  --- ''helps separate simple coworking spaces from hubs''&lt;br /&gt;
&lt;br /&gt;
Example of Prior Variables Collected:&lt;br /&gt;
*Specific Industry -- ''defined as LinkedIn Self Identifier, no categories, just plain text.  We think what we really want is to see if they have a specialty (e.g. healthcare)''&lt;br /&gt;
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''&lt;br /&gt;
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''&lt;br /&gt;
*Price for Office --- ''no inputs''&lt;br /&gt;
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''&lt;br /&gt;
*Size (sqft) --- ''no records for majority of the companies''&lt;br /&gt;
*Num Conference Rooms --- ''no records for majority of the companies''&lt;br /&gt;
*Onsite accelerator (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Onsite code school (binary) --- ''relatively complete inputs''&lt;br /&gt;
*Community Membership (binary) --- ''relatively complete inputs''&lt;br /&gt;
&lt;br /&gt;
=OLD2=&lt;br /&gt;
*'''Twitter activity''': '' &lt;br /&gt;
'''UPDATE (7/14)''': Updated turk to reflect our desired formats&lt;br /&gt;
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed &lt;br /&gt;
&lt;br /&gt;
'''UPDATE (7/11)''': Uploaded and published on Amazon's mechanical turk site.  Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet.  Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs&lt;br /&gt;
#Record the company's Twitter Handle into Twitter Handle&lt;br /&gt;
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
'''Considerations'''&lt;br /&gt;
*Difficulties Encountered:&lt;br /&gt;
*Expected Time to Complete:&lt;br /&gt;
*Expectation of Results (accuracy of turk, comprehensiveness):&lt;br /&gt;
*Other Comments:&lt;br /&gt;
&lt;br /&gt;
'''Procedure'''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage. &lt;br /&gt;
#If not found on the homepage, check 'About' and check 'Community'&lt;br /&gt;
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.&lt;br /&gt;
&lt;br /&gt;
Note***: ''Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all events in the same section and do not put event types in the event titles. We would have to look into the details of each event to find out its type, and even then some event descriptions do not allow us to determine the type easily. Differentiating the types of events demands more time and effort and is therefore not suitable as a mechanical turk project.''&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'&lt;br /&gt;
#If the key words can be identified, mark as 1&lt;br /&gt;
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as: 'About,' 'Our Team,' 'Our Mission,' etc., look for a subsection or mention of mentor/mentorship/mentoring&lt;br /&gt;
#If these exist, mark as 1.&lt;br /&gt;
#If not, go to links related to membership 'benefits,' 'perks,' or related.&lt;br /&gt;
#Do same process as end of 4 and 5&lt;br /&gt;
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine.  If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.&lt;br /&gt;
#If none of these steps result in a mark of 1, mark as 0 &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'&lt;br /&gt;
#Look for the key word 'nonprofit'/'non-profit'&lt;br /&gt;
#If 'nonprofit' is identified, mark as 1, otherwise 0.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.&lt;br /&gt;
#Count the number of members&lt;br /&gt;
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for a description of the number of startups/founders/members in the community. Record the number.&lt;br /&gt;
#If number of members cannot be identified using above steps, record DNE.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*'''Sponsors and Partners''':''UPDATE: written, not published, on amazon's mechanical turk site''&lt;br /&gt;
#Copy the text in the Search Text into a search engine.&lt;br /&gt;
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.&lt;br /&gt;
#Look for a link to or mention of 'Sponsors' or 'Partners', which is often under 'About', 'Community', or related sections&lt;br /&gt;
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7134</id>
		<title>Matching VentureOne (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7134"/>
		<updated>2016-07-19T18:46:22Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Updated */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Matching VentureOne (Data)&lt;br /&gt;
|Topic Area=Patents and Innovation&lt;br /&gt;
|Owner=Ariel Sun, Rosemarie Ziedonis&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Other&lt;br /&gt;
|Primary Billing= AccMcNair01&lt;br /&gt;
}}&lt;br /&gt;
=Updated=&lt;br /&gt;
'''New Requirements'''&lt;br /&gt;
*re-run the match using *both* name-related fields in the startups_cl.dta file:  “name” and “name_prev”.&lt;br /&gt;
#the latter field pulls in patents applied for under a former name of the same company&lt;br /&gt;
&lt;br /&gt;
*in the output file, please include…&lt;br /&gt;
#include the field “entityid” that corresponds to each startup (this step is critical; else, we can’t link patents filed under alternative names of the company to the same firm); in startups_cl.dta&lt;br /&gt;
#all assignee-related fields in your patent data (e.g., assignee name, and any original and current uspto assignee codes listed for the patent); merge in from your patent files&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
&lt;br /&gt;
Text files are in:&amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\&amp;lt;/code&amp;gt;&lt;br /&gt;
#&amp;lt;code&amp;gt;summarytablefinal&amp;lt;/code&amp;gt;: summary on number of patents and grant year for all companies&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinedtable&amp;lt;/code&amp;gt;: all patent and assignee information for entities that have patents (combining 3 and 4)&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinednow&amp;lt;/code&amp;gt;: patent information under current name of the company&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinedprev&amp;lt;/code&amp;gt;: patent information under previous name of the company &lt;br /&gt;
&lt;br /&gt;
A new version of the sql script can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\sql script.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''notes'''&lt;br /&gt;
  In summarytablefinal table: (for all entities)&lt;br /&gt;
  Variables:&lt;br /&gt;
  Entity Name|Standard Orgname|Number of patent|&lt;br /&gt;
  Previous Name|Previous Standard Orgname|Previous Number of Patent|&lt;br /&gt;
  Total number of Patent|&lt;br /&gt;
  Original ID | Revised ID|&lt;br /&gt;
  min grant year|max grant year|avg grant year|&lt;br /&gt;
  ***One company (Z-KAT) has exactly the same name for entity name and previous name, so its patents are double counted. The total number of patents for it&lt;br /&gt;
  should be 11 instead of 22.&lt;br /&gt;
&lt;br /&gt;
  In fullyjoinedtable: (for entities that have patent)&lt;br /&gt;
  Including all patent and assignee variables&lt;br /&gt;
  33 variables in total&lt;br /&gt;
  Variables that start with 'asg' hold assignee information, e.g. asgtype =  assignee type&lt;br /&gt;
  The rest are patent information.&lt;br /&gt;
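The double counting noted for Z-KAT comes from adding the current-name and previous-name patent counts when both names are identical. A minimal sketch of a corrected total is below. The function name and the case-insensitive comparison are assumptions; the SQL script may handle this differently.&lt;br /&gt;

```python
def total_patents(name, n_current, name_prev, n_prev):
    """Total patents for an entity, avoiding double counting.

    If the previous name equals the current name, both matches return the
    same patents, so the previous-name count is not added again.
    Hypothetical helper; the case-insensitive comparison is an assumption.
    """
    if name_prev is None:
        return n_current
    if name.strip().upper() == name_prev.strip().upper():
        return n_current
    return n_current + n_prev


print(total_patents("Z-KAT", 11, "Z-KAT", 11))  # prints 11
```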
&lt;br /&gt;
=Old=&lt;br /&gt;
==Overview==&lt;br /&gt;
In this matching process, we will join patent data to VentureOne companies and count the number of patents affiliated with each company. &lt;br /&gt;
&lt;br /&gt;
===Raw Data===&lt;br /&gt;
Original data set of VentureOne companies can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
*All Variables: EntityName,Employees, City, State, Zip, AreaCode, Business Status, IndustryGroup...etc&lt;br /&gt;
*Variables used for matching: EntityName&lt;br /&gt;
&lt;br /&gt;
Original patent data is in our database: &amp;lt;code&amp;gt;128.42.44.181/bulk/allpatent&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Procedure===&lt;br /&gt;
We first get the standard company names for VentureOne companies from the source VentureOne data set. Then we standardize the names of the companies that have patents from our patent database. Based on the common standard company names, we join patent information to VentureOne companies.&lt;br /&gt;
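The procedure above (standardize names on both sides, join on the standard name, then summarize) can be illustrated with a short Python sketch. It is not the SQL script used for the project. The suffix list, the function names, and the sample data are assumptions, since the actual standardization rules are not given here.&lt;br /&gt;

```python
import re
from collections import defaultdict

# Assumed legal suffixes to drop; the real standardization rules are not given.
SUFFIXES = {"INC", "CORP", "CORPORATION", "LLC", "LTD", "CO"}


def standardize(name):
    """Uppercase, strip punctuation, and drop trailing legal suffixes."""
    tokens = re.sub(r"[^A-Z0-9 ]", " ", name.upper()).split()
    while tokens and tokens[-1] in SUFFIXES:
        tokens.pop()
    return " ".join(tokens)


def summarize(companies, patents):
    """Join patents to companies on the standardized name.

    companies: list of entity names.
    patents: list of (assignee_name, grant_year) pairs.
    Returns {entity name: (count, min year, max year, avg year)},
    with (0, None, None, None) for companies that own no patents.
    """
    years_by_name = defaultdict(list)
    for assignee, year in patents:
        years_by_name[standardize(assignee)].append(year)
    summary = {}
    for company in companies:
        years = years_by_name.get(standardize(company), [])
        if years:
            avg = sum(years) / len(years)
            summary[company] = (len(years), min(years), max(years), avg)
        else:
            summary[company] = (0, None, None, None)
    return summary


# Made-up sample data for illustration only.
sample = summarize(
    ["Acme, Inc.", "Beta LLC"],
    [("ACME INC", 2001), ("Acme Inc.", 2005)],
)
print(sample)
```

Companies without patents are kept with a count of 0, matching the summary table described below, which includes the companies that own no patents.&lt;br /&gt;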
&lt;br /&gt;
===Final Matched Tables===&lt;br /&gt;
#Summary table displaying number of patents owned, minimum grant year, maximum grant year and average grant year for each company (including the ones that own no patents). It can be found at:&amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturesummary.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table containing all patent information for the companies that have patents. It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturefullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Desired Variables===&lt;br /&gt;
&lt;br /&gt;
Below is the list of variables that were in the STATA file we were given:&lt;br /&gt;
 Contains data from C:\Users\ArielSun\Downloads\allpats_3sectors_06jun13.dta&lt;br /&gt;
   obs:        19,409                          &lt;br /&gt;
  vars:            36                          11 Jun 2016 17:31&lt;br /&gt;
  size:    10,655,541                          (_dta has notes)&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
               storage   display    value&lt;br /&gt;
 variable name   type    format     label      variable label&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
 id_vone         double  %9.0g                 VentureOne id&lt;br /&gt;
 name            str39   %39s                  startup name&lt;br /&gt;
 patent          str9    %9s                   patno in string&lt;br /&gt;
 apn             str6    %6s                   pat application number&lt;br /&gt;
 nmi             str40   %40s                  inventor name&lt;br /&gt;
 ttl             str244  %40s                  invention title&lt;br /&gt;
 nma             str65   %65s                  original assignee&lt;br /&gt;
 ocd             str15   %15s                  main us patent class&lt;br /&gt;
 icd             str15   %15s                  main intl patent class&lt;br /&gt;
 apd             float   %td                   application date&lt;br /&gt;
 gdateold        float   %td                   Grant date&lt;br /&gt;
 fnd_year        float   %8.0g                 startup founding year&lt;br /&gt;
 last_yr         float   %9.0g               * OLD last_yr, 2006; see notes&lt;br /&gt;
 source          byte    %8.0g                 1 if 2012 delphion searches; else from 2004/5 search&lt;br /&gt;
 pdate           float   %td                   priority date, delphion; may pre-date application date if provisional apps&lt;br /&gt;
 utility         float   %9.0g               * 1 if utility patent as initially awarded; 0 if other (reissued, reexamed, design&lt;br /&gt;
 state_country   str3    %9s                   state/country of first inventor listed&lt;br /&gt;
 asscode         float   %9.0g                 assignee code; basic.dta&lt;br /&gt;
 ayear           int     %9.0g                 application year&lt;br /&gt;
 amonth          byte    %9.0g                 application month&lt;br /&gt;
 atype           str1    %9s                 * initial assignee type; see notes&lt;br /&gt;
 class           str3    %9s                   3 digit us pat class&lt;br /&gt;
 subclass        str6    %9s                   patent subclass&lt;br /&gt;
 gdate           int     %d                    grant, or issuance, date&lt;br /&gt;
 industry        str15   %15s                  semi, software, or med devices&lt;br /&gt;
 state_hq        str2    %9s                   firm hq location; vone&lt;br /&gt;
 status06        str4    %9s                 * status of firm known in 2006; rhs truncation varies by sector&lt;br /&gt;
 exitdate        str8    %9s                   exit date, if known&lt;br /&gt;
 exityr          str4    %9s                   exit year, if known&lt;br /&gt;
 status08        str6    %9s                 * status of firm in 2008, see notes&lt;br /&gt;
 last_yr08       int     %8.0g               * exityr if ipo/acq, else 2008&lt;br /&gt;
 dcohort         float   %9.0g                 1 if founding yr during 1987-99&lt;br /&gt;
 lastyr08_minu~r float   %9.0g                 &lt;br /&gt;
 dsearch_assign  float   %9.0g                 1 if searches of pat assignment data need to be conducted; carlosn confirm?&lt;br /&gt;
 carlos_chk      float   %9.0g                 carlos: pls confirm assignment data = compiled for these pats&lt;br /&gt;
 entityid        long    %12.0g                unique startup id as of 2008, vone&lt;br /&gt;
                                             * indicated variables have notes&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
==Detailed Data Processing==&lt;br /&gt;
;*Get the VentureOne data ready&lt;br /&gt;
#Source file for VentureOne data (the original data source): &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
#Clean it up by removing extraneous symbols and words: &amp;lt;code&amp;gt;E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#Match it against itself to get standardized entity names: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Get the patent data ready&lt;br /&gt;
#Draw the distinct assignees &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
#Match them against themselves to get standardized org names for patent data &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2matched.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Match standardized org names of patent data to standardized entity names of venture data&lt;br /&gt;
:&amp;lt;code&amp;gt;Z:\allpatentsprocessed\Venture Patent Matched.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Join patent data to venture data to get patent information of each venture-backed company&lt;br /&gt;
#Join &amp;lt;code&amp;gt;patent&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;assignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt;, which matches assignees to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;matchassignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt;, which matches standard org names to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;venturepatentmatched&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;fourthjoin_cleaned&amp;lt;/code&amp;gt;, which matches standard venture company names to patent numbers.&lt;br /&gt;
&lt;br /&gt;
;*Final summary tables&lt;br /&gt;
#Summary table displaying the number of patents owned and the minimum, maximum, and average grant year for each company: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentreallyfinal.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table of all patent information for each company that has patents: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentfullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Notes&lt;br /&gt;
#All data is in the &amp;lt;code&amp;gt;allpatentsprocessed&amp;lt;/code&amp;gt; database. Access it by logging on to &amp;lt;code&amp;gt;researcher@McNair DBServ:/bulk/allpatentsprocessed&amp;lt;/code&amp;gt;&lt;br /&gt;
#A script detailing the processing procedure can be found at &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\patent data script.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==The matched data==&lt;br /&gt;
&lt;br /&gt;
We are giving back two files:&lt;br /&gt;
*One is at the patent level and contains information on 38,497 patents held by 1,557 of the 3,357 companies.&lt;br /&gt;
*The other is at the company level and aggregates patent information for all 3,357 companies.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;includeonly&amp;gt;&lt;br /&gt;
[[Category: McNair Projects]]&lt;br /&gt;
&amp;lt;/includeonly&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7133</id>
		<title>Matching VentureOne (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7133"/>
		<updated>2016-07-19T18:44:31Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Updated */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Matching VentureOne (Data)&lt;br /&gt;
|Topic Area=Patents and Innovation&lt;br /&gt;
|Owner=Ariel Sun, Rosemarie Ziedonis&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Other&lt;br /&gt;
|Primary Billing= AccMcNair01&lt;br /&gt;
}}&lt;br /&gt;
=Updated=&lt;br /&gt;
'''New Requirements'''&lt;br /&gt;
*re-run the match using *both* name-related fields in the startups_cl.dta file:  “name” and “name_prev”.&lt;br /&gt;
#the latter field pulls in patents applied for under a former name of the same company&lt;br /&gt;
&lt;br /&gt;
*in the output file, please include…&lt;br /&gt;
#the field “entityid” that corresponds to each startup, found in startups_cl.dta (this step is critical; otherwise, we can’t link patents filed under alternative names of the company to the same firm)&lt;br /&gt;
#all assignee-related fields in your patent data (e.g., assignee name, and any original and current uspto assignee codes listed for the patent); merge in from your patent files&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\&amp;lt;/code&amp;gt;&lt;br /&gt;
#&amp;lt;code&amp;gt;summarytablefinal&amp;lt;/code&amp;gt;: summary on number of patents and grant year for all companies&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinedtable&amp;lt;/code&amp;gt;: all patent and assignee information for entities that have patents (combining 3 and 4)&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinednow&amp;lt;/code&amp;gt;: patent information under current name of the company&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinedprev&amp;lt;/code&amp;gt;: patent information under previous name of the company &lt;br /&gt;
&lt;br /&gt;
'''notes'''&lt;br /&gt;
  In summarytablefinal table: (for all entities)&lt;br /&gt;
  Variables:&lt;br /&gt;
  Entity Name|Standard Orgname|Number of patent|&lt;br /&gt;
  Previous Name|Previous Standard Orgname|Previous Number of Patent|&lt;br /&gt;
  Total number of Patent|&lt;br /&gt;
  Original ID | Revised ID|&lt;br /&gt;
  min grant year|max grant year|avg grant year|&lt;br /&gt;
  ***One company (Z-KAT) has exactly the same name for entity name and previous name, so its patents are double counted. Its total number of patents&lt;br /&gt;
  should be 11 instead of 22.&lt;br /&gt;
&lt;br /&gt;
  In fullyjoinedtable (for entities that have patents):&lt;br /&gt;
  Includes all patent and assignee variables&lt;br /&gt;
  33 variables in total&lt;br /&gt;
  Variables that start with 'asg' hold assignee information, e.g. asgtype = assignee type&lt;br /&gt;
  The rest are patent information.&lt;br /&gt;
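The double counting noted above arises when the patent counts for the current and previous names are simply added. A minimal sketch of one way to avoid it follows, assuming patents can be looked up by standardized name; the function name, the dictionary layout, and the sample patent numbers are hypothetical and are not taken from the project files.&lt;br /&gt;

```python
# Hypothetical sketch: count patents per startup across its current and previous names.
# The field names mirror "name" and "name_prev"; every sample value below is made up.

def count_patents(entity, patents_by_orgname):
    """Return (current, previous, total) patent counts for one startup.

    Patents are collected into sets, so a patent reached through both the
    current and the previous name is counted only once in the total.
    """
    current = set(patents_by_orgname.get(entity["name"], []))
    previous = set(patents_by_orgname.get(entity["name_prev"], []))
    total = current | previous  # set union removes the overlap
    return len(current), len(previous), len(total)


# Illustrative lookup from standardized org name to patent numbers (assumed data).
patents_by_orgname = {
    "Z-KAT": ["P%02d" % i for i in range(11)],  # 11 made-up patent numbers
    "ACME": ["A1", "A2"],
    "ACME LABS": ["A3"],
}

zkat = {"entityid": 1, "name": "Z-KAT", "name_prev": "Z-KAT"}
acme = {"entityid": 2, "name": "ACME", "name_prev": "ACME LABS"}

print(count_patents(zkat, patents_by_orgname))  # (11, 11, 11): total is 11, not 22
print(count_patents(acme, patents_by_orgname))  # (2, 1, 3)
```

Adding the first two counts for Z-KAT would give 22; the set union gives 11, which matches the corrected total stated in the note.&lt;br /&gt;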
&lt;br /&gt;
=Old=&lt;br /&gt;
==Overview==&lt;br /&gt;
In this matching process, we join patent data to VentureOne companies and count the number of patents affiliated with each company.&lt;br /&gt;
&lt;br /&gt;
===Raw Data===&lt;br /&gt;
The original data set of VentureOne companies can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
*All variables: EntityName, Employees, City, State, Zip, AreaCode, Business Status, IndustryGroup, etc.&lt;br /&gt;
*Variables used for matching: EntityName&lt;br /&gt;
&lt;br /&gt;
Original patent data is in our database: &amp;lt;code&amp;gt;128.42.44.181/bulk/allpatent&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Procedure===&lt;br /&gt;
We first get the standard company names for VentureOne companies from the source VentureOne data set. Then we standardize the names of the companies that have patents from our patent database. Based on the common standard company names, we join patent information to VentureOne companies.&lt;br /&gt;
&lt;br /&gt;
===Final Matched Tables===&lt;br /&gt;
#Summary table displaying the number of patents owned and the minimum, maximum, and average grant year for each company (including the ones that own no patents). It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturesummary.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table containing all patent information for the companies that have patents. It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturefullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Desired Variables===&lt;br /&gt;
&lt;br /&gt;
Below is the list of variables in the Stata file we were given:&lt;br /&gt;
 Contains data from C:\Users\ArielSun\Downloads\allpats_3sectors_06jun13.dta&lt;br /&gt;
   obs:        19,409                          &lt;br /&gt;
  vars:            36                          11 Jun 2016 17:31&lt;br /&gt;
  size:    10,655,541                          (_dta has notes)&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
               storage   display    value&lt;br /&gt;
 variable name   type    format     label      variable label&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
 id_vone         double  %9.0g                 VentureOne id&lt;br /&gt;
 name            str39   %39s                  startup name&lt;br /&gt;
 patent          str9    %9s                   patno in string&lt;br /&gt;
 apn             str6    %6s                   pat application number&lt;br /&gt;
 nmi             str40   %40s                  inventor name&lt;br /&gt;
 ttl             str244  %40s                  invention title&lt;br /&gt;
 nma             str65   %65s                  original assignee&lt;br /&gt;
 ocd             str15   %15s                  main us patent class&lt;br /&gt;
 icd             str15   %15s                  main intl patent class&lt;br /&gt;
 apd             float   %td                   application date&lt;br /&gt;
 gdateold        float   %td                   Grant date&lt;br /&gt;
 fnd_year        float   %8.0g                 startup founding year&lt;br /&gt;
 last_yr         float   %9.0g               * OLD last_yr, 2006; see notes&lt;br /&gt;
 source          byte    %8.0g                 1 if 2012 delphion searches; else from 2004/5 search&lt;br /&gt;
 pdate           float   %td                   priority date, delphion; may pre-date application date if provisional apps&lt;br /&gt;
 utility         float   %9.0g               * 1 if utility patent as initially awarded; 0 if other (reissued, reexamed, design&lt;br /&gt;
 state_country   str3    %9s                   state/country of first inventor listed&lt;br /&gt;
 asscode         float   %9.0g                 assignee code; basic.dta&lt;br /&gt;
 ayear           int     %9.0g                 application year&lt;br /&gt;
 amonth          byte    %9.0g                 application month&lt;br /&gt;
 atype           str1    %9s                 * initial assignee type; see notes&lt;br /&gt;
 class           str3    %9s                   3 digit us pat class&lt;br /&gt;
 subclass        str6    %9s                   patent subclass&lt;br /&gt;
 gdate           int     %d                    grant, or issuance, date&lt;br /&gt;
 industry        str15   %15s                  semi, software, or med devices&lt;br /&gt;
 state_hq        str2    %9s                   firm hq location; vone&lt;br /&gt;
 status06        str4    %9s                 * status of firm known in 2006; rhs truncation varies by sector&lt;br /&gt;
 exitdate        str8    %9s                   exit date, if known&lt;br /&gt;
 exityr          str4    %9s                   exit year, if known&lt;br /&gt;
 status08        str6    %9s                 * status of firm in 2008, see notes&lt;br /&gt;
 last_yr08       int     %8.0g               * exityr if ipo/acq, else 2008&lt;br /&gt;
 dcohort         float   %9.0g                 1 if founding yr during 1987-99&lt;br /&gt;
 lastyr08_minu~r float   %9.0g                 &lt;br /&gt;
 dsearch_assign  float   %9.0g                 1 if searches of pat assignment data need to be conducted; carlosn confirm?&lt;br /&gt;
 carlos_chk      float   %9.0g                 carlos: pls confirm assignment data = compiled for these pats&lt;br /&gt;
 entityid        long    %12.0g                unique startup id as of 2008, vone&lt;br /&gt;
                                             * indicated variables have notes&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
==Detailed Data Processing==&lt;br /&gt;
;*Get the VentureOne data ready&lt;br /&gt;
#Source file for VentureOne data (the original data source): &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
#Clean it up by removing extraneous symbols and words: &amp;lt;code&amp;gt;E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#Match it against itself to get standardized entity names: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Get the patent data ready&lt;br /&gt;
#Draw the distinct assignees &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
#Match them against themselves to get standardized org names for patent data &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2matched.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Match standardized org names of patent data to standardized entity names of venture data&lt;br /&gt;
:&amp;lt;code&amp;gt;Z:\allpatentsprocessed\Venture Patent Matched.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Join patent data to venture data to get patent information of each venture-backed company&lt;br /&gt;
#Join &amp;lt;code&amp;gt;patent&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;assignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt;, which matches assignees to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;matchassignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt;, which matches standard org names to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;venturepatentmatched&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;fourthjoin_cleaned&amp;lt;/code&amp;gt;, which matches standard venture company names to patent numbers.&lt;br /&gt;
&lt;br /&gt;
;*Final summary tables&lt;br /&gt;
#Summary table displaying the number of patents owned and the minimum, maximum, and average grant year for each company: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentreallyfinal.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table of all patent information for each company that has patents: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentfullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
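The joins and the company-level summary described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the processing script referenced on this page: the sample rows, the dictionary layout, and the &amp;lt;code&amp;gt;summarize&amp;lt;/code&amp;gt; helper are hypothetical, and only the table roles (patent, assignee, matchassignee, venturepatentmatched) follow the steps above.&lt;br /&gt;

```python
# Hypothetical sketch of the three joins and the company-level summary.
# All rows are made up; only the table roles follow the steps described above.

patent = {"P1": 1999, "P2": 2001, "P3": 2004}  # patent number: grant year
assignee = {"P1": "Acme Inc.", "P2": "ACME INC", "P3": "Beta LLC"}  # patent number: raw assignee
matchassignee = {"Acme Inc.": "ACME", "ACME INC": "ACME", "Beta LLC": "BETA"}  # raw assignee: standard org name
venturepatentmatched = {"ACME": "Acme"}  # standard org name: venture company name
companies = ["Acme", "Gamma"]  # every venture company, with or without patents

# Start from all companies so that those without patents still get a summary row.
grant_years = {company: [] for company in companies}
for number, year in patent.items():
    raw_name = assignee.get(number)               # first join: patent to assignee
    orgname = matchassignee.get(raw_name)         # second join: assignee to standard org name
    company = venturepatentmatched.get(orgname)   # third join: org name to venture company
    if company in grant_years:
        grant_years[company].append(year)


def summarize(years):
    """Return patent count and min, max, and average grant year (None when no patents)."""
    if not years:
        return {"patents": 0, "min": None, "max": None, "avg": None}
    return {
        "patents": len(years),
        "min": min(years),
        "max": max(years),
        "avg": sum(years) / len(years),
    }


summary = {company: summarize(years) for company, years in grant_years.items()}
for company in companies:
    print(company, summary[company])
```

Seeding the result with every company before joining is what keeps companies that own no patents in the summary, here with a count of zero and empty grant-year fields.&lt;br /&gt;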
&lt;br /&gt;
;*Notes&lt;br /&gt;
#All data is in the &amp;lt;code&amp;gt;allpatentsprocessed&amp;lt;/code&amp;gt; database. Access it by logging on to &amp;lt;code&amp;gt;researcher@McNair DBServ:/bulk/allpatentsprocessed&amp;lt;/code&amp;gt;&lt;br /&gt;
#A script detailing the processing procedure can be found at &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\patent data script.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==The matched data==&lt;br /&gt;
&lt;br /&gt;
We are giving back two files:&lt;br /&gt;
*One is at the patent level and contains information on 38,497 patents held by 1,557 of the 3,357 companies.&lt;br /&gt;
*The other is at the company level and aggregates patent information for all 3,357 companies.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;includeonly&amp;gt;&lt;br /&gt;
[[Category: McNair Projects]]&lt;br /&gt;
&amp;lt;/includeonly&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7132</id>
		<title>Matching VentureOne (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7132"/>
		<updated>2016-07-19T18:44:06Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Updated */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Matching VentureOne (Data)&lt;br /&gt;
|Topic Area=Patents and Innovation&lt;br /&gt;
|Owner=Ariel Sun, Rosemarie Ziedonis&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Other&lt;br /&gt;
|Primary Billing= AccMcNair01&lt;br /&gt;
}}&lt;br /&gt;
=Updated=&lt;br /&gt;
'''New Requirements'''&lt;br /&gt;
*re-run the match using *both* name-related fields in the startups_cl.dta file:  “name” and “name_prev”.&lt;br /&gt;
#the latter field pulls in patents applied for under a former name of the same company&lt;br /&gt;
&lt;br /&gt;
*in the output file, please include…&lt;br /&gt;
#the field “entityid” that corresponds to each startup, found in startups_cl.dta (this step is critical; otherwise, we can’t link patents filed under alternative names of the company to the same firm)&lt;br /&gt;
#all assignee-related fields in your patent data (e.g., assignee name, and any original and current uspto assignee codes listed for the patent); merge in from your patent files&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\&amp;lt;/code&amp;gt;&lt;br /&gt;
#&amp;lt;code&amp;gt;summarytablefinal&amp;lt;/code&amp;gt;: summary on number of patents and grant year for all companies&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinedtable&amp;lt;/code&amp;gt;: all patent and assignee information for entities that have patents (combining 3 and 4)&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinednow&amp;lt;/code&amp;gt;: patent information under current name of the company&lt;br /&gt;
#&amp;lt;code&amp;gt;fullyjoinedprev&amp;lt;/code&amp;gt;: patent information under previous name of the company&lt;br /&gt;
&lt;br /&gt;
'''notes'''&lt;br /&gt;
  In summarytablefinal table: (for all entities)&lt;br /&gt;
  Variables:&lt;br /&gt;
  Entity Name|Standard Orgname|Number of patent|&lt;br /&gt;
  Previous Name|Previous Standard Orgname|Previous Number of Patent|&lt;br /&gt;
  Total number of Patent|&lt;br /&gt;
  Original ID | Revised ID|&lt;br /&gt;
  min grant year|max grant year|avg grant year|&lt;br /&gt;
  ***One company (Z-KAT) has exactly the same name for entity name and previous name, so its patents are double counted. Its total number of patents&lt;br /&gt;
  should be 11 instead of 22.&lt;br /&gt;
&lt;br /&gt;
  In fullyjoinedtable (for entities that have patents):&lt;br /&gt;
  Includes all patent and assignee variables&lt;br /&gt;
  33 variables in total&lt;br /&gt;
  Variables that start with 'asg' hold assignee information, e.g. asgtype = assignee type&lt;br /&gt;
  The rest are patent information.&lt;br /&gt;
&lt;br /&gt;
=Old=&lt;br /&gt;
==Overview==&lt;br /&gt;
In this matching process, we join patent data to VentureOne companies and count the number of patents affiliated with each company.&lt;br /&gt;
&lt;br /&gt;
===Raw Data===&lt;br /&gt;
The original data set of VentureOne companies can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
*All variables: EntityName, Employees, City, State, Zip, AreaCode, Business Status, IndustryGroup, etc.&lt;br /&gt;
*Variables used for matching: EntityName&lt;br /&gt;
&lt;br /&gt;
Original patent data is in our database: &amp;lt;code&amp;gt;128.42.44.181/bulk/allpatent&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Procedure===&lt;br /&gt;
We first get the standard company names for VentureOne companies from the source VentureOne data set. Then we standardize the names of the companies that have patents from our patent database. Based on the common standard company names, we join patent information to VentureOne companies.&lt;br /&gt;
&lt;br /&gt;
===Final Matched Tables===&lt;br /&gt;
#Summary table displaying the number of patents owned and the minimum, maximum, and average grant year for each company (including the ones that own no patents). It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturesummary.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table containing all patent information for the companies that have patents. It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturefullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Desired Variables===&lt;br /&gt;
&lt;br /&gt;
Below is the list of variables in the Stata file we were given:&lt;br /&gt;
 Contains data from C:\Users\ArielSun\Downloads\allpats_3sectors_06jun13.dta&lt;br /&gt;
   obs:        19,409                          &lt;br /&gt;
  vars:            36                          11 Jun 2016 17:31&lt;br /&gt;
  size:    10,655,541                          (_dta has notes)&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
               storage   display    value&lt;br /&gt;
 variable name   type    format     label      variable label&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
 id_vone         double  %9.0g                 VentureOne id&lt;br /&gt;
 name            str39   %39s                  startup name&lt;br /&gt;
 patent          str9    %9s                   patno in string&lt;br /&gt;
 apn             str6    %6s                   pat application number&lt;br /&gt;
 nmi             str40   %40s                  inventor name&lt;br /&gt;
 ttl             str244  %40s                  invention title&lt;br /&gt;
 nma             str65   %65s                  original assignee&lt;br /&gt;
 ocd             str15   %15s                  main us patent class&lt;br /&gt;
 icd             str15   %15s                  main intl patent class&lt;br /&gt;
 apd             float   %td                   application date&lt;br /&gt;
 gdateold        float   %td                   Grant date&lt;br /&gt;
 fnd_year        float   %8.0g                 startup founding year&lt;br /&gt;
 last_yr         float   %9.0g               * OLD last_yr, 2006; see notes&lt;br /&gt;
 source          byte    %8.0g                 1 if 2012 delphion searches; else from 2004/5 search&lt;br /&gt;
 pdate           float   %td                   priority date, delphion; may pre-date application date if provisional apps&lt;br /&gt;
 utility         float   %9.0g               * 1 if utility patent as initially awarded; 0 if other (reissued, reexamed, design&lt;br /&gt;
 state_country   str3    %9s                   state/country of first inventor listed&lt;br /&gt;
 asscode         float   %9.0g                 assignee code; basic.dta&lt;br /&gt;
 ayear           int     %9.0g                 application year&lt;br /&gt;
 amonth          byte    %9.0g                 application month&lt;br /&gt;
 atype           str1    %9s                 * initial assignee type; see notes&lt;br /&gt;
 class           str3    %9s                   3 digit us pat class&lt;br /&gt;
 subclass        str6    %9s                   patent subclass&lt;br /&gt;
 gdate           int     %d                    grant, or issuance, date&lt;br /&gt;
 industry        str15   %15s                  semi, software, or med devices&lt;br /&gt;
 state_hq        str2    %9s                   firm hq location; vone&lt;br /&gt;
 status06        str4    %9s                 * status of firm known in 2006; rhs truncation varies by sector&lt;br /&gt;
 exitdate        str8    %9s                   exit date, if known&lt;br /&gt;
 exityr          str4    %9s                   exit year, if known&lt;br /&gt;
 status08        str6    %9s                 * status of firm in 2008, see notes&lt;br /&gt;
 last_yr08       int     %8.0g               * exityr if ipo/acq, else 2008&lt;br /&gt;
 dcohort         float   %9.0g                 1 if founding yr during 1987-99&lt;br /&gt;
 lastyr08_minu~r float   %9.0g                 &lt;br /&gt;
 dsearch_assign  float   %9.0g                 1 if searches of pat assignment data need to be conducted; carlosn confirm?&lt;br /&gt;
 carlos_chk      float   %9.0g                 carlos: pls confirm assignment data = compiled for these pats&lt;br /&gt;
 entityid        long    %12.0g                unique startup id as of 2008, vone&lt;br /&gt;
                                             * indicated variables have notes&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
==Detailed Data Processing==&lt;br /&gt;
;*Get the VentureOne data ready&lt;br /&gt;
#Source file for VentureOne data (the original data source): &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
#Clean it up by removing extraneous symbols and words: &amp;lt;code&amp;gt;E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#Match it against itself to get standardized entity names: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Get the patent data ready&lt;br /&gt;
#Draw the distinct assignees &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
#Match them against themselves to get standardized org names for patent data &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2matched.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Match standardized org names of patent data to standardized entity names of venture data&lt;br /&gt;
:&amp;lt;code&amp;gt;Z:\allpatentsprocessed\Venture Patent Matched.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Join patent data to venture data to get patent information of each venture-backed company&lt;br /&gt;
#Join &amp;lt;code&amp;gt;patent&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;assignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt;, which matches assignees to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;matchassignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt;, which matches standard org names to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;venturepatentmatched&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;fourthjoin_cleaned&amp;lt;/code&amp;gt;, which matches standard venture company names to patent numbers.&lt;br /&gt;
&lt;br /&gt;
;*Final summary tables&lt;br /&gt;
#Summary table displaying the number of patents owned and the minimum, maximum, and average grant year for each company: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentreallyfinal.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table of all patent information for each company that has patents: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentfullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Notes&lt;br /&gt;
#All data is in the &amp;lt;code&amp;gt;allpatentsprocessed&amp;lt;/code&amp;gt; database. Access it by logging on to &amp;lt;code&amp;gt;researcher@McNair DBServ:/bulk/allpatentsprocessed&amp;lt;/code&amp;gt;&lt;br /&gt;
#A script detailing the processing procedure can be found at &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\patent data script.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==The matched data==&lt;br /&gt;
&lt;br /&gt;
We are giving back two files:&lt;br /&gt;
*One is at the patent level and contains information on 38,497 patents held by 1,557 of the 3,357 companies.&lt;br /&gt;
*The other is at the company level and aggregates patent information for all 3,357 companies.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;includeonly&amp;gt;&lt;br /&gt;
[[Category: McNair Projects]]&lt;br /&gt;
&amp;lt;/includeonly&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7131</id>
		<title>Matching VentureOne (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7131"/>
		<updated>2016-07-19T18:43:15Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Updated */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Matching VentureOne (Data)&lt;br /&gt;
|Topic Area=Patents and Innovation&lt;br /&gt;
|Owner=Ariel Sun, Rosemarie Ziedonis&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Other&lt;br /&gt;
|Primary Billing= AccMcNair01&lt;br /&gt;
}}&lt;br /&gt;
=Updated=&lt;br /&gt;
'''New Requirements'''&lt;br /&gt;
*re-run the match using *both* name-related fields in the startups_cl.dta file:  “name” and “name_prev”.&lt;br /&gt;
#the latter field pulls in patents applied for under a former name of the same company&lt;br /&gt;
&lt;br /&gt;
*in the output file, please include…&lt;br /&gt;
#include the field “entityid” that corresponds to each startup (this step is critical; else, we can’t link patents filed under alternative names of the company to the same firm); in startups_cl.dta&lt;br /&gt;
#all assignee-related fields in your patent data (e.g., assignee name, and any original and current uspto assignee codes listed for the patent); merge in from your patent files&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\&amp;lt;/code&amp;gt;&lt;br /&gt;
#summarytablefinal: summary on number of patents and grant year for all companies&lt;br /&gt;
#fullyjoinedtable: all patent and assignee information for entities that have patents (combining 3 and 4)&lt;br /&gt;
#fullyjoinednow: patent information under current name of the company&lt;br /&gt;
#fullyjoinedprec: patent information under previous name of the company &lt;br /&gt;
&lt;br /&gt;
'''notes'''&lt;br /&gt;
  In summarytablefinal table: (for all entities)&lt;br /&gt;
  Variables:&lt;br /&gt;
  Entity Name|Standard Orgname|Number of patent|&lt;br /&gt;
  Previous Name|Previous Standard Orgname|Previous Number of Patent|&lt;br /&gt;
  Total number of Patent|&lt;br /&gt;
  Original ID | Revised ID|&lt;br /&gt;
  min grant year|max grant year|avg grant year|&lt;br /&gt;
  ***One company (Z-KAT) has exactly the same entity name and previous name, so its patents are counted twice. Its total number of patents&lt;br /&gt;
  should be 11 instead of 22.&lt;br /&gt;
&lt;br /&gt;
  In fullyjoinedtable: (for entities that have patents)&lt;br /&gt;
  Includes all patent and assignee variables&lt;br /&gt;
  33 variables in total&lt;br /&gt;
  Variables that start with 'asg' hold assignee information, e.g. asgtype = assignee type&lt;br /&gt;
  The rest are patent information.&lt;br /&gt;
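&lt;br /&gt;
The double counting noted for Z-KAT could be avoided by counting distinct patent numbers per entity across the current-name and previous-name matches. A minimal Python sketch, assuming hypothetical rows of (entityid, patent number); the ids and patent numbers are made up for illustration and are not taken from the project files:&lt;br /&gt;

```python
from collections import defaultdict

def count_distinct_patents(current_rows, previous_rows):
    """Count unique patents per entity across both name matches.

    Each row is a hypothetical (entityid, patent_number) pair.
    """
    patents_by_entity = defaultdict(set)
    for entityid, patent in list(current_rows) + list(previous_rows):
        # A set keeps a single copy of a patent matched under both names.
        patents_by_entity[entityid].add(patent)
    return {entityid: len(patents) for entityid, patents in patents_by_entity.items()}

# Illustrative data only: entity 1 has the same name in both fields,
# so the same two patents appear in both match results.
current = [(1, "7000001"), (1, "7000002"), (2, "7000003")]
previous = [(1, "7000001"), (1, "7000002")]
totals = count_distinct_patents(current, previous)
print(totals)
```

Summing the two per-name counts would report four patents for entity 1 here; counting distinct patent numbers reports two.&lt;br /&gt;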
&lt;br /&gt;
=Old=&lt;br /&gt;
==Overview==&lt;br /&gt;
In this matching process, we join patent data to VentureOne companies and count the number of patents affiliated with each company.&lt;br /&gt;
&lt;br /&gt;
===Raw Data===&lt;br /&gt;
Original data set of VentureOne companies can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
*All Variables: EntityName,Employees, City, State, Zip, AreaCode, Business Status, IndustryGroup...etc&lt;br /&gt;
*Variables used for matching: EntityName&lt;br /&gt;
&lt;br /&gt;
Original patent data is in our database: &amp;lt;code&amp;gt;128.42.44.181/bulk/allpatent&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Procedure===&lt;br /&gt;
We first get the standard company names for VentureOne companies from the source VentureOne data set. Then we standardize the names of the companies that have patents from our patent database. Based on the common standard company names, we join patent information to VentureOne companies.&lt;br /&gt;
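&lt;br /&gt;
The procedure above can be sketched in Python as follows. This is only an illustration: the suffix list and the &amp;lt;code&amp;gt;standardize&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;summarize&amp;lt;/code&amp;gt; helpers are hypothetical stand-ins, the Matcher script applies its own standardization rules, and the sample names and years are invented:&lt;br /&gt;

```python
import re
from statistics import mean

# Hypothetical suffix list; the project's Matcher script applies its own rules.
SUFFIXES = {"inc", "corp", "corporation", "llc", "ltd", "co"}

def standardize(name):
    """Lowercase, drop punctuation, and strip common corporate suffixes."""
    words = re.sub(r"[^a-z0-9 ]", " ", name.lower()).split()
    return " ".join(w for w in words if w not in SUFFIXES)

def summarize(companies, patents):
    """Join patents to companies on the standardized name.

    companies: list of entity names.
    patents: list of (assignee_name, patent_number, grant_year) tuples.
    Returns {entity name: (count, min year, max year, avg year)};
    companies without patents get (0, None, None, None).
    """
    years_by_std = {}
    for assignee, _patent, year in patents:
        years_by_std.setdefault(standardize(assignee), []).append(year)
    summary = {}
    for entity in companies:
        years = years_by_std.get(standardize(entity), [])
        if years:
            summary[entity] = (len(years), min(years), max(years), mean(years))
        else:
            summary[entity] = (0, None, None, None)
    return summary

# Illustrative data only.
companies = ["Acme Devices, Inc.", "Beta Software"]
patents = [("ACME DEVICES INC", "7000001", 2001),
           ("Acme Devices", "7000002", 2003)]
result = summarize(companies, patents)
print(result)
```

Keeping companies with no match in the output mirrors the summary table, which also lists companies that own no patents.&lt;br /&gt;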
&lt;br /&gt;
===Final Matched Tables===&lt;br /&gt;
#Summary table displaying number of patents owned, minimum grant year, maximum grant year and average grant year for each company (including the ones that own no patents). It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturesummary.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table containing all patent information for the companies that have patents. It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturefullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Desired Variables===&lt;br /&gt;
&lt;br /&gt;
Below is the list of variables that were in the STATA file we were given:&lt;br /&gt;
 Contains data from C:\Users\ArielSun\Downloads\allpats_3sectors_06jun13.dta&lt;br /&gt;
   obs:        19,409                          &lt;br /&gt;
  vars:            36                          11 Jun 2016 17:31&lt;br /&gt;
  size:    10,655,541                          (_dta has notes)&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
               storage   display    value&lt;br /&gt;
 variable name   type    format     label      variable label&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
 id_vone         double  %9.0g                 VentureOne id&lt;br /&gt;
 name            str39   %39s                  startup name&lt;br /&gt;
 patent          str9    %9s                   patno in string&lt;br /&gt;
 apn             str6    %6s                   pat application number&lt;br /&gt;
 nmi             str40   %40s                  inventor name&lt;br /&gt;
 ttl             str244  %40s                  invention title&lt;br /&gt;
 nma             str65   %65s                  original assignee&lt;br /&gt;
 ocd             str15   %15s                  main us patent class&lt;br /&gt;
 icd             str15   %15s                  main intl patent class&lt;br /&gt;
 apd             float   %td                   application date&lt;br /&gt;
 gdateold        float   %td                   Grant date&lt;br /&gt;
 fnd_year        float   %8.0g                 startup founding year&lt;br /&gt;
 last_yr         float   %9.0g               * OLD last_yr, 2006; see notes&lt;br /&gt;
 source          byte    %8.0g                 1 if 2012 delphion searches; else from 2004/5 search&lt;br /&gt;
 pdate           float   %td                   priority date, delphion; may pre-date application date if provisional apps&lt;br /&gt;
 utility         float   %9.0g               * 1 if utility patent as initially awarded; 0 if other (reissued, reexamed, design&lt;br /&gt;
 state_country   str3    %9s                   state/country of first inventor listed&lt;br /&gt;
 asscode         float   %9.0g                 assignee code; basic.dta&lt;br /&gt;
 ayear           int     %9.0g                 application year&lt;br /&gt;
 amonth          byte    %9.0g                 application month&lt;br /&gt;
 atype           str1    %9s                 * initial assignee type; see notes&lt;br /&gt;
 class           str3    %9s                   3 digit us pat class&lt;br /&gt;
 subclass        str6    %9s                   patent subclass&lt;br /&gt;
 gdate           int     %d                    grant, or issuance, date&lt;br /&gt;
 industry        str15   %15s                  semi, software, or med devices&lt;br /&gt;
 state_hq        str2    %9s                   firm hq location; vone&lt;br /&gt;
 status06        str4    %9s                 * status of firm known in 2006; rhs truncation varies by sector&lt;br /&gt;
 exitdate        str8    %9s                   exit date, if known&lt;br /&gt;
 exityr          str4    %9s                   exit year, if known&lt;br /&gt;
 status08        str6    %9s                 * status of firm in 2008, see notes&lt;br /&gt;
 last_yr08       int     %8.0g               * exityr if ipo/acq, else 2008&lt;br /&gt;
 dcohort         float   %9.0g                 1 if founding yr during 1987-99&lt;br /&gt;
 lastyr08_minu~r float   %9.0g                 &lt;br /&gt;
 dsearch_assign  float   %9.0g                 1 if searches of pat assignment data need to be conducted; carlosn confirm?&lt;br /&gt;
 carlos_chk      float   %9.0g                 carlos: pls confirm assignment data = compiled for these pats&lt;br /&gt;
 entityid        long    %12.0g                unique startup id as of 2008, vone&lt;br /&gt;
                                             * indicated variables have notes&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
==Detailed Data Processing==&lt;br /&gt;
;*Get the VentureOne data ready&lt;br /&gt;
#Source file for VentureOne data &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt; Original data source&lt;br /&gt;
#Clean it up &amp;lt;code&amp;gt;E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt&amp;lt;/code&amp;gt; extraneous symbols and words removed&lt;br /&gt;
#Match it against itself to get standardized entity names &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Get the patent data ready&lt;br /&gt;
#Draw the distinct assignees &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
#Match them against themselves to get standardized org names for patent data &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2matched.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Match standardized org names of patent data to standardized entity names of venture data&lt;br /&gt;
:&amp;lt;code&amp;gt;Z:\allpatentsprocessed\Venture Patent Matched.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Join patent data to venture data to get patent information of each venture-backed company&lt;br /&gt;
#Join &amp;lt;code&amp;gt;patent&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;assignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt; which matches assignees to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;matchassignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt; which matches standard org names to patent numbers&lt;br /&gt;
#Join &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;venturepatentmatched&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;fourthjoin_cleaned&amp;lt;/code&amp;gt; which matches standard venture company names to patent numbers&lt;br /&gt;
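&lt;br /&gt;
The three joins above can be sketched as a chain of SQL statements. The table names follow the steps, but every column name is an assumption, the rows are invented, and an in-memory SQLite database stands in for the project database so that the example is self-contained:&lt;br /&gt;

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
# Table names follow the steps above; every column name is an assumption.
cur.executescript("""
CREATE TABLE patent (patentnumber TEXT, grantyear INTEGER);
CREATE TABLE assignee (patentnumber TEXT, orgname TEXT);
CREATE TABLE matchassignee (orgname TEXT, stdorgname TEXT);
CREATE TABLE venturepatentmatched (stdorgname TEXT, stdentityname TEXT);
INSERT INTO patent VALUES ('7000001', 2001), ('7000002', 2003);
INSERT INTO assignee VALUES ('7000001', 'ACME DEVICES INC'), ('7000002', 'Other Org');
INSERT INTO matchassignee VALUES ('ACME DEVICES INC', 'acme devices'), ('Other Org', 'other org');
INSERT INTO venturepatentmatched VALUES ('acme devices', 'acme devices');

-- Step 1: assignees to patent numbers.
CREATE TABLE firstjoin_cleaned AS
  SELECT p.patentnumber, p.grantyear, a.orgname
  FROM patent p JOIN assignee a ON p.patentnumber = a.patentnumber;
-- Step 2: standard org names to patent numbers.
CREATE TABLE secondjoin_cleaned AS
  SELECT f.patentnumber, f.grantyear, m.stdorgname
  FROM firstjoin_cleaned f JOIN matchassignee m ON f.orgname = m.orgname;
-- Step 3: standard venture company names to patent numbers.
CREATE TABLE fourthjoin_cleaned AS
  SELECT s.patentnumber, s.grantyear, v.stdentityname
  FROM secondjoin_cleaned s JOIN venturepatentmatched v ON s.stdorgname = v.stdorgname;
""")
rows = cur.execute(
    "SELECT stdentityname, patentnumber, grantyear FROM fourthjoin_cleaned"
).fetchall()
print(rows)
conn.close()
```

Because each step is an inner join, a patent whose assignee has no VentureOne match drops out by the final table, which is why only companies with patents appear in the fully joined output.&lt;br /&gt;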
&lt;br /&gt;
;*Final summary tables&lt;br /&gt;
#Summary table displaying number of patents owned, minimum grant year, maximum grant year and average grant year for each company &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentreallyfinal.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table of all patent information for each company that has patents &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentfullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Notes&lt;br /&gt;
#All data is in the &amp;lt;code&amp;gt;allpatentsprocessed&amp;lt;/code&amp;gt; database. Access it by logging on to &amp;lt;code&amp;gt;researcher@McNair DBServ:/bulk/allpatentsprocessed&amp;lt;/code&amp;gt;&lt;br /&gt;
#A script detailing the processing procedure can be found at &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\patent data script.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==The matched data==&lt;br /&gt;
&lt;br /&gt;
We are giving back two files:&lt;br /&gt;
*One is at the patent level and contains information on 38,497 patents held by 1,557 of the 3,357 companies.&lt;br /&gt;
*The other file is at the company level and aggregates patent information for the 3,357 companies.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;includeonly&amp;gt;&lt;br /&gt;
[[Category: McNair Projects]]&lt;br /&gt;
&amp;lt;/includeonly&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7130</id>
		<title>Matching VentureOne (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7130"/>
		<updated>2016-07-19T18:42:32Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: /* Updated */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Matching VentureOne (Data)&lt;br /&gt;
|Topic Area=Patents and Innovation&lt;br /&gt;
|Owner=Ariel Sun, Rosemarie Ziedonis&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Other&lt;br /&gt;
|Primary Billing= AccMcNair01&lt;br /&gt;
}}&lt;br /&gt;
=Updated=&lt;br /&gt;
'''New Requirements'''&lt;br /&gt;
#re-run the match using *both* name-related fields in the startups_cl.dta file:  “name” and “name_prev”.&lt;br /&gt;
*the latter field pulls in patents applied for under a former name of the same company&lt;br /&gt;
&lt;br /&gt;
#in the output file, please include…&lt;br /&gt;
*include the field “entityid” that corresponds to each startup (this step is critical; else, we can’t link patents filed under alternative names of the company to the same firm); in startups_cl.dta&lt;br /&gt;
*all assignee-related fields in your patent data (e.g., assignee name, and any original and current uspto assignee codes listed for the patent); merge in from your patent files&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\&amp;lt;/code&amp;gt;&lt;br /&gt;
#summarytablefinal: summary on number of patents and grant year for all companies&lt;br /&gt;
#fullyjoinedtable: all patent and assignee information for entities that have patents (combining 3 and 4)&lt;br /&gt;
#fullyjoinednow: patent information under current name of the company&lt;br /&gt;
#fullyjoinedprec: patent information under previous name of the company &lt;br /&gt;
&lt;br /&gt;
'''notes'''&lt;br /&gt;
  In summarytablefinal table: (for all entities)&lt;br /&gt;
  Variables:&lt;br /&gt;
  Entity Name|Standard Orgname|Number of patent|&lt;br /&gt;
  Previous Name|Previous Standard Orgname|Previous Number of Patent|&lt;br /&gt;
  Total number of Patent|&lt;br /&gt;
  Original ID | Revised ID|&lt;br /&gt;
  min grant year|max grant year|avg grant year|&lt;br /&gt;
  ***One company (Z-KAT) has exactly the same entity name and previous name, so its patents are counted twice. Its total number of patents&lt;br /&gt;
  should be 11 instead of 22.&lt;br /&gt;
&lt;br /&gt;
  In fullyjoinedtable: (for entities that have patents)&lt;br /&gt;
  Includes all patent and assignee variables&lt;br /&gt;
  33 variables in total&lt;br /&gt;
  Variables that start with 'asg' hold assignee information, e.g. asgtype = assignee type&lt;br /&gt;
  The rest are patent information.&lt;br /&gt;
&lt;br /&gt;
=Old=&lt;br /&gt;
==Overview==&lt;br /&gt;
In this matching process, we join patent data to VentureOne companies and count the number of patents affiliated with each company.&lt;br /&gt;
&lt;br /&gt;
===Raw Data===&lt;br /&gt;
Original data set of VentureOne companies can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
*All Variables: EntityName,Employees, City, State, Zip, AreaCode, Business Status, IndustryGroup...etc&lt;br /&gt;
*Variables used for matching: EntityName&lt;br /&gt;
&lt;br /&gt;
Original patent data is in our database: &amp;lt;code&amp;gt;128.42.44.181/bulk/allpatent&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Procedure===&lt;br /&gt;
We first get the standard company names for VentureOne companies from the source VentureOne data set. Then we standardize the names of the companies that have patents from our patent database. Based on the common standard company names, we join patent information to VentureOne companies.&lt;br /&gt;
&lt;br /&gt;
===Final Matched Tables===&lt;br /&gt;
#Summary table displaying number of patents owned, minimum grant year, maximum grant year and average grant year for each company (including the ones that own no patents). It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturesummary.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table containing all patent information for the companies that have patents. It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturefullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Desired Variables===&lt;br /&gt;
&lt;br /&gt;
Below is the list of variables that were in the STATA file we were given:&lt;br /&gt;
 Contains data from C:\Users\ArielSun\Downloads\allpats_3sectors_06jun13.dta&lt;br /&gt;
   obs:        19,409                          &lt;br /&gt;
  vars:            36                          11 Jun 2016 17:31&lt;br /&gt;
  size:    10,655,541                          (_dta has notes)&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
               storage   display    value&lt;br /&gt;
 variable name   type    format     label      variable label&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
 id_vone         double  %9.0g                 VentureOne id&lt;br /&gt;
 name            str39   %39s                  startup name&lt;br /&gt;
 patent          str9    %9s                   patno in string&lt;br /&gt;
 apn             str6    %6s                   pat application number&lt;br /&gt;
 nmi             str40   %40s                  inventor name&lt;br /&gt;
 ttl             str244  %40s                  invention title&lt;br /&gt;
 nma             str65   %65s                  original assignee&lt;br /&gt;
 ocd             str15   %15s                  main us patent class&lt;br /&gt;
 icd             str15   %15s                  main intl patent class&lt;br /&gt;
 apd             float   %td                   application date&lt;br /&gt;
 gdateold        float   %td                   Grant date&lt;br /&gt;
 fnd_year        float   %8.0g                 startup founding year&lt;br /&gt;
 last_yr         float   %9.0g               * OLD last_yr, 2006; see notes&lt;br /&gt;
 source          byte    %8.0g                 1 if 2012 delphion searches; else from 2004/5 search&lt;br /&gt;
 pdate           float   %td                   priority date, delphion; may pre-date application date if provisional apps&lt;br /&gt;
 utility         float   %9.0g               * 1 if utility patent as initially awarded; 0 if other (reissued, reexamed, design&lt;br /&gt;
 state_country   str3    %9s                   state/country of first inventor listed&lt;br /&gt;
 asscode         float   %9.0g                 assignee code; basic.dta&lt;br /&gt;
 ayear           int     %9.0g                 application year&lt;br /&gt;
 amonth          byte    %9.0g                 application month&lt;br /&gt;
 atype           str1    %9s                 * initial assignee type; see notes&lt;br /&gt;
 class           str3    %9s                   3 digit us pat class&lt;br /&gt;
 subclass        str6    %9s                   patent subclass&lt;br /&gt;
 gdate           int     %d                    grant, or issuance, date&lt;br /&gt;
 industry        str15   %15s                  semi, software, or med devices&lt;br /&gt;
 state_hq        str2    %9s                   firm hq location; vone&lt;br /&gt;
 status06        str4    %9s                 * status of firm known in 2006; rhs truncation varies by sector&lt;br /&gt;
 exitdate        str8    %9s                   exit date, if known&lt;br /&gt;
 exityr          str4    %9s                   exit year, if known&lt;br /&gt;
 status08        str6    %9s                 * status of firm in 2008, see notes&lt;br /&gt;
 last_yr08       int     %8.0g               * exityr if ipo/acq, else 2008&lt;br /&gt;
 dcohort         float   %9.0g                 1 if founding yr during 1987-99&lt;br /&gt;
 lastyr08_minu~r float   %9.0g                 &lt;br /&gt;
 dsearch_assign  float   %9.0g                 1 if searches of pat assignment data need to be conducted; carlosn confirm?&lt;br /&gt;
 carlos_chk      float   %9.0g                 carlos: pls confirm assignment data = compiled for these pats&lt;br /&gt;
 entityid        long    %12.0g                unique startup id as of 2008, vone&lt;br /&gt;
                                             * indicated variables have notes&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
==Detailed Data Processing==&lt;br /&gt;
;*Get the VentureOne data ready&lt;br /&gt;
#Source file for VentureOne data &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt; Original data source&lt;br /&gt;
#Clean it up &amp;lt;code&amp;gt;E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt&amp;lt;/code&amp;gt; extraneous symbols and words removed&lt;br /&gt;
#Match it against itself to get standardized entity names &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Get the patent data ready&lt;br /&gt;
#Draw the distinct assignees &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
#Match them against themselves to get standardized org names for patent data &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2matched.txt &amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Match standardized org names of patent data to standardized entity names of venture data&lt;br /&gt;
:&amp;lt;code&amp;gt;Z:\allpatentsprocessed\Venture Patent Matched.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Join patent data to venture data to get patent information of each venture-backed company&lt;br /&gt;
#Join &amp;lt;code&amp;gt;patent&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;assignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt; which matches assignees to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;matchassignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt; which matches standard org names to patent numbers&lt;br /&gt;
#Join &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;venturepatentmatched&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;fourthjoin_cleaned&amp;lt;/code&amp;gt; which matches standard venture company names to patent numbers&lt;br /&gt;
&lt;br /&gt;
;*Final summary tables&lt;br /&gt;
#Summary table displaying number of patents owned, minimum grant year, maximum grant year and average grant year for each company &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentreallyfinal.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table of all patent information for each company that has patents &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentfullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Notes&lt;br /&gt;
#All data is in the &amp;lt;code&amp;gt;allpatentsprocessed&amp;lt;/code&amp;gt; database. Access it by logging on to &amp;lt;code&amp;gt;researcher@McNair DBServ:/bulk/allpatentsprocessed&amp;lt;/code&amp;gt;&lt;br /&gt;
#A script detailing the processing procedure can be found at &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\patent data script.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==The matched data==&lt;br /&gt;
&lt;br /&gt;
We are giving back two files:&lt;br /&gt;
*One is at the patent level and contains information on 38,497 patents held by 1,557 of the 3,357 companies.&lt;br /&gt;
*The other file is at the company level and aggregates patent information for the 3,357 companies.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;includeonly&amp;gt;&lt;br /&gt;
[[Category: McNair Projects]]&lt;br /&gt;
&amp;lt;/includeonly&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&amp;lt;!-- flush flush --&amp;gt;&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
	<entry>
		<id>http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7129</id>
		<title>Matching VentureOne (Data)</title>
		<link rel="alternate" type="text/html" href="http://www.edegan.com/mediawiki/index.php?title=Matching_VentureOne_(Data)&amp;diff=7129"/>
		<updated>2016-07-19T18:41:13Z</updated>

		<summary type="html">&lt;p&gt;ArielSun: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{McNair Projects&lt;br /&gt;
|Project Title=Matching VentureOne (Data)&lt;br /&gt;
|Topic Area=Patents and Innovation&lt;br /&gt;
|Owner=Ariel Sun, Rosemarie Ziedonis&lt;br /&gt;
|Start Term=Summer 2016&lt;br /&gt;
|Status=Active&lt;br /&gt;
|Deliverable=Other&lt;br /&gt;
|Primary Billing= AccMcNair01&lt;br /&gt;
}}&lt;br /&gt;
=Updated=&lt;br /&gt;
'''New Requirements'''&lt;br /&gt;
#re-run the match using *both* name-related fields in the startups_cl.dta file:  “name” and “name_prev”.&lt;br /&gt;
note:  the latter field pulls in patents applied for under a former name of the same company&lt;br /&gt;
&lt;br /&gt;
#in the output file, please include…&lt;br /&gt;
*include the field “entityid” that corresponds to each startup (this step is critical; else, we can’t link patents filed under alternative names of the company to the same firm); in startups_cl.dta&lt;br /&gt;
*all assignee-related fields in your patent data (e.g., assignee name, and any original and current uspto assignee codes listed for the patent); merge in from your patent files&lt;br /&gt;
&lt;br /&gt;
'''Output'''&lt;br /&gt;
&amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\&amp;lt;/code&amp;gt;&lt;br /&gt;
#summarytablefinal: summary on number of patents and grant year for all companies&lt;br /&gt;
#fullyjoinedtable: all patent and assignee information for entities that have patents (combining 3 and 4)&lt;br /&gt;
#fullyjoinednow: patent information under current name of the company&lt;br /&gt;
#fullyjoinedprec: patent information under previous name of the company &lt;br /&gt;
&lt;br /&gt;
'''notes'''&lt;br /&gt;
  In summarytablefinal table: (for all entities)&lt;br /&gt;
  Variables:&lt;br /&gt;
  Entity Name|Standard Orgname|Number of patent|&lt;br /&gt;
  Previous Name|Previous Standard Orgname|Previous Number of Patent|&lt;br /&gt;
  Total number of Patent|&lt;br /&gt;
  Original ID | Revised ID|&lt;br /&gt;
  min grant year|max grant year|avg grant year|&lt;br /&gt;
&lt;br /&gt;
  ***One company (Z-KAT) has exactly the same entity name and previous name, so its patents are counted twice. Its total number of patents&lt;br /&gt;
  should be 11 instead of 22.&lt;br /&gt;
&lt;br /&gt;
  In fullyjoinedtable: (for entities that have patents)&lt;br /&gt;
  Includes all patent and assignee variables&lt;br /&gt;
  33 variables in total&lt;br /&gt;
  Variables that start with 'asg' hold assignee information, e.g. asgtype = assignee type&lt;br /&gt;
  The rest are patent information.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=Old=&lt;br /&gt;
==Overview==&lt;br /&gt;
In this matching process, we join patent data to VentureOne companies and count the number of patents affiliated with each company.&lt;br /&gt;
&lt;br /&gt;
===Raw Data===&lt;br /&gt;
Original data set of VentureOne companies can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
*All Variables: EntityName,Employees, City, State, Zip, AreaCode, Business Status, IndustryGroup...etc&lt;br /&gt;
*Variables used for matching: EntityName&lt;br /&gt;
&lt;br /&gt;
Original patent data is in our database: &amp;lt;code&amp;gt;128.42.44.181/bulk/allpatent&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Procedure===&lt;br /&gt;
We first get the standard company names for VentureOne companies from the source VentureOne data set. Then we standardize the names of the companies that have patents from our patent database. Based on the common standard company names, we join patent information to VentureOne companies.&lt;br /&gt;
&lt;br /&gt;
===Final Matched Tables===&lt;br /&gt;
#Summary table displaying number of patents owned, minimum grant year, maximum grant year and average grant year for each company (including the ones that own no patents). It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturesummary.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table containing all patent information for the companies that have patents. It can be found at: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturefullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Desired Variables===&lt;br /&gt;
&lt;br /&gt;
Below is the list of variables that were in the STATA file we were given:&lt;br /&gt;
 Contains data from C:\Users\ArielSun\Downloads\allpats_3sectors_06jun13.dta&lt;br /&gt;
   obs:        19,409                          &lt;br /&gt;
  vars:            36                          11 Jun 2016 17:31&lt;br /&gt;
  size:    10,655,541                          (_dta has notes)&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
               storage   display    value&lt;br /&gt;
 variable name   type    format     label      variable label&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
 id_vone         double  %9.0g                 VentureOne id&lt;br /&gt;
 name            str39   %39s                  startup name&lt;br /&gt;
 patent          str9    %9s                   patno in string&lt;br /&gt;
 apn             str6    %6s                   pat application number&lt;br /&gt;
 nmi             str40   %40s                  inventor name&lt;br /&gt;
 ttl             str244  %40s                  invention title&lt;br /&gt;
 nma             str65   %65s                  original assignee&lt;br /&gt;
 ocd             str15   %15s                  main us patent class&lt;br /&gt;
 icd             str15   %15s                  main intl patent class&lt;br /&gt;
 apd             float   %td                   application date&lt;br /&gt;
 gdateold        float   %td                   Grant date&lt;br /&gt;
 fnd_year        float   %8.0g                 startup founding year&lt;br /&gt;
 last_yr         float   %9.0g               * OLD last_yr, 2006; see notes&lt;br /&gt;
 source          byte    %8.0g                 1 if 2012 delphion searches; else from 2004/5 search&lt;br /&gt;
 pdate           float   %td                   priority date, delphion; may pre-date application date if provisional apps&lt;br /&gt;
 utility         float   %9.0g               * 1 if utility patent as initially awarded; 0 if other (reissued, reexamed, design&lt;br /&gt;
 state_country   str3    %9s                   state/country of first inventor listed&lt;br /&gt;
 asscode         float   %9.0g                 assignee code; basic.dta&lt;br /&gt;
 ayear           int     %9.0g                 application year&lt;br /&gt;
 amonth          byte    %9.0g                 application month&lt;br /&gt;
 atype           str1    %9s                 * initial assignee type; see notes&lt;br /&gt;
 class           str3    %9s                   3 digit us pat class&lt;br /&gt;
 subclass        str6    %9s                   patent subclass&lt;br /&gt;
 gdate           int     %d                    grant, or issuance, date&lt;br /&gt;
 industry        str15   %15s                  semi, software, or med devices&lt;br /&gt;
 state_hq        str2    %9s                   firm hq location; vone&lt;br /&gt;
 status06        str4    %9s                 * status of firm known in 2006; rhs truncation varies by sector&lt;br /&gt;
 exitdate        str8    %9s                   exit date, if known&lt;br /&gt;
 exityr          str4    %9s                   exit year, if known&lt;br /&gt;
 status08        str6    %9s                 * status of firm in 2008, see notes&lt;br /&gt;
 last_yr08       int     %8.0g               * exityr if ipo/acq, else 2008&lt;br /&gt;
 dcohort         float   %9.0g                 1 if founding yr during 1987-99&lt;br /&gt;
 lastyr08_minu~r float   %9.0g                 &lt;br /&gt;
 dsearch_assign  float   %9.0g                 1 if searches of pat assignment data need to be conducted; carlosn confirm?&lt;br /&gt;
 carlos_chk      float   %9.0g                 carlos: pls confirm assignment data = compiled for these pats&lt;br /&gt;
 entityid        long    %12.0g                unique startup id as of 2008, vone&lt;br /&gt;
                                             * indicated variables have notes&lt;br /&gt;
 ----------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
&lt;br /&gt;
==Detailed Data Processing==&lt;br /&gt;
;*Get the VentureOne data ready&lt;br /&gt;
#Original VentureOne source file: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Venture Data 1.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
#Clean it up by removing extraneous symbols and words: &amp;lt;code&amp;gt;E:\McNair\Software\Scripts\Matcher\Input\Venture Data 1.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#Match it against itself to get standardized entity names: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\Cleaned and Matched Data.xlsx&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
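The cleaning and self-matching steps above can be pictured with a minimal Python sketch. It is not the Matcher script itself: the stop-word list, the function names, and the rule of grouping by the cleaned key are all assumptions made for illustration.&lt;br /&gt;

```python
import re

# Hypothetical list of extraneous words; the real Matcher input rules may differ.
STOP_WORDS = {"INC", "CORP", "CORPORATION", "LLC", "LTD", "CO"}


def normalize(name):
    """Upper-case a name, drop symbols, and remove extraneous words."""
    cleaned = re.sub(r"[^A-Za-z0-9 ]", " ", name.upper())
    kept = [word for word in cleaned.split() if word not in STOP_WORDS]
    return " ".join(kept)


def standardize(names):
    """Map every raw name to the cleaned key shared by its variants."""
    return {raw: normalize(raw) for raw in names}
```

Under these assumptions, two spellings of the same company collapse to one standardized name, which is the property the later joins rely on.&lt;br /&gt;
&lt;br /&gt;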
;*Get the patent data ready&lt;br /&gt;
#Draw the distinct assignees: &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#Match them against themselves to get standardized org names for patent data: &amp;lt;code&amp;gt;Z:\allpatentsprocessed\DistinctAssignees2matched.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Match standardized org names of patent data to standardized entity names of venture data&lt;br /&gt;
:&amp;lt;code&amp;gt;Z:\allpatentsprocessed\Venture Patent Matched.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
;*Join patent data to venture data to get patent information of each venture-backed company&lt;br /&gt;
#Join &amp;lt;code&amp;gt;patent&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;assignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt;, which matches assignees to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;firstjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;matchassignee&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt;, which matches standardized org names to patent numbers.&lt;br /&gt;
#Join &amp;lt;code&amp;gt;secondjoin_cleaned&amp;lt;/code&amp;gt; data to &amp;lt;code&amp;gt;venturepatentmatched&amp;lt;/code&amp;gt; data, creating &amp;lt;code&amp;gt;fourthjoin_cleaned&amp;lt;/code&amp;gt;, which matches standardized venture company names to patent numbers.&lt;br /&gt;
&lt;br /&gt;
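The chain of joins above can be sketched against an in-memory SQLite database. Only the table names come from the steps above. Every column name (&amp;lt;code&amp;gt;patno&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;assignee_name&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;std_org_name&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;std_venture_name&amp;lt;/code&amp;gt;) and every sample row is a hypothetical stand-in, and the actual script may differ.&lt;br /&gt;

```python
import sqlite3


def build_joins(conn):
    """Run the three joins; column names are assumed, not taken from the real script."""
    cur = conn.cursor()
    # Step 1: attach each patent number to its raw assignee name.
    cur.execute("""
        CREATE TABLE firstjoin_cleaned AS
        SELECT p.patno, a.assignee_name
        FROM patent p JOIN assignee a ON a.patno = p.patno
    """)
    # Step 2: swap raw assignee names for standardized org names.
    cur.execute("""
        CREATE TABLE secondjoin_cleaned AS
        SELECT f.patno, m.std_org_name
        FROM firstjoin_cleaned f
        JOIN matchassignee m ON m.assignee_name = f.assignee_name
    """)
    # Step 3: map standardized org names to standardized venture names.
    cur.execute("""
        CREATE TABLE fourthjoin_cleaned AS
        SELECT s.patno, v.std_venture_name
        FROM secondjoin_cleaned s
        JOIN venturepatentmatched v ON v.std_org_name = s.std_org_name
    """)
    conn.commit()


def demo():
    """Build tiny made-up tables and return the final patent-to-venture rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE patent (patno TEXT, grant_year INTEGER);
        CREATE TABLE assignee (patno TEXT, assignee_name TEXT);
        CREATE TABLE matchassignee (assignee_name TEXT, std_org_name TEXT);
        CREATE TABLE venturepatentmatched (std_org_name TEXT, std_venture_name TEXT);
        INSERT INTO patent VALUES ('1000001', 1999), ('1000002', 2001), ('1000003', 2003);
        INSERT INTO assignee VALUES ('1000001', 'Acme Inc.'), ('1000002', 'ACME, INC'), ('1000003', 'Other Co');
        INSERT INTO matchassignee VALUES ('Acme Inc.', 'ACME'), ('ACME, INC', 'ACME'), ('Other Co', 'OTHER');
        INSERT INTO venturepatentmatched VALUES ('ACME', 'ACME');
    """)
    build_joins(conn)
    rows = conn.execute(
        "SELECT patno, std_venture_name FROM fourthjoin_cleaned ORDER BY patno"
    ).fetchall()
    conn.close()
    return rows
```

Because each step is an inner join, patents whose assignee never matches a venture-backed company drop out of the final table, as the third made-up patent does here.&lt;br /&gt;
&lt;br /&gt;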
;*Final summary tables&lt;br /&gt;
#Summary table displaying the number of patents owned, minimum grant year, maximum grant year, and average grant year for each company: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentreallyfinal.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
#A table of all patent information for each company that has patents: &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\venturepatentfullyjoined.txt&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
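The per-company summary described above (patent count plus minimum, maximum, and average grant year) amounts to a simple group-by. The sketch below assumes the joined data is available as (company, grant year) pairs. The function name and the input shape are illustrative only.&lt;br /&gt;

```python
from collections import defaultdict


def summarize(rows):
    """Return a dict mapping each company to (count, min, max, mean) of grant years.

    rows is an iterable of (company, grant_year) pairs, one per patent.
    """
    years = defaultdict(list)
    for company, grant_year in rows:
        years[company].append(grant_year)
    return {
        company: (len(ys), min(ys), max(ys), sum(ys) / len(ys))
        for company, ys in years.items()
    }
```

Companies without any matched patents never appear in the input pairs, so they would have to be added separately (for example with a count of zero) to cover every company in the company-level file.&lt;br /&gt;
&lt;br /&gt;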
;*Notes&lt;br /&gt;
#All data is in the &amp;lt;code&amp;gt;allpatentsprocessed&amp;lt;/code&amp;gt; database. Access it by logging on to &amp;lt;code&amp;gt;researcher@McNair DBServ:/bulk/allpatentsprocessed&amp;lt;/code&amp;gt;.&lt;br /&gt;
#A script detailing the processing procedure can be found at &amp;lt;code&amp;gt;E:\McNair\Projects\Venture One Data\patent data script.txt&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==The matched data==&lt;br /&gt;
&lt;br /&gt;
We are giving back two files:&lt;br /&gt;
*One is at the patent level and contains information on 38,497 patents held by 1,557 of the 3,357 companies.&lt;br /&gt;
*The other is at the company level and aggregates patent information for all 3,357 companies.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;includeonly&amp;gt;&lt;br /&gt;
[[Category: McNair Projects]]&lt;br /&gt;
&amp;lt;/includeonly&amp;gt;&lt;/div&gt;</summary>
		<author><name>ArielSun</name></author>
		
	</entry>
</feed>