Changes

89 bytes removed ,  17:35, 2 September 2016
no edit summary
This page represents the work used for mechanical turks for Hubs. As of Spring 2016, a list of potential Hubs with a set of characteristics was created. Many of these are not what will be defined as Hubs. We will be creating a scorecard to help subjectively define Hubs based on certain characteristics.
=Hubs Pages=
*The main page for the paper can be found at [[Hubs (Academic Paper)]].
*For the current work in progress on building the Hubs datasheet for the scorecard, go to [[Hubs: Hubs Scorecard]].
*For a tracker of work in progress on the dataset building for the scorecard, go to [[Hubs: Hubs with Data Building]].
*For a high-level overview of the variables for the scorecard, go to [[Hubs: Hubs Data]].
=List of Variables=
For a more in-depth description of the variables and procedure, please see [[Hubs: Hubs Scorecard]]. For more information on Mechanical Turks in general, please see [[Mechanical Turk (Tool)]]. This page will reflect the variables being collected, separated into three categories. Each variable will include a breakdown of the levels being collected (if the definition is not trivial) and an approximate approach.
=Variables for Hubs=
We will be creating a "Hubs scorecard" to determine how hub-like potential spaces are. In order to do so, we will evaluate the places based on certain variables. Previous variables for potential hubs were collected. Below, we list those as well as other variables we think might be helpful to build out the scorecard.
Ideally, we would have the following variables (not collected previously):
*Onsite VC/Angel/Investors (Count or binary)
*Onsite Mentors (binary) --- ''Are these the same as advisers?''
*"Office hours" with investors or mentors (binary) --- ''note: previously collected included number of events, but did not separate them into categories (e.g. networking events, workshops, etc.). We view this separation as important''
*Onsite temporary workshops (binary or count) *** '''see mechanical turk'''
*Networking Meetups (Binary or count) *** '''see mechanical turk'''
*Sponsors and Partners (binary and list) --- ''are these the same?''
*Alumni Network (binary) --- ''not all potential hubs list this, and the fact that some do might indicate its importance''
*Num of Companies --- ''to help determine size, as getting physical square footage is difficult''
*Nonprofit (binary) --- ''helpful in determining goals of potential hubs''
*Mission Includes Key Buzzwords (e.g. "ecosystem", "community") --- ''helps separate simple coworking spaces from hubs''
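The buzzword variable above could be coded mechanically once mission statements are collected. The following is only a minimal sketch: the function name <code>mission_has_buzzword</code> and the two-word list are assumptions for illustration (the page names "ecosystem" and "community" only as examples), not the scorecard's actual coding rule.

```python
import re

# Hypothetical buzzword list: the page gives only these two words as examples.
BUZZWORDS = ("ecosystem", "community")


def mission_has_buzzword(mission, buzzwords=BUZZWORDS):
    """Return 1 if any buzzword appears as a whole word in the mission text, else 0."""
    text = mission.lower()
    return int(any(re.search(r"\b" + re.escape(word) + r"\b", text)
                   for word in buzzwords))


if __name__ == "__main__":
    print(mission_has_buzzword("We grow the local startup ecosystem."))
    print(mission_has_buzzword("Flexible desks and private offices for rent."))
```

Whole-word matching means plural or derived forms such as "communities" are not counted unless they are added to the list explicitly.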
Example of Prior Variables Collected:
*Specific Industry --- ''defined as LinkedIn self-identifier, no categories, just plain text. We think what we really want is to see if they have a specialty (e.g. healthcare)''
*Num of Events --- ''relatively complete inputs, but from March 2016 (see above as well)''
*Price for Single Space --- ''defined as price for flexible desk, relatively complete inputs''
*Price for Office --- ''no inputs''
*Twitter Activity (Multinomial or Count) --- ''High=2/Moderate=1/No=0, no explanations on how to categorize the activity. Also no handles''
*Size (sqft) --- ''no records for majority of the companies''
*Num Conference Rooms --- ''no records for majority of the companies''
*Onsite accelerator (binary) --- ''relatively complete inputs''
*Onsite code school (binary) --- ''relatively complete inputs''
*Community Membership (binary) --- ''relatively complete inputs''
'''07/29''' Ariel: code for the Hubs variables: <code>E:/McNair/Projects/Hubs/Hubs Variable Ariel</code>
=Test2=
*'''Twitter activity''':
'''UPDATE (7/14)''': Updated turk to reflect our desired formats
'''UPDATE (7/12)''': '''AUDIT RESULTS''': We noticed
'''UPDATE (7/11)''': Uploaded and published on Amazon's Mechanical Turk site. Given the time cost of either recording the number of tweets in a month or looking up more than 10 tweets, we decided to record the date of the 10th most recent tweet. Using a sample of ~10 companies, we noticed minimal differences in the data when using 10, 20, or 30 tweets.
#Copy the text in the Search Text into a search engine.
#Click on the result from twitter.com with the company name. If the link does not appear on the first 3 pages, record DNE for both outputs.
#Record the company's Twitter Handle into Twitter Handle.
#Record the date (MM/DD/YY) of the 10th most recent tweet for Twitter Activity. If there are fewer than 10 tweets, record DNE.
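The Twitter Activity rule in the steps above can be sketched in a few lines. This is an illustration only: the function name <code>twitter_activity</code> and the assumption that tweet dates are already available as Python <code>date</code> objects are hypothetical, since the page has Turkers read the date by hand.

```python
from datetime import date


def twitter_activity(tweet_dates, n=10):
    """Return the date of the n-th most recent tweet as MM/DD/YY, or "DNE".

    "DNE" is returned when the account has fewer than n tweets.
    """
    if len(tweet_dates) < n:
        return "DNE"
    newest_first = sorted(tweet_dates, reverse=True)
    return newest_first[n - 1].strftime("%m/%d/%y")


if __name__ == "__main__":
    # Twelve tweets, one per day from July 1 to July 12, 2016.
    sample = [date(2016, 7, day) for day in range(1, 13)]
    print(twitter_activity(sample))      # 10th most recent is July 3
    print(twitter_activity(sample[:4]))  # fewer than 10 tweets
```

An older date for the 10th most recent tweet indicates a less active account, which is why a single date can stand in for a monthly tweet count.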
*'''NUMBER OF EVENTS''': ''UPDATE: written, not published, on amazon's mechanical turk site''
'''Considerations'''
*Difficulties Encountered
*Expected Time to Complete
*Expectation of Results (accuracy of turk, comprehensiveness)
*Other Comments
'''Procedure'''
#Copy the text in the Search Text into a search engine.
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.
#Look for links related to events, such as 'Events' or 'Calendar' on the homepage.
#If not found on the homepage, check 'About' and check 'Community'.
#Count the number of events in July 2016 and record it. If there is no information of events on the website, record DNE.
Note***: Events include meetups, workshops, info sessions etc. We do not want to count them separately since it is difficult to do so. Most companies put all the events in the same section and do not put event types in the titles of the events. We have to look into the details of the events to find out the type, and even when we do so some event descriptions do not allow us to determine the type easily. Differentiating the types of the events demands more time and effort and therefore is not suitable to be a mechanical turk project.
*'''Onsite Mentors''': ''UPDATE: written, not published, on amazon's mechanical turk site''
#Copy the text in the Search Text into a search engine.
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.
#Look for links related to mentorship such as 'mentors', 'mentorship' or 'mentoring programs'.
#If the key words can be identified, mark as 1.
#If there is no explicit 'mentoring' section, look for links related to a description of the company, such as 'About,' 'Our Team,' 'Our Mission,' etc., and look for a subsection or mention of mentor/mentorship/mentoring.
#If these exist, mark as 1.
#If not, go to links related to membership 'benefits,' 'perks,' or related.
#Do same process as end of 4 and 5.
#If there is no mention of mentorship in these sections, type the company, city, and 'mentoring' into a search engine. If a link to a reliable website (such as Desktime) appears and mentorship can be found in the description, mark as 1.
#If none of these steps result in a mark of 1, mark as 0.
*'''Nonprofit''': ''UPDATE: written, not published, on amazon's mechanical turk site''
#Copy the text in the Search Text into a search engine.
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.
#Go to links that describe the company, usually they are labelled: 'About', 'Our Story,' 'Mission'.
#Look for the key word 'nonprofit'/'non-profit'.
#If 'nonprofit' is identified, mark as 1, otherwise 0.
*'''Number of Members''': ''UPDATE: written, not published, on amazon's mechanical turk site''
#Copy the text in the Search Text into a search engine.
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.
#Look for the link 'Members' or 'Residents', usually they are under the links 'Community', 'Membership', 'Our Space' or 'The Space'.
#Count the number of members.
#If the link or section of 'Members' is not found, go to 'Community' and 'Coworking' and look for the description of the number of startups/founders/members in the community. Record the number.
#If number of members cannot be identified using above steps, record DNE.
*'''Sponsors and Partners''': ''UPDATE: written, not published, on amazon's mechanical turk site''
#Copy the text in the Search Text into a search engine.
#Click on the result that is the website of the company. If there does not exist a listing on the first three pages, mark as DNE.
#Look for the link or mention of 'Sponsors' or 'Partners', which is often under the section of 'About', 'Community', or related sections.
#If sponsors or partners can be found mark as 1 and list them, otherwise mark as 0.
'''As of Week of 7/25'''
===Group 1===
'''Variables Difficult to Obtain'''
#'''Founding Date''' ''(date_founded)''
#*''' ''Difficulty:'' ''' Finding date based on our strategies
#*''' ''New Approach:'' '''
#*#Whois.net Date
#*#Factiva/other press release searches
#'''Multiple locations within city + Franchise''' (as of now just addresses) ''(multi_address)''
#*''' ''Difficulty:'' ''' Company or establishment level will impact measurements
#*''' ''New Approach:'' ''' Will record all addresses at company level
#'''Onsite Venture Capital v. Angel Investors''' (e.g. # and Assets Under Management) ''(onsite_Vc_bin)/(onsite_vc_list)'' ''(onsite_angel_bin)/etc.''
#*''' ''Levels:'' ''' Binary, list of investors
#*''' ''Difficulty:'' ''' Hub website usually does not include investors
#*''' ''New Approach:'' '''
#*#Google key terms with address of Hub
#*#Start with partners and use google/crunchbase
===Group 2===
'''Variables Comfortable, Not Complete''' (rough order of most difficult to least difficult)
#'''Onsite accelerator''' ''(onsite_accel_bin)/(onsite_accel_cnt)/(onsite_accel_list)''
#*''' ''Levels:'' ''' Binary, count, list
#*''' ''Difficulty:'' ''' Usually not a list, which requires more scrubbing as many other variables just require us to find one page on a website.
#*''' ''Approach:'' '''
#*#Google searches and procedure to use on the website yields decent results
#*#Similar procedure to onsite investors
#'''Size (# members)''' ''(num_members)''
#*''' ''Levels:'' ''' Count for companies (currently not planning to include list of companies given that some potential hubs have 200+ members)
#*''' ''Difficulty:'' ''' Some companies don't list all members (only selected ones), others do not separate current members and alumni, and some just write "we have served more than 120 startups..."
#*''' ''Approach:'' ''' For companies that have a list, we will count. For those with select members, we will count those they listed and try to see if there is a comment about how many they have. For those that just have a statement "with over," we will write the number and + (e.g. "120+").
#'''Office hours investors''' and '''Office hours mentor/advisors''' ''(OH_bin)/(OH_inv_bin)/(OH_inv_list)/etc.''
#*''' ''Levels:'' ''' Binary for OH, binary for two separate OH, list of names/descriptions of OH
#*''' ''Difficulty:'' ''' Some companies do not list who OH are with, not always obvious if investor, mentor, or advisor, sometimes not clear if mentor is investor/future investor
#*''' ''Approach:'' ''' Google approach to get to OH pages and then look up key words in description to separate out
#'''Onsite temporary workshops and Networking Meetups''' (Count) ''(onsite_temp_events_bin)/(onsite_temp_workshop_bin)/(onsite_temp_workshop_cnt)/etc.''
#*''' ''Levels:'' ''' Binary for do they exist, count for each
#*''' ''Difficulty:'' ''' Difficult for Turkers to differentiate between these two and also other potential events (e.g. symposiums)
#*''' ''Approach:'' ''' Uses key search terms (e.g. Java/etc.) to separate out workshops and key terms (e.g. lunch/happy hour) for networking meetings
#'''Onsite code school''' and '''Curriculum''' ''(onsite_long_term_courses)/(onsite_code_school_bin)''
#*''' ''Levels:'' ''' Binary for do they exist, binary for each
#*''' ''Difficulty:'' ''' Difficult for Turkers to differentiate between long-term coding programs for individuals and curriculum for startups
#*''' ''Approach:'' ''' Uses key search terms (e.g. specific code schools) to separate out known code schools and also to look into key terms (e.g. leadership) for curriculum
#'''Sponsors/Partners''' (University, Corporate) ''(sponsors_cnt)/(sponsors_list)/etc.''
#*''' ''Levels:'' ''' Count, list of sponsors/partners (if exist), separate columns for university and corporate
#*''' ''Difficulty:'' ''' Not all companies will list sponsors, partners, or either. Not always clear the difference among sponsors, partners, investors.
#*''' ''Approach:'' ''' Use two different levels and use of google search, then if list exists, separate by "college"/"university" and rest
#'''Alumni Network''' ''(alumni_bin)/(alumni_list)''
#*''' ''Levels:'' ''' Binary, list
#*''' ''Difficulty:'' ''' Not all companies list alumni, some only list "selected"
#*''' ''Approach:'' ''' Include all that have lists
#'''Size (sqft)''' ''(size_sqft)''
#*''' ''Levels:'' ''' Number in sqft
#*''' ''Difficulty:'' ''' Not all companies list square feet online
#*''' ''Approach:'' '''
#*#Google search with key words
#*#If results do not appear, check whether use of press releases is possible
#'''Onsite Mentors''' ''(onsite_mentors_bin)/(onsite_mentors_cnt)/(onsite_mentors_list)''
#*''' ''Levels:'' ''' Count and list of mentors (if exist)
#*''' ''Difficulty:'' ''' Not all companies list mentors - bigger issue is onsite investors
#*''' ''Approach:'' ''' Use two different levels and use of google search
===Group 3===
'''Variables Easy to Obtain'''
#'''Twitter activity''' ''(twit_handle)/(twit_prev_mon_cnt_tweets)/(twit_cnt_followers)/(twit_cnt_retweets)''
#*''' ''Levels:'' ''' Twitter Handle, # Tweets in Month, # Followers, # Retweets
#*''' ''Approach:'' ''' Easy to get twitter handle from Turk or Veeral's code that allows us to run a series of searches on google and then use Gunny's Twitter crawler to get other levels from handle
#'''Site URL''' ''(url)''
#*''' ''Levels:'' ''' URL
#*''' ''Approach:'' ''' Google using Veeral's code that allows us to search
#'''Whois Date''' ''(date_whois)''
#*''' ''Levels:'' ''' Date
#*''' ''Approach:'' ''' Date active website was registered
#'''Address''' ''(address)''
#*''' ''Levels:'' ''' Will include all addresses
#*''' ''Approach:'' ''' Google key terms (e.g. Contact Us) and URL using Veeral's code
#'''Nonprofit status''' ''(nonprofit_binary)''
#*''' ''Levels:'' ''' Binary variable indicating if the potential Hub is a nonprofit organization
#*''' ''Approach:'' ''' http://www.guidestar.org/ is a site that we can use to search if a company is nonprofit or not
#'''Mission statement''' ''(missions_stmt)''
#*''' ''Levels:'' ''' Official mission statement or description of the company (if mission does not exist)
#*''' ''Approach:'' ''' If not explicitly stated mission statement, will include "About" or statements on main page
#'''Specific Industry''' ''(spec_industry)''
#*''' ''Levels:'' ''' Industry included in the statement (no aggregation)
#*''' ''Approach:'' ''' Based on Mission Statement, not aggregated
#'''Price for a space/office''' ''(price_space)''
#*''' ''Levels:'' ''' Two prices, one for shared, other for private
#*''' ''Approach:'' ''' Uses google methodology with key terms and URL
[[Category: Internal]][[Internal Classification: Legacy| ]]
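The approach for Size (# members) distinguishes hubs that publish a member list from hubs that only state a lower bound such as "more than 120 startups". A minimal sketch of that recording rule follows. The function name, the regular expression, and the choice to return "DNE" when nothing is found are assumptions for illustration; the case of partially listed ("selected") members still needs a manual check and is not handled here.

```python
import re

# Phrases assumed to signal a lower bound. Only "more than" and "with over"
# appear on the page; treating any "over <number>" this way is an assumption.
LOWER_BOUND = re.compile(r"\b(?:more than|over)\s+(\d[\d,]*)", re.IGNORECASE)


def num_members(member_list=None, statement=""):
    """Return the value to record for num_members as a string.

    A full member list is counted directly. Otherwise a statement such as
    "more than 120 startups" is recorded as "120+". "DNE" means neither
    source produced a number.
    """
    if member_list:
        return str(len(member_list))
    match = LOWER_BOUND.search(statement)
    if match:
        return match.group(1).replace(",", "") + "+"
    return "DNE"


if __name__ == "__main__":
    print(num_members(member_list=["Startup A", "Startup B", "Startup C"]))
    print(num_members(statement="we have served more than 120 startups..."))
```

Keeping the result as a string lets exact counts and lower bounds such as "120+" share one column, at the cost of needing a parsing step before any numeric analysis.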
