Changes

Jump to navigation Jump to search
no edit summary
{{Project
|Has project output=Data
|Has sponsor=Kauffman Incubator Project
|Has title=Ecosystem Organization Classifier
|Has owner=Libby Bassini, Anne Freeman,
The purpose of this project is to build a classifier, which takes the description of an ecosystem organization (i.e., a startup, a venture capitalist, an incubator, etc.) and either correctly classifies the organization's type or correctly classifies incubators vs. non-incubators.
 
==Related Projects==
 
Subsumed Projects:
{{#show: Ecosystem Organization Classifier|?Does subsume}}
 
This project is dependent on:
{{#show: Ecosystem Organization Classifier|?Is dependent on}}
===Text Processing===
We can use [[The Matcher (Tool)]] to match organization names to portfolio companies and VC funds and firms taken from '''vcdb3''' (see [[VentureXpert Database]]). We will also search this data by hand for incubators to get an initial set. Later on, we'll match our early list of incubators to crunchbase organization names to expand our list.
 
==Related Projects==
 
Subsumed Projects:
{{#show: Ecosystem Organization Classifier|?Does subsume}}
 
This project is dependent on:
{{#show: Ecosystem Organization Classifier|?Is dependent on}}
==Incubator Scores of Crunchbase Results==
==Process Notes for Calculating Incubator Scores==
 
Two new files were generated from the '''crunchbase3''' dbase as follows:
 
\COPY (SELECT uuid, company_name, short_description FROM Organizations WHERE country_code='USA' AND short_description LIKE '%incubat%') TO 'CrunchbaseShortOrgDescsUSAIncubat.txt' WITH DELIMITER AS E'\t' HEADER NULL AS '' CSV
--466
\COPY (SELECT A.uuid, A.company_name, B.description FROM Organizations AS A JOIN organization_descriptions AS B on A.uuid=B.uuid WHERE country_code='USA' AND description LIKE '%incubat%') TO 'CrunchbaseLongOrgDescsUSAIncubat.txt' WITH DELIMITER AS E'\t' HEADER NULL AS '' CSV
--933
 
These files were put in E:\projects\crunchbase3.
1. New file - Renamed E:\projects\crunchbase3\organizations as E:\projects\crunchbase3\organizations_OnlyIncubators_PlusIncubatorScores

Navigation menu