Difference between revisions of "AngelList Database"


Revision as of 18:20, 16 April 2019


Project Information
Title: AngelList Database
Project status: Active
Copyright © 2019 edegan.com. All Rights Reserved.


The purpose of this project is to build a database of incubators, perhaps as well as other ecosystem organizations, from AngelList.

So Far

Process from before:

  • Opened source link (http://www.angel.co)
  • Typed "incubator" in the search box
  • Clicked on "Search for 'incubator'"

500 Results

Revised process:

  • Visit https://angel.co/search?q=incubator
  • Click More repeatedly until no further results load
  • Save the HTML page as E:\projects\AngelList\AngelList.html
  • That yields 500 results (out of the 1,447 claimed)
  • Process the HTML using regular expressions to produce AngelListPages.txt, which has one tab-separated line per result in the format:
    • URL\tConame
  • Note that restricting the search to "Companies" reduces the claimed total to 1,339 results.
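The regular-expression step in the revised process could look roughly like the sketch below. It is not the script actually used for this project. The link pattern is an assumption (anchors whose href points at angel.co, with the company name as the link text), and the function names `extract_pages` and `to_tsv` are hypothetical; the real markup of the saved page may need a different pattern.

```python
import html
import re

# ASSUMPTION: each result is an <a> tag whose href points at angel.co and
# whose link text is the company name. Adjust to the saved page's real markup.
LINK_RE = re.compile(
    r'<a\b[^>]*\bhref="(https?://angel\.co/[^"#?]+)"[^>]*>(.*?)</a>',
    re.IGNORECASE | re.DOTALL,
)
TAG_RE = re.compile(r"<[^>]+>")


def extract_pages(page_html):
    """Return unique (url, name) pairs in the order they first appear."""
    seen = set()
    rows = []
    for url, inner in LINK_RE.findall(page_html):
        # Drop nested tags, decode HTML entities, collapse whitespace.
        name = " ".join(html.unescape(TAG_RE.sub("", inner)).split())
        if name and url not in seen:
            seen.add(url)
            rows.append((url, name))
    return rows


def to_tsv(rows):
    """Format rows as URL<TAB>Coname lines, one per result."""
    return "".join(f"{url}\t{name}\n" for url, name in rows)
```

To use it, read E:\projects\AngelList\AngelList.html into a string, pass it to `extract_pages`, and write the output of `to_tsv` to AngelListPages.txt. A pattern this loose will also pick up navigation links to angel.co, so the output would need a manual check.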

Failed workarounds

Tried a workaround using paged results, but at 40 results per page, page 13 ends with "No Results Yet" after clicking More, and page 14 opens with it. So the search is still capped at 500 results.
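As a quick check on those figures, the arithmetic below shows where a 500-result ceiling falls at 40 results per page. The helper name `cap_boundary` is hypothetical, and the calculation assumes every page before the last is full.

```python
import math


def cap_boundary(result_cap, per_page):
    """Return (last page with results, number of results on that page)."""
    last_page = math.ceil(result_cap / per_page)
    on_last_page = result_cap - (last_page - 1) * per_page
    return last_page, on_last_page


# Figures observed above: 40 results per page, 500-result ceiling.
last_page, on_last_page = cap_boundary(500, 40)
print(last_page, on_last_page)  # prints: 13 20
```

Twelve full pages give 480 results, so the remaining 20 land on page 13 and page 14 has nothing to show, which matches what the site returned.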

It appears from the format of the results that AngelList has a type "incubator", though some likely incubators have other types (e.g., BMW iVentures Incubator is a "VC Firm" and Austin Technology Incubator is a "Company"). I can't see a way to restrict the search by type.

Signed up for an account as Ed Egan, ed@edegan.com, littleAmount. Then the link More -> Incubators takes you to https://angel.co/accelerators/apply. But there doesn't seem to be an advanced search. Count of incubator results increased while on the site!

Another Dead End

The page https://angel.co/incubators shows 6,054 companies. The list stopped growing after 20 clicks on More, at 400 results. Saved page as E:\projects\AngelList\Incubator - CompanyTypes - AngelList.html

Given the page title, this is likely just the organizations with the "Incubator" company type. However, some useful information could still be extracted from that page alone.

Possible Process

Tried searching incubator TX, but it looks like only the names and text descriptions are searched. Tried searching "incubator a", "incubator b", and "incubator c", and each returned fewer than 500 results, so splitting the search into one query per letter and combining the results might work.
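A minimal sketch of that split-query idea follows. It only builds the search URLs and merges rows that have already been extracted. It does not fetch anything, and whether the letter queries together cover every result is untested. All function names are hypothetical, and the `q` parameter is taken from the search URL in the revised process above.

```python
import string
from urllib.parse import urlencode

SEARCH_URL = "https://angel.co/search"  # from the revised process above
RESULT_CAP = 500                        # observed ceiling per search


def split_queries(base="incubator", suffixes=string.ascii_lowercase):
    """Build one query per suffix: 'incubator a', 'incubator b', ..."""
    return [f"{base} {s}" for s in suffixes]


def search_url(query):
    """Build the search URL for one query string."""
    return f"{SEARCH_URL}?{urlencode({'q': query})}"


def merge_rows(batches):
    """Merge per-query lists of (url, name), keeping the first name per URL."""
    merged = {}
    for rows in batches:
        for url, name in rows:
            merged.setdefault(url, name)
    return sorted(merged.items())


def capped_queries(counts, cap=RESULT_CAP):
    """Return queries whose result count hits the cap and need a finer split."""
    return [query for query, n in counts.items() if n >= cap]
```

For example, `search_url("incubator a")` gives `https://angel.co/search?q=incubator+a`. Any query flagged by `capped_queries` would need a finer split (say, two-letter suffixes) before its results can be trusted as complete.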