Difference between revisions of "Crunchbase Accelerator Founders"

Revision as of 17:17, 10 July 2018


McNair Project
Crunchbase Accelerator Founders
Project Information
Project Title: Crunchbase Accelerator Founders
Owner: Grace Tan
Start Date: 6/18/18
Deadline:
Primary Billing:
Notes:
Has project status: Active
Copyright © 2016 edegan.com. All Rights Reserved.


Related Pages

Crunchbase Data, Crunchbase Accelerator Equity, LinkedIn Crawler (Python)

Getting Data

To get the founder UUIDs for an accelerator, insert the accelerator's UUID (or its name in all lowercase, if the name is a single word) into this link:

 https://api.crunchbase.com/v3.1/organizations/ + UUID of organization + ?relationships=founders&user_key=662e263576fe3e4ea5991edfbcfb9883
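
As a minimal sketch of how the link above can be assembled in Python: the helper name founders_url and its parameters are hypothetical, not taken from scrapefounders.py, and YOUR_USER_KEY is a placeholder for the user key.

```python
from urllib.parse import quote, urlencode

# Base of the link shown above.
API_ROOT = "https://api.crunchbase.com/v3.1/organizations/"


def founders_url(org_id: str, user_key: str) -> str:
    """Build the founders link for one organization.

    org_id is the accelerator's UUID, or its one-word name.
    Lowercasing follows the note above about one-word names.
    """
    query = urlencode({"relationships": "founders", "user_key": user_key})
    return f"{API_ROOT}{quote(org_id.strip().lower(), safe='')}?{query}"


if __name__ == "__main__":
    # Hypothetical one-word accelerator name and placeholder key.
    print(founders_url("Techstars", "YOUR_USER_KEY"))
```

Building the query string with urlencode keeps the key and relationship name correctly escaped if either ever contains special characters.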

Code

scrapefounders.py

This code lives in Z:\crunchbase2\scrapefounders.py

This program reads Accelerators and UUIDs.txt (found at Z:\crunchbase2\Accelerators and UUIDs.txt), extracts the accelerator UUIDs, and loads the founder information for each accelerator from the Crunchbase API using the link above. It then takes the information given by the API and returns a dictionary with accelerator UUIDs as keys and founder UUIDs as values.
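
The steps described above can be sketched roughly as follows. This is not the contents of scrapefounders.py. It assumes that each line of the text file holds an accelerator name and its UUID separated by a tab, and that the API response nests founders under data, relationships, founders, items. Both assumptions should be checked against the real file and a real response. The function names, and the fetch_json callable that stands in for the HTTP request, are hypothetical.

```python
from typing import Callable, Dict, Iterable, List


def read_accelerator_uuids(lines: Iterable[str]) -> List[str]:
    """Return the UUID column, assuming 'name<TAB>uuid' on each line."""
    uuids = []
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        uuids.append(line.rstrip("\n").split("\t")[-1].strip())
    return uuids


def founder_uuids(payload: dict) -> List[str]:
    """Pull founder UUIDs out of one response (assumed JSON shape)."""
    items = (
        payload.get("data", {})
        .get("relationships", {})
        .get("founders", {})
        .get("items", [])
    )
    return [item["uuid"] for item in items if "uuid" in item]


def scrape_founders(
    accelerator_uuids: Iterable[str],
    fetch_json: Callable[[str], dict],
) -> Dict[str, List[str]]:
    """Map each accelerator UUID to a list of its founder UUIDs."""
    return {uuid: founder_uuids(fetch_json(uuid)) for uuid in accelerator_uuids}
```

Passing the request function in as fetch_json keeps the parsing logic testable without calling the API. In practice it would request the link above for the given UUID and return the decoded JSON.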

Updated LinkedIn Crawler

We will be using a LinkedIn Crawler to find information about accelerator founders. There is a previous project whose code is found in

 E:/McNair/Projects/LinkedIn Crawler/LinkedIn_Crawler/linkedin

My code is on the selenium computer, at the root and at

 E:/McNair/Projects/LinkedIn Crawler 2018

Five Python files are needed to run the crawler. The folder also contains search.py, which I included because it was in the previous code I found, but did not use.

New Test Account

 Username: mcboatfaceboaty670@gmail.com
 Password: McNair2018

Use the selenium computer on the Rice Visitor wifi. After you log in a couple of times, LinkedIn will get suspicious and ask you to confirm that you are not a robot using reCaptcha. I got around this by delaying the program by 3 minutes so that I had time to complete the reCaptcha test. However, reCaptcha sometimes loses its connection and forces you to keep taking tests, which can be frustrating. When this happens, I disconnect from and reconnect to the wifi and switch between the test accounts.
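
The delay and the account switching described above could look roughly like this. It is a sketch only, not the crawler's actual code. The names pause_for_captcha and account_rotation are hypothetical, and the account strings in the usage lines are placeholders.

```python
import itertools
import time
from typing import Callable, Iterable, Iterator

# Three minutes, matching the delay described above.
CAPTCHA_PAUSE_SECONDS = 3 * 60


def pause_for_captcha(
    seconds: int = CAPTCHA_PAUSE_SECONDS,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Block so a person can complete the reCaptcha test by hand."""
    sleep(seconds)
    return seconds


def account_rotation(accounts: Iterable[str]) -> Iterator[str]:
    """Cycle through the test accounts, wrapping around at the end."""
    return itertools.cycle(accounts)


if __name__ == "__main__":
    rotation = account_rotation(["test-account-1", "test-account-2"])
    print(next(rotation))  # account to log in with first
    pause_for_captcha(seconds=1)  # short pause for demonstration
    print(next(rotation))  # account to switch to if reCaptcha gets stuck
```

The sleep parameter exists only so the pause can be replaced with a stub when testing.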

Code