Changes

Jump to navigation Jump to search
no edit summary
[[LinkedIn Crawler (Python)]]
==Project Introduction==This project uses the Crunchbase Data and API to find founders of the accelerators we are interested in. We then take the founders and run their names through the LinkedIn Crawler to find information about them.  ==Part 1: Getting Data==
To get the founder UUIDs from each accelerator, input the accelerator UUID (or name all lowercase if its one work) into this link:
https://api.crunchbase.com/v3.1/organizations/ + UUID of organization + ?relationships=founders&user_key=662e263576fe3e4ea5991edfbcfb9883
==Code==
===scrapefounders.py===
This code lives is located in :
Z:\crunchbase2\scrapefounders.py
This program takes Accelerators and UUIDs.txt found in a Z:\crunchbase2\Accelerators and UUIDs.txt and extracts the accelerator UUIDs and loads the information of each founder from the crunchbase API using the link above. It then takes the information given by the API and returns a dictionary of accelerator UUIDs as keys and founder UUIDs as values.
==Part 2: Updated LinkedIn Crawler==
We will be using a LinkedIn Crawler to find information about accelerator founders. There is a previous project whose code is found in
E:\McNair\Projects\LinkedIn Crawler\LinkedIn_Crawler\linkedin
E:\McNair\Projects\LinkedIn Crawler 2018
There are 6 python filesin the LinkedIn Crawler 2018 directory===linkedin_crawler_main.py===This contains the main function that will run the LinkedIn Crawler. It includes two test accounts at the top which I went back and forth on to prevent LinkedIn from finding me. Inputs (set outside of function) : username(of test account), password(of test account). query_filepath(txt file that includes name of accelerator, first_name, last_name, linkedin_url).Output: 3 txt files - founders_education.txt, founders_experience.txt, founders_main.txt    
===New Test Account===
108

edits

Navigation menu