LinkedIn Crawler (Python)

From edegan.com
Revision as of 18:56, 2 March 2017 by Peterjalbert (talk | contribs)
Jump to navigation Jump to search


McNair Project
LinkedIn Crawler (Python)
Project logo 02.png
Project Information
Project Title LinkedIn Crawler (Python)
Start Date March 2, 2017
Deadline
Keywords Selenium, LinkedIn, Crawler
Primary Billing
Notes
Has project status
Copyright © 2016 edegan.com. All Rights Reserved.


Overview

This page is dedicated to a new LinkedIn Crawler built using Selenium and Python. The goal of this project is to be able to crawl LinkedIn without being caught by LinkedIn's aggressive anti-scraping rules. To do this, we will use Selenium to behave like a human, and use time delays to hide bot-like tendencies.

The documentation for Selenium Web Driver can be found [here http://selenium-python.readthedocs.io/index.html].

Relevant scripts can be found in the following directory:

E:\McNair\Projects\LinkedIn Crawler

The main script for this crawler is:

crawl_linkedin.py

Accounts

Test Account:

email: testapplicat6@gmail.com

pass: McNair2017

Functionality

This section lists functions in the crawl_linkedin.py script that can be combined for higher functionality.

login(username, password)

This function opens the LinkedIn home page and logs in using the credentials given to the function. You will be taken to the home news feed for your account.

search(query)

This function assumes you are already logged into LinkedIn. It will type in the search bar the query that it is given, and begin the search for the given query.