Changes

Jump to navigation Jump to search
no edit summary
This python script takes a textfile of company names, and uses the Crunchbase Snapshot to determine the founder names of each company. If Crunchbase does not have the records of the founder, it is unlikely that a generic search on LinkedIn will provide any useful results. The script returns a new textfile with each company name replaced with "CompanyName Founder FounderName" for each founder of the company listed in the Crunchbase Snapshot. This new textfile can then be used directly with the LinkedIn Crawler to generate accurate search results, and retrieve accurate html pages.
=Previous Posts about the LinkedIn Crawler=Using Crunchbase==Currently, we have SnapShot data for the year 2013. This method works for the companies that existed in that period, but is not useful for any companies not listed in that given file. Ideally, we will be able to get data directly from Crunchbase. If not, one option is to crawl Crunchbase directly. 
== To what extent are we able to reproduce the network structure in LinkedIn (From Previous) ==

Navigation menu