Changes

Jump to navigation Jump to search
519 bytes added ,  11:50, 3 April 2019
no edit summary
==Retrieve Data from URLS Generated==
We wrote a web crawler that
# reads in the csv file into a pandas dataframe
# changes the urls by -- replacing ''?c=companyprofile&'' with ''companyprofile?'' and appending the domain http://exchange.inbia.org/network/findacompany to each url
# open each url and extract information using element tree parser
# write information for each url to csv file
 
This crawler is called inbia_scrape.py and it is located in E:\projects\Kauffman Incubator Project\01 Classify entrepreneurship ecosystem organizations\INBIA
83

edits

Navigation menu