Changes

Jump to navigation Jump to search
757 bytes added ,  11:36, 3 April 2019
no edit summary
==Initial Review of INBIA==The INBIA directory contains information for 415 incubators within the United States. It provides reliable links to a secondary page within the INBIA domain. This page contains information including the incubator's name, address, a link to the home page of their website, and information for key contacts. The secondary pages have the same HTML structure and are reliable in the data they contain, making INBIAan ideal candidate for web crawling methods to collect data from the internal pages. See [http://www.edegan.com/wiki/Incubator_Seed_Data#Evaluation_of_Sources_from_Specific_Google_Searches Wiki Page Table] for more details on source evaluations. ==Retrieve URLS from INBIA Directory==
We retrieved the INBIA data as follows:
We can then rip out the contact information, including URL, and the people, using either beautiful soup or regular expressions.
 
 
==Retrieve Data from URLS Generated==
83

edits

Navigation menu