Changes

Jump to navigation Jump to search
55 bytes added ,  17:26, 18 July 2016
*Created a function that took a URL and gave back all of the substantial text blocks on the page (used to find company descriptions)
**Uses BeautifulSoup to access and explore HTML files.
**The function explores the HTML source code of the URL and finds all parts of the source code with the <p> tag to indicate a text paragraph.
**Then, the function goes though each paragraph, and if it is above a certain number of characters (eliminate for short, unnecessary information), the function adds the description in a new column of the csv file under "description".
383

edits

Navigation menu