Changes

18 bytes added ,  12:16, 19 June 2017
It takes as input the full HTML file of the website, converts it to a "soup" object, and scrapes the parsed HTML. <br>
The scraped items are stored in a Pandas DataFrame object, which is then converted to a tab-separated text file. <br>
When converting to a text file, make sure to set the following explicitly: encoding = "utf-8", sep = "\t", and index = False <br>
These settings ensure, respectively, that the resulting strings are encoded properly, that the file is tab-separated, and that the row index is left out so the data looks clean. <br>
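The steps above can be sketched roughly as follows. This is a minimal illustration and not the actual script: it assumes the "soup" object comes from BeautifulSoup (the bs4 package) and that the text file is written with DataFrame.to_csv, which accepts the three settings named above. The sample HTML, the class names item, name and city, and the function names scrape_items and write_tsv are all hypothetical.

```python
import pandas as pd
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for the saved page; the real page
# structure is not described here, so these tags are assumptions.
SAMPLE_HTML = """
<html><body>
  <div class="item"><span class="name">Café Roma</span><span class="city">Houston</span></div>
  <div class="item"><span class="name">Blue Door</span><span class="city">Austin</span></div>
</body></html>
"""


def scrape_items(html):
    """Parse the HTML into a soup object and collect one row per item."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for item in soup.find_all("div", class_="item"):
        rows.append({
            "name": item.find("span", class_="name").get_text(strip=True),
            "city": item.find("span", class_="city").get_text(strip=True),
        })
    # Fix the column order so the output file is predictable.
    return pd.DataFrame(rows, columns=["name", "city"])


def write_tsv(df, path):
    """Write the DataFrame as a tab-separated text file."""
    # encoding="utf-8": non-ASCII strings are written correctly
    # sep="\t":         fields are separated by tabs
    # index=False:      the DataFrame row index is not written
    df.to_csv(path, encoding="utf-8", sep="\t", index=False)
```

Calling write_tsv(scrape_items(SAMPLE_HTML), "output.txt") would then produce a file with a header row followed by one tab-separated line per scraped item. Leaving out index=False would add an extra leading column of row numbers.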