Changes

Jump to navigation Jump to search
===Approaches (IN PROGRESS)===
[[Listing Page Classifier Progress|Progress Log(updated on 4/4/2019)]]
* Internal URL CrawlerFinding all internal links of a webpage
#BFS approach
E:\projects\listing page identifier\Internal_Link\urlcrawler_BFSInternal_url_BFS.py
#DFS approach(IN PROGRESS)
E:\projects\listing page identifier\Internal_Link\Internal_url_DFS.py
===Image Processing===
This method would likely rely on a [https://en.wikipedia.org/wiki/Convolutional_neural_network convolutional neural network (CNN)] to classify HTML elements present in web page screenshots. Implementation could be achieved by combining the VGG16 model or ResNet architecture with batch normalization to increase accuracy in this context.
227

edits

Navigation menu