Changes

Jump to navigation Jump to search
Issues may occur:
* The href may not give us the full url, like above example it excludes the domain name: "http://www.edegan.com"
* Some may not exclude the domain name and we should take consideration of both cases when extracting the url
'''Part II: Distinguish Internal Links'''
* If the href is not presented in a full url format (referring to the example above), then it is for sure an internal link
* If the href is in a full url format, but it does not contain the domain name, then it is an external link (see example below, assuming domain name is not facebook.com) https://www.facebook.com/...
'''Part III: Algorithm on Collecting Internal Links'''
227

edits

Navigation menu