brejnko / UrlLinkList / 0.1.0

Link extractor. 

Given an input URL, the HTML is fetched and all distinct links (a[href]) are extracted and returned as a Set<String>.

Dependencies:
  • jsoup
  • Apache commons validator

The input URL is validated before the page is fetched, and the connection timeout is set to 5 seconds.
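
The flow described above (validate, fetch with a 5-second timeout, collect distinct a[href] links) could look roughly like the sketch below. It is not taken from the project source: the class name LinkExtractorSketch, the method names, the restriction to http/https, the exception thrown on an invalid URL, and the choice to resolve relative links to absolute ones are all assumptions made for illustration.

```java
import java.io.IOException;
import java.util.LinkedHashSet;
import java.util.Set;

import org.apache.commons.validator.routines.UrlValidator;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

// Hypothetical class; names and error handling are assumptions, not the project's API.
public class LinkExtractorSketch {

    // 5 seconds, as stated in the README.
    private static final int TIMEOUT_MILLIS = 5_000;

    // Validates the URL, fetches the page, and returns its distinct links.
    public static Set<String> fetchLinks(String url) throws IOException {
        // Assumption: only http and https URLs are accepted.
        UrlValidator validator = new UrlValidator(new String[] {"http", "https"});
        if (!validator.isValid(url)) {
            throw new IllegalArgumentException("Invalid URL: " + url);
        }
        Document doc = Jsoup.connect(url).timeout(TIMEOUT_MILLIS).get();
        return extractLinks(doc);
    }

    // Collects the href of every a[href] element; the Set removes duplicates.
    public static Set<String> extractLinks(Document doc) {
        Set<String> links = new LinkedHashSet<>(); // keeps first-seen order
        for (Element anchor : doc.select("a[href]")) {
            // Assumption: "abs:href" resolves relative links against the page's base URI.
            String href = anchor.attr("abs:href");
            if (!href.isEmpty()) {
                links.add(href);
            }
        }
        return links;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("Usage: LinkExtractorSketch <url>");
            return;
        }
        for (String link : fetchLinks(args[0])) {
            System.out.println(link);
        }
    }
}
```

Splitting extraction from fetching keeps the link-collecting step usable on an already parsed document, for example one produced by Jsoup.parse(html, baseUri), without any network access.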