
tsotne95/PageSimpleScraper


PageSimpleScraper

A simple web page scraper built with Jsoup.

URLs are validated with the Apache Commons UrlValidator class.

For the unique images and links found on a page, it creates a tab-delimited file with the following details:

images:

  • image number
  • image url
  • image width
  • image height
  • image alt

links:

  • link url
  • link text
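
The image layout above can be sketched in plain Java. This is only an illustration, not the repository's code: the class `ImageTsvSketch`, the holder `ImageRow`, the header row, and the choice to treat the image URL as the uniqueness key are all assumptions. It also assumes the values have already been extracted from the page (for example with Jsoup) and only shows the de-duplication and tab-delimited formatting step.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: names and layout are assumptions, not the project's API.
final class ImageTsvSketch {

    /** Hypothetical holder for the image details listed above. */
    static final class ImageRow {
        final String url;
        final String width;
        final String height;
        final String alt;

        ImageRow(String url, String width, String height, String alt) {
            this.url = url;
            this.width = width;
            this.height = height;
            this.alt = alt;
        }
    }

    /** Keeps the first occurrence of each URL and preserves page order. */
    static List<ImageRow> uniqueByUrl(List<ImageRow> rows) {
        Map<String, ImageRow> seen = new LinkedHashMap<>();
        for (ImageRow row : rows) {
            seen.putIfAbsent(row.url, row);
        }
        return new ArrayList<>(seen.values());
    }

    /** Replaces tabs and line breaks so a value cannot break the column layout. */
    static String clean(String value) {
        return value == null ? "" : value.replaceAll("[\\t\\r\\n]+", " ").trim();
    }

    /** Builds the tab-delimited text: an assumed header row, then one line per unique image. */
    static String toTsv(List<ImageRow> rows) {
        StringBuilder sb = new StringBuilder("number\turl\twidth\theight\talt\n");
        int number = 1;
        for (ImageRow row : uniqueByUrl(rows)) {
            sb.append(number++).append('\t')
              .append(clean(row.url)).append('\t')
              .append(clean(row.width)).append('\t')
              .append(clean(row.height)).append('\t')
              .append(clean(row.alt)).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<ImageRow> rows = Arrays.asList(
            new ImageRow("https://example.com/a.png", "10", "20", "A logo"),
            new ImageRow("https://example.com/a.png", "10", "20", "duplicate"),
            new ImageRow("https://example.com/b.png", "", "", null));
        System.out.print(toTsv(rows));
    }
}
```

The links file would follow the same pattern with two columns (link url and link text). A `LinkedHashMap` is used here so that duplicates are dropped while rows stay in the order they appeared on the page.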

You can view example files at this link.
