Skip to content

elisarver/reach

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

95 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Reach gathers urls from a website.

Installation:

You can use go get -u github.com/elisarver/reach/cmd/reach if you're only interested in the executable.

We build with go.mod support under go v1.11.

The application displays full usage with the -h flag.

  • Depends on goquery for querying html
  • Uses gen to generate custom sets and slices

Example of default use (gets a\[href] by default):

> reach http://google.com/
http://google.com/intl/en/policies/privacy/
http://google.com/intl/en/policies/terms/
/intl/en/ads/
/services/

Reparenting urls:

> reach -p http://google.com/
http://google.com/intl/en/policies/privacy/
http://google.com/intl/en/policies/terms/
http://google.com/intl/en/ads/
http://google.com/services/

Specify tag type:

> reach -tag meta:name http://google.com/
description
robots

Use a raw query:

> reach -query "div.all_external_links" http://elisarver.com/another-week-of-accomplishment

// Document body

Contact: eli.sarver@gmail.com

About

Experimental code to spider a site as a smoke test.

Resources

License

Security policy

Stars

Watchers

Forks

Packages

No packages published

Languages