Scott Behrens edited this page Aug 1, 2014 · 3 revisions

Sketchy

Overview

Sketchy is a task-based API for taking screenshots and scraping text from websites. Sketchy uses PhantomJS with lazy rendering to ensure Ajax-heavy sites are captured correctly. Sketchy uses the Celery task management system, which lets users scale Sketchy as needed and manage time-intensive captures.

Screenshots (sketches), text scrapes, and HTML files can be stored either locally or in an S3 bucket. Optionally, token auth can be configured for creating and retrieving captures.

Sketchy can also perform callbacks if required.

A Capture model contains all of the information associated with taking screenshots, scraping text, and storing HTML files for a provided URL.
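
To make the idea of a capture record concrete, here is a minimal sketch in plain Python. It is not Sketchy's actual model: the class layout, every field name (`status`, `sketch_url`, `scrape_url`, `html_url`, `callback`), and the `is_complete` helper are assumptions chosen only to mirror the three artifacts and the optional callback described above.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class Capture:
    """Hypothetical stand-in for a capture record; all field names are assumptions."""

    url: str                           # page to capture
    status: str = "CREATED"            # assumed lifecycle marker
    sketch_url: Optional[str] = None   # where the screenshot would be stored
    scrape_url: Optional[str] = None   # where the scraped text would be stored
    html_url: Optional[str] = None     # where the HTML file would be stored
    callback: Optional[str] = None     # optional URL to notify when finished

    def is_complete(self) -> bool:
        # Complete only once all three artifacts have a storage location.
        return all((self.sketch_url, self.scrape_url, self.html_url))


capture = Capture(url="http://example.com")
record = asdict(capture)  # plain dict, convenient for JSON serialization
```

A new record starts with no artifact locations, so `capture.is_complete()` is false until a worker fills in all three.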