Skip to content

nasa-jpl-memex/nutch-python

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nutch-python

A Python client library for the Apache Nutch that makes Nutch 1.x capabilities available using the Nutch REST Server.

See (https://wiki.apache.org/nutch/NutchTutorial) for installing Nutch 1.x and alternatively operating it via the command line.

This Python client library for Nutch is installable via Setuptools, Pip and Easy Install.

Installation (with pip)

  1. pip install nutch

Installation (without pip)

  1. python setup.py build
  2. python setup.py install

Wiki Documentation

See the wiki for instructions on how to use Nutch-Python and its API.

New Command Line Tool

When you install Nutch-Python you also get a new command line client tool, nutch-python installed in your /path/to/python/bin directory.

The options and help for the command line tool can be seen by typing nutch-python without any arguments.

Questions, comments?

Send them to Chris A. Mattmann.

Contributors

  • Brian D. Wilson, JPL
  • Chris A. Mattmann, JPL
  • Aron Ahmadia, Continuum Analytics

License

Apache License, version 2

About

Python port of Nutch that allows controlling Apache Nutch via its REST API.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.9%
  • Other 1.1%