Skip to content

Popular repositories

  1. behemoth behemoth Public archive

    Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.

    Java 282 60

  2. TextClassification TextClassification Public

    A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and can be used as a front end to various ML algorithms. libSVM …

    Java 48 21

  3. textclassification-examples textclassification-examples Public

    Use cases for DigitalPebble's TextClassification API

    Java 10 3

  4. stormcrawlerfight stormcrawlerfight Public

    Crawl configurations for benchmarking / testing StormCrawler

    Shell 9 5

  5. ansible-storm ansible-storm Public

    Ansible playbook for deploying a Storm cluster

    7 1

  6. stormcrawler-docker stormcrawler-docker Public

    Resources for running StormCrawler with Docker services

    Dockerfile 7 2

Repositories

Showing 10 of 26 repositories

Top languages

Loading…

Most used topics

Loading…