Skip to content
@ContentMine

The ContentMine

The ContentMine is extracting 100 million facts from the academic literature

Popular repositories

  1. quickscrape quickscrape Public

    A scraping command line tool for the modern web

    JavaScript 256 44

  2. getpapers getpapers Public

    Get metadata, fulltexts or fulltext URLs of papers matching a search query

    JavaScript 197 37

  3. journal-scrapers journal-scrapers Public

    Journal scraper definitions for the ContentMine framework

    Ruby 66 34

  4. norma norma Public

    Convert XML/SVG/PDF into normalised, sectioned, scholarly HTML

    HTML 36 21

  5. workshop-resources workshop-resources Public

    This repository contains material helping you to set up a ContentMine workshop. It also includes tutorials for learning the ContentMine tools on your own.

    36 13

  6. scraperJSON scraperJSON Public

    The scraperJSON standard for defining web scrapers as JSON objects

    33 2

Repositories

Showing 10 of 101 repositories