Skip to content
Thad Guidry edited this page Dec 8, 2021 · 3 revisions

List of other software products that can be used with or instead of OpenRefine

(in no particular order)

  • JunoLab - open source extendable Interactive IDE built on Atom and Julia that provides live feedback and a programmable GUI to create facets like OpenRefine
  • Mito - Python and Jupyter based interactive programmable GUI
  • Weka - open source collection of tools for data mining and machine learning.
  • Orange open source data mining and machine learning
  • Apache NiFi - open source automated dataflow & batch processing, has expression languages, & extendable
  • Talend Data Preparation - open source tool that performs similar functions as OpenRefine
  • Apache Hop - open source automated stream & batch processing, supports expression languages
  • Rattle - open source A Graphical User Interface for Data Mining using R
  • d:swarm - open source An open-source data management platform for knowledge workers
  • Rapid Miner (a light version is free)
  • dataiku - Collaborative Data Science Platform (with a free version)
  • Tableau
  • Spotfire
  • Pentaho
  • CloverETL
  • Informatica Data Quality
  • Elixir Repertoire
  • Pervasive Datarush (sold and closed)
  • Palantir
  • TextPipe
  • Bonita Business Process Manager From the academia (open source for the basic community edition)
  • Potter's Wheel (screenshots) by Vijayshankar Raman and Joseph M. Hellerstein (UCBerkeley) (open source)
  • AJAX by H.Galhardas, D. Florescu, D. Shasha, E. Simon, J.P. Matsumoto, C.A. Saita (error 404)
  • Clio - UToronto, IBM (university project)
  • Data Wrangler - an interactive tool for data cleaning and transformation.
  • Trifacta Wrangler - Commercial version of Data Wrangler, but free version available.
  • Yeroon.net/ggplot2
    • a javascript web interface for R package, ggplot2 - Jeroen C.L. Ooms, Hadley Wickham (university project)
  • Visidata - open source interactive multitool for tabular data. Command line and Open Source.
  • Kylo - open source data lake platform with interactive data tooling. Open Source (no longer supported by Teradata)

Feel free to add suggestions in the comments.

Clone this wiki locally