Skip to content
#

data-analytics

Here are 2,985 public repositories matching this topic...

desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

  • Updated May 11, 2024
  • C++

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, with more integrations coming..

  • Updated May 12, 2024
  • Rust

Improve this page

Add a description, image, and links to the data-analytics topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-analytics topic, visit your repo's landing page and select "manage topics."

Learn more