Framework for processing and filtering datasets
Updated Jun 2, 2024 - Python
SQL-like interface to tabular data structures
A public repository for all things RAG (Retrieval Augmented Generation)
PHP - ETL (Extract Transform Load) data processing library
Real-time neuro-/biosignal processing and streaming pipeline
Kubernetes-native platform to run massively parallel data/streaming jobs
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Data and tools for generating and inspecting OLMo pre-training data.
Advanced and Fast Data Transformation in R
Doctrine DBAL Bulk Operations for selected database engines
A nvImageCodec library of GPU- and CPU-accelerated codecs featuring a unified interface
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you programmatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain ol' Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.
Extraction pipeline and analysis tools for Aperture Masking Interferometry mode of the last generation of instruments (ground-based and space).
A light-weight, flexible, and expressive statistical data testing library
Python Stream Processing
Data sources used by the Big Data Innovation Team
The MDSplus data management system
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Niamoto is a command-line application and library focused on processing and publishing botanical data