A transformation pipeline for Delta Lake using AWS SDK for Pandas
-
Updated
Jul 12, 2023 - Python
A transformation pipeline for Delta Lake using AWS SDK for Pandas
Global Markets Options Pricing
Recommendation system approaches
AI Starter Kit to generate structured synthetic data using Intel® Distribution of Modin
Delve deeper into data manipulation using Python's prominent libraries. Explore the functionalities of Pandas and get a glimpse of alternatives like Polars, Dask, and Modin.
A Bioinformatics demo in Python working with FASTQ files and using the Modin library
Simple example on how Modin can peed up your Pandas workflows by changing a single line of code
HHA507 / Data Science / Assignment 2 / Data Manipulation
Using the MovieLens dataset with Surprise to compare different algorithms for rating prediction, and also create a movie recommendation system on top of it.
oneAPI Hackathon: The LLM Challenge
Open Data Profiling, Quality and Analysis on NYC OpenData dataset with semantic profiling using fuzzy ratio, Levenshtein distance and regex
A Kedro plugin that provides pandas dropin replacements for the pandas datasets (e.g modin and cuDF)
A low-level execution library for analytic data processing.
Distributed XGBoost on Ray
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Add a description, image, and links to the modin topic page so that developers can more easily learn about it.
To associate your repository with the modin topic, visit your repo's landing page and select "manage topics."