massive-datasets

Here are 21 public repositories matching this topic...

helmholtz-analytics / heat

Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python

python data-science machine-learning hpc gpu numpy mpi pytorch distributed parallelism data-analytics tensors data-processing multi-gpu mpi4py massive-datasets multi-node-cluster array-api

Updated May 29, 2024
Python

polardb / polardbx-sql

Star

PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.

mysql distributed-transactions cloud-native high-availability relational-database high-concurrency massive-datasets htap horizontal-scaling enterprise-class

Updated May 22, 2024
Java

polardb / polardbx

Star

PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.

mysql distributed-transactions cloud-native high-availability relational-databases high-concurrency massive-datasets htap horizontal-scaling enterprise-class

Updated May 14, 2024
Makefile

arhcoder / Netflix-Recommendation

Star

📺 Content Recommendation System for the Netflix Prize Challenge with Collaborative Filtering.

python jupyter-notebook collaborative-filtering netflix recommendation-system recommendation-engine recommender-system massive-datasets netflix-prize massive-data

Updated Feb 17, 2024
Jupyter Notebook

FedericoBruzzone / algorithms-for-massive-datasets

Star

This repository contains a LaTeX file that generates a PDF document comprising comprehensive notes for the course "Algorithms for Massive Datasets"

deep-learning algorithms recommender-system massive-datasets unimi linkanalysis

Updated Jan 15, 2024
TeX

FedericoBruzzone / anti-money-laundering

Star

The project is based on the analysis of the «IBM Transactions for Anti Money Laundering» dataset published on Kaggle. The task is to implement a model which predicts whether or not a transaction is illicit, using the attribute "Is Laundering" as a label to be predicted.

machine-learning machine-learning-algorithms pyspark massive-datasets

Updated Oct 4, 2023
Jupyter Notebook

KolwaBrad / massivedataset

Star

Training the MASSIVE dataset by Amazon(english-US, German-DE and Swahili-KE)

python massive-datasets

Updated Oct 2, 2023
Python

simkarwin / mimo_keras

Star

TF-Package: Multiple-Input Multiple-Output Keras Data-Generator for massive and complex datasets

massive-datasets keras-datagenerator mimo-models

Updated Jan 2, 2023
Python

joshuaboud / gen-dataset

Star

Command line tool to quickly generate a lot of files in a lot of directories

linux benchmarking evaluation multithreading dataset dataset-generation massive-datasets cli-tool dataset-generator

Updated Feb 18, 2022
C++

Alex4gtx / Massive-Data-Handler

Star

Permite abrir e manipular arquivos massivos de texto/dados cujo seria impossivel abrir em um computador, por exemplo um arquivo de texto de +20gb, permite manipular o arquivo pegando apenas as linhas necessárias sem travar o computador por falta de memória.

big-data dictionaries python-script massive-datasets manipulacao-arquivos