A ranked list of awesome machine learning Python libraries. Updated weekly.
This curated list contains 610 awesome open-source projects with a total of 2M stars grouped into 31 categories. All projects are ranked by a project-quality score, which is calculated from various metrics automatically collected from GitHub and different package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
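As a rough illustration of how a ranked list like this can be ordered, the sketch below parses the abbreviated counts used throughout the list (such as `39K` or `13M`) and sorts a few entries by score. The `parse_count` and `rank_projects` helpers are hypothetical, and the tie-break on star count is an assumption. The actual project-quality score is computed by the list's own tooling and is not reproduced here.

```python
import re

# Multipliers for the abbreviated counts used in this list (e.g. "39K", "13M").
_SUFFIXES = {"K": 1_000, "M": 1_000_000}


def parse_count(text: str) -> int:
    """Convert an abbreviated count such as '7.7K' or '150' to an integer."""
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)\s*([KM]?)\s*", text)
    if match is None:
        raise ValueError(f"not a recognised count: {text!r}")
    number, suffix = match.groups()
    return round(float(number) * _SUFFIXES.get(suffix, 1))


def rank_projects(projects):
    """Sort (name, score, stars) tuples by score, highest first.

    Ties are broken by star count, which is an assumption made for
    this sketch rather than a documented rule of the list.
    """
    return sorted(projects, key=lambda p: (p[1], parse_count(p[2])), reverse=True)


# A few (name, score, stars) entries copied from the list below.
entries = [
    ("StatsModels", 36, "4.6K"),
    ("Tensorflow", 44, "140K"),
    ("XGBoost", 33, "18K"),
    ("LightGBM", 33, "10K"),
    ("scikit-learn", 41, "39K"),
]
ranked = rank_projects(entries)
for name, score, stars in ranked:
    print(f"{name}: score {score}, {parse_count(stars)} stars")
```

Keeping the counts as strings until they are needed for comparison mirrors how they appear in the list and avoids losing the original rounding.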
- Machine Learning Frameworks 38 projects
- Data Visualization 30 projects
- Text Data & NLP 60 projects
- Image Data 31 projects
- Graph Data 21 projects
- Audio Data 14 projects
- Geospatial Data 20 projects
- Financial Data 17 projects
- Time Series Data 12 projects
- Medical Data 18 projects
- Data Containers & Structures 26 projects
- Data Loading & Extraction 14 projects
- Web Scraping & Crawling 17 projects
- Data Pipelines & Streaming 26 projects
- Distributed Machine Learning 17 projects
- Hyperparameter Optimization & AutoML 38 projects
- Reinforcement Learning 16 projects
- Recommender Systems 10 projects
- Privacy Machine Learning 4 projects
- Workflow & Experiment Tracking 23 projects
- Model Serialization & Conversion 6 projects
- Model Interpretability 24 projects
- Vector Similarity Search (ANN) 8 projects
- Probabilistic Methods 12 projects
- Adversarial Robustness 6 projects
- GPU Utilities 16 projects
- TensorFlow Utilities 5 projects
- PyTorch Utilities 7 projects
- Database Clients 34 projects
- System Monitoring & Profiling 13 projects
- Others 31 projects
- 🥇🥈🥉 Combined project-quality score
- ⭐️ Star count from GitHub
- 🐣 New project (less than 6 months old)
- 💤 Inactive project (6 months no activity)
- 💀 Dead project (12 months no activity)
- ❗️ Warning (e.g. missing/risky license)
- 👨‍💻 Contributors count from GitHub
- 🔀 Fork count from GitHub
- 📋 Issue count from GitHub
- ⏱️ Last update timestamp on package manager
- 📥 Download count from package manager
- 📦 Number of dependent projects
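The new, inactive, and dead markers above reduce to simple date arithmetic. The following is a minimal sketch, assuming a month is approximated as 30 days and that inactivity takes precedence over the "new" marker. The `project_status` function and all dates are hypothetical and are not the list's actual implementation.

```python
from datetime import date, timedelta

# Thresholds from the legend; a "month" is approximated as 30 days (assumption).
NEW_MAX_AGE = timedelta(days=6 * 30)
INACTIVE_AFTER = timedelta(days=6 * 30)
DEAD_AFTER = timedelta(days=12 * 30)


def project_status(created: date, last_activity: date, today: date) -> str:
    """Classify a project as 'dead', 'inactive', 'new', or 'active'."""
    idle = today - last_activity
    if idle >= DEAD_AFTER:
        return "dead"
    if idle >= INACTIVE_AFTER:
        return "inactive"
    if today - created < NEW_MAX_AGE:
        return "new"
    return "active"


# Hypothetical snapshot date and project dates, for illustration only.
snapshot = date(2020, 1, 24)
print(project_status(date(2017, 1, 1), date(2018, 9, 19), snapshot))
print(project_status(date(2019, 11, 1), date(2020, 1, 20), snapshot))
```

Checking the dead threshold first keeps the categories mutually exclusive, since any project idle for 12 months has also been idle for 6.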
General-purpose machine learning and deep learning frameworks.
Tensorflow (π₯44 Β· β 140K) - An Open Source Machine Learning Framework for Everyone. Apache-2
scikit-learn (π₯41 Β· β 39K) - scikit-learn: machine learning in Python. BSD-3
- GitHub (👨‍💻 1.8K · 🔀 19K · 📥 150 · 📦 89K · 📋 7.7K - 26% open · ⏱️ 22.01.2020):
  git clone https://github.com/scikit-learn/scikit-learn
- PyPi (📥 13M / month · 📦 38K · ⏱️ 02.01.2020):
  pip install scikit-learn
- Conda (⏱️ 09.01.2020):
  conda install -c anaconda scikit-learn
PyTorch (π₯39 Β· β 36K) - Tensors and Dynamic neural networks in Python with strong GPU acceleration. BSD-3
StatsModels (π₯36 Β· β 4.6K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
- GitHub (👨‍💻 250 · 🔀 1.8K · 📥 16 · 📦 18K · 📋 3.8K - 48% open · ⏱️ 23.01.2020):
  git clone https://github.com/statsmodels/statsmodels
- PyPi (📥 1.7M / month · 📦 6.7K · ⏱️ 15.01.2020):
  pip install statsmodels
- Conda (⏱️ 30.07.2019):
  conda install -c anaconda statsmodels
Theano (π₯35 Β· β 9.1K) - Theano is a Python library that allows you to define, optimize, and evaluate.. BSD-3
MXNet (π₯34 Β· β 18K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic,.. Apache-2
XGBoost (π₯33 Β· β 18K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM).. Apache-2
LightGBM (π₯33 Β· β 10K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or.. MIT
Vowpal Wabbit (π₯31 Β· β 6.8K) - Vowpal Wabbit is a machine learning system which pushes the frontier of.. BSD-3
Catboost (π₯30 Β· β 4.8K) - A fast, scalable, high performance Gradient Boosting on Decision Trees.. Apache-2
TFlearn (π₯29 Β· β 9.4K Β· π€) - Deep learning library featuring a higher-level API for TensorFlow. MIT
PaddlePaddle (π₯28 Β· β 11K) - PArallel Distributed Deep LEarning: Machine Learning Framework from.. Apache-2
tensorpack (π₯28 Β· β 5.2K) - A Neural Net Training Interface on TensorFlow, with focus on speed +.. Apache-2
CNTK (π₯27 Β· β 17K) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
Turi Create (π₯27 Β· β 9.3K) - Turi Create simplifies the development of custom machine learning models. BSD-3
Ignite (π₯27 Β· β 2.5K) - High-level library to help with training neural networks in PyTorch. BSD-3
pytorch-lightning (π₯26 Β· β 3.4K) - The lightweight PyTorch wrapper for ML researchers. Scale your.. Apache-2
skorch (π₯25 Β· β 2.8K) - A scikit-learn compatible neural network library that wraps pytorch. BSD-3
Neural Network Libraries (π₯25 Β· β 2.3K) - Neural Network Libraries. Apache-2
Ludwig (π₯24 Β· β 6.3K) - Ludwig is a toolbox built on top of TensorFlow that allows to train and test.. Apache-2
xLearn (π₯24 Β· β 2.6K) - High performance, easy-to-use, and scalable machine learning (ML) package,.. Apache-2
NeuPy (π₯22 Β· β 600) - NeuPy is a Tensorflow based python library for prototyping and building neural networks. MIT
Torchbearer (π₯20 Β· β 520) - torchbearer: A model fitting library for PyTorch. MIT
ThunderSVM (π₯19 Β· β 1.1K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
ThunderGBM (π₯17 Β· β 480) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
Neural Tangents (π₯17 Β· β 390) - Fast and Easy Infinite Neural Networks in Python. Apache-2
StarSpace (π₯14 Β· β 3.2K) - Learning embeddings for classification, retrieval and ranking. MIT
- GitHub (👨‍💻 17 · 🔀 420 · 📋 180 - 18% open · ⏱️ 13.12.2019):
  git clone https://github.com/facebookresearch/StarSpace
Show 3 hidden projects...
NuPIC (π₯24 Β· β 6.1K) - Numenta Platform for Intelligent Computing is an implementation of Hierarchical.. βοΈAGPL-3.0
MindsDB (π₯21 Β· β 2.1K) - Machine Learning in one line of code. βοΈGPL-3.0
General-purpose and task-specific data visualization libraries.
Matplotlib (π₯38 Β· β 11K) - matplotlib: plotting with Python. βοΈPython-2.0
Plotly (π₯33 Β· β 6K) - An open-source, interactive graphing library for Python (includes Plotly Express). MIT
pandas-profiling (π₯29 Β· β 4.2K) - Create HTML profiling reports from pandas DataFrame objects. MIT
datashader (π₯28 Β· β 2.1K) - Reveal everything even in your largest datasets, by turning them into images. BSD-3
Perspective (π₯24 Β· β 2.2K) - Streaming pivot visualization via WebAssembly. Apache-2
- GitHub (👨‍💻 42 · 🔀 180 · 📦 29 · 📋 180 - 11% open · ⏱️ 16.01.2020):
  git clone https://github.com/finos/perspective
- PyPi (📥 470 / month · 📦 4 · ⏱️ 19.12.2019):
  pip install perspective-python
- NPM (📥 340 / month · ⏱️ 08.01.2020):
  npm install @finos/perspective-jupyterlab
PyVista (π₯24 Β· β 310) - 3D plotting and mesh analysis through a streamlined interface for the.. MIT
facets-overview (π₯23 Β· β 6K) - Visualizations for machine learning datasets. Apache-2
HyperTools (π₯23 Β· β 1.5K) - A Python toolbox for gaining geometric insights into high-dimensional data. MIT
pythreejs (π₯23 Β· β 580) - A Jupyter - Three.js bridge. BSD-3
- GitHub (👨‍💻 23 · 🔀 120 · 📦 10 · 📋 170 - 25% open · ⏱️ 17.01.2020):
  git clone https://github.com/jupyter-widgets/pythreejs
- PyPi (📥 9.7K / month · 📦 13 · ⏱️ 04.07.2019):
  pip install pythreejs
- NPM (📥 2.2K / month · 📦 8 · ⏱️ 04.07.2019):
  npm install jupyter-threejs
hvPlot (π₯23 Β· β 240) - A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews. BSD-3
Chartify (π₯22 Β· β 2.4K) - Python library that makes it easy for data scientists to create charts. Apache-2
Pandas-Bokeh (π₯21 Β· β 320) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
Show 4 hidden projects...
Altair (π₯26 Β· β 240 Β· π) - Declarative statistical visualization library for Python. BSD-3
pivottablejs (π₯19 Β· β 350 Β· π) - Dragndrop Pivot Tables and Charts for Jupyter/IPython Notebook, care of.. MIT
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
spaCy (π₯36 Β· β 16K) - Industrial-strength Natural Language Processing (NLP) with Python and Cython. MIT
nltk (π₯33 Β· β 8.6K) - Suite of libraries and programs for symbolic and statistical natural language.. Apache-2
transformers (π₯32 Β· β 21K) - Transformers: State-of-the-art Natural Language Processing for.. Apache-2
Rasa (π₯31 Β· β 7.6K) - Open source machine learning framework to automate text- and voice-based.. Apache-2
TextBlob (π₯29 Β· β 6.8K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun.. MIT
sentencepiece (π₯29 Β· β 3.5K) - Unsupervised text tokenizer for Neural Network-based text generation. Apache-2
flair (π₯28 Β· β 8K) - A very simple framework for state-of-the-art Natural Language Processing (NLP). MIT
fairseq (π₯28 Β· β 6.8K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
inflect (π₯28 Β· β 370) - Correctly generate plurals, ordinals, indefinite articles; convert numbers to words. MIT
Dedupe (π₯27 Β· β 2.5K) - A python library for accurate and scalable fuzzy matching, record deduplication and.. MIT
phonenumbers (π₯27 Β· β 2.4K) - Python port of Google's libphonenumber. Apache-2
DeepPavlov (π₯26 Β· β 3.9K) - An open source library for deep learning end-to-end dialog systems and.. Apache-2
GluonNLP (π₯26 Β· β 1.9K) - Toolkit that enables easy text preprocessing, datasets loading and neural.. Apache-2
jellyfish (π₯26 Β· β 1.2K) - a python library for doing approximate and phonetic matching of strings. BSD-2
Sumy (π₯25 Β· β 2.2K) - Module for automatic summarization of text documents and HTML pages. Apache-2
stanfordnlp (π₯24 Β· β 2.7K) - Official Stanford NLP Python Library for Many Human Languages. Apache-2
pytorch-nlp (π₯24 Β· β 1.6K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3
TextDistance (π₯24 Β· β 1.6K) - Compute distance between sequences. 30+ algorithms, pure python implementation,.. MIT
pyahocorasick (π₯24 Β· β 430) - Python module (C extension and plain python) implementing Aho-Corasick.. BSD-3
MatchZoo (π₯23 Β· β 2.9K) - Facilitating the design, comparison and sharing of deep text matching models. Apache-2
Texar (π₯23 Β· β 1.8K) - Toolkit for Machine Learning, Natural Language Processing, and Text.. Apache-2
TensorFlow Text (π₯23 Β· β 530) - Making text a first-class citizen in TensorFlow. Apache-2
NLP Architect (π₯22 Β· β 2.3K) - A model library for exploring state-of-the-art deep learning topologies.. Apache-2
Tokenizers (π₯22 Β· β 1.8K Β· π£) - Fast State-of-the-Art Tokenizers optimized for Research and Production. Apache-2
anaGo (π₯22 Β· β 1.3K) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech.. MIT
PyTextRank (π₯22 Β· β 1.1K) - Python implementation of TextRank for phrase extraction and summarization of text.. MIT
Kashgari (π₯21 Β· β 1.4K) - Kashgari is a Production-ready NLP Transfer learning framework for text-.. Apache-2
scattertext (π₯21 Β· β 1.2K) - Beautiful visualizations of how language differs among document types. Apache-2
spacy-transformers (π₯21 Β· β 580 Β· π£) - spaCy pipelines for pre-trained BERT, XLNet and GPT-2. MIT
DeepMatcher (π₯20 Β· β 2.7K) - Python package for performing Entity and Text Matching using Deep Learning. BSD-3
DELTA (π₯20 Β· β 1.1K) - DELTA is a deep learning based natural language and speech processing platform. Apache-2
Sockeye (π₯20 Β· β 900) - Sequence-to-sequence framework with a focus on Neural Machine Translation.. Apache-2
YouTokenToMe (π₯20 Β· β 490) - Unsupervised text tokenizer focused on computational efficiency. MIT
NeuroNER (π₯18 Β· β 1.4K) - Named-entity recognition using neural networks. Easy-to-use and state-of-the-art.. MIT
FARM (π₯18 Β· β 510 Β· π£) - Fast & easy transfer learning for NLP. Harvesting language models for the.. Apache-2
ParlAI (π₯17 Β· β 5.2K) - A framework for training and evaluating AI models on a variety of openly available.. MIT
- GitHub (👨‍💻 100 · 🔀 980 · 📋 580 - 19% open · ⏱️ 23.01.2020):
  git clone https://github.com/facebookresearch/ParlAI
TransferNLP (π₯15 Β· β 280) - NLP library designed for reproducible experimentation management. MIT
Show 8 hidden projects...
gensim (π₯34 Β· β 10K) - Topic Modelling for Humans. βοΈLGPL-2.1
fuzzywuzzy (π₯29 Β· β 6.4K) - Fuzzy String Matching in Python. βοΈGPL-2.0
polyglot (π₯25 Β· β 1.5K) - Multilingual text (NLP) processing toolkit. βοΈGPL-3.0
flashtext (π₯22 Β· β 4.1K Β· π) - Extract Keywords from sentence or Replace keywords in sentences. MIT
stop-words (π₯20 Β· β 100 Β· π) - Get list of common stop words in various languages in Python. BSD-3
pyfasttext (π₯19 Β· β 230 Β· π) - Yet another Python binding for fastText. βοΈGPL-3.0
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, character recognition, and classification.
scikit-image (π₯36 Β· β 3.5K) - Image processing in Python. BSD-2
torchvision (π₯33 Β· β 5.4K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
Tesseract (π₯30 Β· β 2.6K) - Python-tesseract is an optical character recognition (OCR) tool for python. Apache-2
opencv-python (π₯30 Β· β 1.1K) - Automated CI toolchain to produce precompiled opencv-python, opencv-python-.. MIT
Face Recognition (π₯29 Β· β 31K) - The world's simplest facial recognition api for Python and the command.. MIT
imutils (π₯28 Β· β 2.7K) - A series of convenience functions to make basic image processing operations such as.. MIT
imageai (π₯27 Β· β 4.4K) - A python library built to empower developers to build applications and systems with.. MIT
Albumentations (π₯27 Β· β 4.3K) - fast image augmentation library and easy to use wrapper around other.. MIT
Kornia (π₯24 Β· β 1.8K) - Open Source Differentiable Computer Vision Library for PyTorch. Apache-2
InsightFace (π₯22 Β· β 6K) - Face Analysis Project on MXNet. MIT
Image Deduplicator (π₯22 Β· β 2.9K) - Finding duplicate images made easy!. Apache-2
tensorflow-graphics (π₯22 Β· β 2K) - TensorFlow Graphics: Differentiable Graphics Layers for TensorFlow. Apache-2
Image Super-Resolution (π₯22 Β· β 1.5K) - Super-scale your images and run experiments with Residual.. Apache-2
- GitHub (👨‍💻 5 · 🔀 270 · 📦 9 · 📋 87 - 27% open · ⏱️ 08.01.2020):
  git clone https://github.com/idealo/image-super-resolution
- PyPi (📥 1.2K / month · 📦 4 · ⏱️ 08.01.2020):
  pip install ISR
- Dockerhub (📥 61 · ⏱️ 01.04.2019):
  docker pull idealo/image-super-resolution-gpu
Pythia (π₯20 Β· β 3K) - A modular framework for vision & language multimodal research from Facebook AI.. BSD-3
- GitHub (👨‍💻 21 · 🔀 380 · 📦 3 · 📋 170 - 22% open · ⏱️ 11.12.2019):
  git clone https://github.com/facebookresearch/pythia
MMDetection (π₯19 Β· β 8.2K) - Open MMLab Detection Toolbox and Benchmark. Apache-2
- GitHub (👨‍💻 89 · 🔀 2.5K · 📦 3 · 📋 1.6K - 22% open · ⏱️ 22.01.2020):
  git clone https://github.com/open-mmlab/mmdetection
Face Alignment (π₯19 Β· β 3.5K) - 2D and 3D Face alignment library build using pytorch. BSD-3
detectron2 (π₯16 Β· β 7.5K Β· π£) - Detectron2 is FAIR's next-generation research platform for object.. Apache-2
- GitHub (👨‍💻 61 · 🔀 1.3K · 📋 640 - 7% open · ⏱️ 22.01.2020):
  git clone https://github.com/facebookresearch/detectron2
Classy Vision (π₯16 Β· β 570 Β· π£) - An end-to-end PyTorch framework for image and video classification. MIT
keras-ocr (π₯16 Β· β 360 Β· π£) - A packaged and flexible version of the CRAFT text detector and Keras CRNN.. MIT
pycls (π₯11 Β· β 540) - Codebase for Image Classification Research, written in PyTorch. MIT
- GitHub (👨‍💻 5 · 🔀 32 · 📋 4 - 25% open · ⏱️ 13.01.2020):
  git clone https://github.com/facebookresearch/pycls
PySlowFast (π₯10 Β· β 1.9K Β· π£) - PySlowFast: video understanding codebase from FAIR for reproducing.. Apache-2
- GitHub (👨‍💻 1 · 🔀 280 · 📋 97 - 29% open · ⏱️ 31.12.2019):
  git clone https://github.com/facebookresearch/SlowFast
Show 2 hidden projects...
Pillow-SIMD (π₯25 Β· β 1.3K) - The friendly PIL fork. βοΈPIL
Libraries for graph processing, clustering, embedding, and machine learning tasks.
PyTorch Geometric (π₯25 Β· β 6.4K) - Geometric Deep Learning Extension Library for PyTorch. MIT
dgl (π₯24 Β· β 3.9K) - Python package built to ease deep learning on graph, on top of existing DL.. Apache-2
StellarGraph (π₯23 Β· β 910) - StellarGraph - Machine Learning on Graphs. Apache-2
graph-nets (π₯20 Β· β 4.1K) - Build Graph Nets in Tensorflow. Apache-2
torch-cluster (π₯20 Β· β 190) - PyTorch Extension Library of Optimized Graph Cluster Algorithms. MIT
PyTorch-BigGraph (π₯19 Β· β 2K) - Generate embeddings from large-scale graph-structured data. BSD-3
AmpliGraph (π₯18 Β· β 1K) - Python library for Representation Learning on Knowledge Graphs.. Apache-2
DeepGraph (π₯15 Β· β 210 Β· π€) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
OpenKE (π₯14 Β· β 1.6K) - An Open-Source Package for Knowledge Embedding (KE). MIT
- GitHub (👨‍💻 10 · 🔀 510 · 📋 210 - 11% open · ⏱️ 04.01.2020):
  git clone https://github.com/thunlp/OpenKE
OpenNE (π₯14 Β· β 1.2K) - An Open-Source Package for Network Embedding (NE). MIT
- GitHub (👨‍💻 10 · 🔀 390 · 📋 79 - 56% open · ⏱️ 12.08.2019):
  git clone https://github.com/thunlp/OpenNE
GraphEmbedding (π₯13 Β· β 800) - Implementation and experiments of graph embedding algorithms.deep.. MIT
- GitHub (👨‍💻 4 · 🔀 260 · 📦 6 · 📋 23 - 65% open · ⏱️ 03.11.2019):
  git clone https://github.com/shenweichen/GraphEmbedding
GraphVite (π₯10 Β· β 590 Β· π£) - GraphVite: A General and High-performance Graph Embedding System. Apache-2
Show 5 hidden projects...
igraph (π₯24 Β· β 570) - Python interface for igraph. βοΈGPL-2.0
DeepWalk (π₯19 Β· β 1.7K) - DeepWalk - Deep Learning for Graphs. βοΈGPL-3.0
PyKEEN (π₯15 Β· β 74) - A Python library for learning and evaluating knowledge graph embeddings. MIT
GraphSAGE (π₯14 Β· β 1.4K Β· π) - Representation learning on large graphs using stochastic graph convolutions. MIT
- GitHub (👨‍💻 9 · 🔀 440 · 📋 100 - 54% open · ⏱️ 19.09.2018):
  git clone https://github.com/williamleif/GraphSAGE
Karate Club (π₯14 Β· β 260 Β· π£) - A general purpose community detection and network embedding library for.. βοΈGPL-3.0
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
DeepSpeech (π₯30 Β· β 13K) - A TensorFlow implementation of Baidu's DeepSpeech architecture. MPL-2.0
Pydub (π₯29 Β· β 4.1K Β· π€) - Manipulate audio with a simple and easy high level interface. MIT
SpeechRecognition (π₯28 Β· β 4.5K) - Speech recognition module for Python, supporting several engines and.. BSD-3
spleeter (π₯25 Β· β 9.7K Β· π£) - Deezer source separation library including pretrained models. MIT
pyAudioAnalysis (π₯24 Β· β 2.8K) - Python Audio Analysis Library: Feature Extraction, Classification,.. Apache-2
torchaudio (π₯24 Β· β 790) - Data manipulation and transformation for audio signal processing, powered by.. BSD-2
Show 3 hidden projects...
Essentia (π₯23 Β· β 1.4K) - C++ library for audio and music analysis, description and synthesis,.. βοΈAGPL-3.0
aubio (π₯20 Β· β 1.6K) - a library for audio and music analysis. βοΈGPL-3.0
- GitHub (👨‍💻 19 · 🔀 240 · 📦 120 · 📋 230 - 28% open · ⏱️ 17.11.2019):
  git clone https://github.com/aubio/aubio
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
pyproj (π₯29 Β· β 430) - Python interface to PROJ (cartographic projections and coordinate transformations.. MIT
ipyleaflet (π₯25 Β· β 840) - A Jupyter - Leaflet.js bridge. MIT
- GitHub (👨‍💻 52 · 🔀 220 · 📦 9 · 📋 220 - 37% open · ⏱️ 17.01.2020):
  git clone https://github.com/jupyter-widgets/ipyleaflet
- PyPi (📥 11K / month · 📦 98 · ⏱️ 17.01.2020):
  pip install ipyleaflet
- NPM (📥 9.4K / month · 📦 1 · ⏱️ 17.01.2020):
  npm install jupyter-leaflet
ArcGIS API (π₯23 Β· β 650) - Documentation and samples for ArcGIS API for Python. Apache-2
- GitHub (👨‍💻 37 · 🔀 470 · 📋 200 - 34% open · ⏱️ 16.01.2020):
  git clone https://github.com/Esri/arcgis-python-api
- PyPi (📥 4.3K / month · 📦 8 · ⏱️ 31.10.2019):
  pip install arcgis
- Dockerhub (📥 2.9K · ⭐ 25 · ⏱️ 02.07.2019):
  docker pull esridocker/arcgis-api-python-notebook
Mapbox GL (π₯22 Β· β 500) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
geoplotlib (π₯20 Β· β 810 Β· π€) - python toolbox for visualizing geographical data and making maps. MIT
Show 6 hidden projects...
Cartopy (π₯27 Β· β 1.2K) - Rasterio reads and writes geospatial raster datasets. βοΈLGPL-3.0
Satpy (π₯21 Β· β 570) - Python package for earth-observing satellite data processing. βοΈGPL-3.0
Sentinelsat (π₯20 Β· β 390) - Search and download Copernicus Sentinel satellite images. βοΈGPL-3.0
pymap3d (π₯20 Β· β 110) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef enu eci. BSD-2
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.
Alpha Vantage (π₯28 Β· β 1.9K) - A python wrapper for Alpha Vantage API for financial data. MIT
yfinance (π₯25 Β· β 860) - Yahoo! Finance market data downloader (+faster Pandas Datareader). Apache-2
empyrical (π₯24 Β· β 470) - Common financial risk and performance metrics. Used by zipline and pyfolio. Apache-2
finmarketpy (π₯19 Β· β 2.1K) - Python library for backtesting trading strategies & analyzing financial.. Apache-2
Crypto Signals (π₯19 Β· β 1.9K) - Automated Crypto Trading & Technical Analysis (TA) Bot for Bittrex, Binance,.. MIT
Show 6 hidden projects...
backtrader (π₯24 Β· β 3.2K) - Python Backtesting library for trading strategies. βοΈGPL-3.0
Enigma Catalyst (π₯23 Β· β 1.7K Β· π) - An Algorithmic Trading Library for Crypto-Assets in Python. Apache-2
PyAlgoTrade (π₯22 Β· β 2.7K Β· π) - Python Algorithmic Trading Library. Apache-2
arch (π₯21 Β· β 440) - ARCH models in Python. βοΈNCSA
FinTA (π₯20 Β· β 370) - Common financial technical indicators implemented in Pandas. βοΈLGPL-3.0
Backtesting.py (π₯15 Β· β 260) - Backtest trading strategies in Python. βοΈAGPL-3.0
Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.
Prophet (π₯28 Β· β 10K) - Tool for producing high quality forecasts for time series data that has multiple.. MIT
sktime (π₯19 Β· β 470) - A scikit-learn compatible Python toolbox for machine learning with time series. BSD-3
tick (π₯19 Β· β 260) - Module for statistical learning, with a particular emphasis on time-dependent modelling. BSD-3
STUMPY (π₯18 Β· β 1K) - STUMPY is a powerful and scalable Python library that can be used for a variety of.. BSD-3
ADTK (π₯17 Β· β 360 Β· π£) - A Python toolkit for rule-based/unsupervised anomaly detection in time series. MPL-2.0
Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.
MNE (π₯30 Β· β 1.2K) - MNE : Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
DIPY (π₯28 Β· β 300) - DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic methods for.. BSD-3
NiBabel (π₯28 Β· β 300) - Python package to access a cacophony of neuro-imaging file formats. MIT
NiftyNet (π₯24 Β· β 1.1K) - An open-source convolutional neural networks platform for research in.. Apache-2
DeepVariant (π₯20 Β· β 1.9K) - DeepVariant is an analysis pipeline that uses a deep neural network to call.. BSD-3
MedicalTorch (π₯15 Β· β 580) - A medical imaging framework for Pytorch. Apache-2
Medical Detection Toolkit (π₯13 Β· β 650) - The Medical Detection Toolkit contains 2D + 3D.. Apache-2
- GitHub (👨‍💻 3 · 🔀 170 · 📋 90 - 18% open · ⏱️ 19.12.2019):
  git clone https://github.com/pfjaeger/medicaldetectiontoolkit
MedicalNet (π₯12 Β· β 760 Β· π£) - Many studies have shown that the performance on deep learning is significantly.. MIT
- GitHub (👨‍💻 1 · 🔀 200 · 📋 41 - 68% open · ⏱️ 07.01.2020):
  git clone https://github.com/Tencent/MedicalNet
Show 4 hidden projects...
NIPY (π₯20 Β· β 260 Β· π€) - Neuroimaging in Python FMRI analysis package. βοΈDSDP
MedPy (π₯19 Β· β 230 Β· π€) - Medical image processing in Python. βοΈGPL-3.0
General-purpose data containers & structures as well as utilities & extensions for pandas.
pandas (π₯41 Β· β 23K) - Flexible and powerful data analysis / manipulation library for Python, providing.. BSD-3
h5py (π₯34 Β· β 1.3K) - HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data.. BSD-3
Arrow (π₯32 Β· β 5K) - Apache Arrow is a cross-language development platform for in-memory data. It.. Apache-2
numexpr (π₯30 Β· β 1.2K) - Fast numerical array expression evaluator for Python, NumPy, PyTables, pandas, bcolz.. MIT
TinyDB (π₯29 Β· β 3.3K) - TinyDB is a lightweight document oriented database optimized for your happiness :). MIT
Bottleneck (π₯28 Β· β 440) - Fast NumPy array functions written in C. BSD-2
datasketch (π₯27 Β· β 1.1K) - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH.. MIT
Modin (π₯26 Β· β 4.1K) - Modin: Speed up your Pandas workflows by changing a single line of code. Apache-2
zarr (π₯26 Β· β 420) - An implementation of chunked, compressed, N-dimensional arrays for Python. MIT
Vaex (π₯23 Β· β 2.7K) - Out-of-Core DataFrames for Python, visualize and explore big tabular data at a billion.. MIT
swifter (π₯21 Β· β 980) - A package which efficiently applies any function to a pandas dataframe or series in.. MIT
pickleDB (π₯21 Β· β 440) - pickleDB is an open source key-value store using Python's json module. BSD-3
Pandas Summary (π₯21 Β· β 310) - An extension to pandas dataframes describe function. MIT
PandaralΒ·lel (π₯20 Β· β 600) - A simple and efficient tool to parallelize Pandas operations on all.. BSD-3
Bounter (π₯17 Β· β 870) - Efficient Counter that uses a limited (bounded) amount of memory regardless of data.. MIT
Show 4 hidden projects...
sklearn-pandas (π₯25 Β· β 2.1K Β· π€) - Pandas integration with sklearn. βοΈZlib
Arctic (π₯24 Β· β 1.8K) - Arctic is a high performance datastore for numeric data. βοΈLGPL-2.1
StaticFrame (π₯16 Β· β 140) - The StaticFrame library consists of the Series and Frame, immutable data.. MIT
Libraries for loading, collecting, and extracting data from a variety of data sources and formats.
xmltodict (π₯31 Β· β 3.8K) - Python module that makes working with XML feel like you are working with JSON. MIT
tensorflow-datasets (π₯30 Β· β 1.8K) - A collection of datasets ready to use with TensorFlow. Apache-2
pandas-datareader (π₯30 Β· β 1.3K) - Extract data from a wide range of Internet sources into a pandas.. BSD-3
smart-open (π₯29 Β· β 1.5K) - Utils for streaming large files (S3, HDFS, gzip, bz2...). MIT
PDFMiner (π₯28 Β· β 4.1K) - Python PDF Parser (Not actively maintained). Check out pdfminer.six. MIT
csvkit (π₯28 Β· β 4.1K) - A suite of utilities for converting to and working with CSV, the king of tabular file.. MIT
snorkel (π₯25 Β· β 3.6K) - A system for quickly generating training data with weak supervision. Apache-2
Intake (π₯25 Β· β 370) - Intake is a lightweight package for finding, investigating, loading and.. BSD-2
Show 2 hidden projects...
Singer (π₯23 Β· β 460 Β· π€) - Standard for moving data between databases, web APIs, files, queues, and.. βοΈAGPL-3.0
Libraries for web scraping, crawling, downloading, and mining.
youtube-dl (π₯37 Β· β 61K) - Command-line program to download videos from YouTube.com and other video.. Unlicense
Scrapy (π₯36 Β· β 36K) - Scrapy, a fast high-level web crawling & scraping framework for Python. BSD-3
newspaper3k (π₯29 Β· β 9K Β· π€) - News, full-text, and article metadata extraction in Python 3. Advanced docs:. MIT
Requests-HTML (π₯28 Β· β 11K) - Pythonic HTML Parsing for Humans. MIT
Google Images Download (π₯27 Β· β 6.3K Β· π€) - Python Script to download hundreds of images from 'Google.. MIT
MechanicalSoup (π₯27 Β· β 3.4K) - A Python library for automating interaction with websites. MIT
beautifulsoup4 (π₯21) - Screen-scraping library. MIT
- PyPi (📥 7.8M / month · 📦 62K · ⏱️ 24.12.2019):
  pip install beautifulsoup4
Transistor (π₯14 Β· β 200) - Transistor, a Python web scraping framework for intelligent use cases. MIT
lazynlp (π₯13 Β· β 1.7K) - Library to scrape and clean web pages to create massive datasets. MIT
- GitHub (👨‍💻 4 · 🔀 250 · 📋 8 - 62% open · ⏱️ 07.10.2019):
  git clone https://github.com/chiphuyen/lazynlp
Show 3 hidden projects...
Pattern (π₯29 Β· β 7.3K Β· π) - Web mining module for Python, with tools for scraping, natural language.. BSD-3
Photon (π₯20 Β· β 6.2K) - Incredibly fast crawler designed for OSINT. βοΈGPL-3.0
Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
Celery (π₯37 Β· β 14K) - Asynchronous task queue/job queue based on distributed message passing. BSD-3
Airflow (π₯35 Β· β 15K) - Platform to programmatically author, schedule, and monitor workflows. Apache-2
- GitHub (👨‍💻 1.3K · 🔀 5.8K · 📥 400 · 📦 720 · ⏱️ 23.01.2020):
  git clone https://github.com/apache/airflow
- PyPi (📥 460K / month · 📦 290 · ⏱️ 24.12.2019):
  pip install apache-airflow
- Conda:
  conda install -c conda-forge airflow
- Dockerhub (📥 330K · ⭐ 79 · ⏱️ 23.01.2020):
  docker pull apache/airflow
luigi (π₯33 Β· β 13K) - Luigi is a Python module that helps you build complex pipelines of batch jobs. It.. Apache-2
Beam (π₯32 Β· β 3.7K) - Unified programming model to define and execute data processing pipelines,.. Apache-2
Kedro (π₯26 Β· β 1.9K) - A Python library that implements software engineering best-practice for data and.. Apache-2
PyFunctional (π₯26 Β· β 1.6K) - Python library for creating data pipelines with chain functional programming. MIT
dbt (π₯26 Β· β 1.3K) - dbt (data build tool) enables data analysts and engineers to transform their data.. Apache-2
TFX (π₯26 Β· β 860) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
Dagster (π₯25 Β· β 1.2K) - A Python library for building data applications: ETL, ML, Data Pipelines, and.. Apache-2
streamparse (π₯24 Β· β 1.4K) - Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a.. Apache-2
Great Expectations (π₯24 Β· β 1.2K) - Always know what to expect from your data. Apache-2
pysparkling (π₯20 Β· β 210) - A pure Python implementation of Apache Spark's RDD and DStream interfaces. MIT
Data Integration (π₯18 Β· β 1.4K) - A lightweight opinionated ETL framework, halfway between plain scripts and.. MIT
Databolt Flow (π₯18 Β· β 830) - Python library for building highly effective data science workflows. MIT
riko (π₯17 Β· β 1.5K Β· π€) - A Python stream processing engine modeled after Yahoo! Pipes. MIT
Botflow (π₯15 Β· β 1.1K Β· π€) - Python Fast Dataflow programming framework for Data pipeline work( Web.. BSD-3
Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.
Ray (π₯32 Β· β 10K) - A fast and simple framework for building and running distributed applications. Ray.. Apache-2
horovod (π₯29 Β· β 8.5K) - Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. Apache-2
ipyparallel (π₯28 Β· β 1.6K) - Interactive Parallel Computing in Python. BSD-3
TensorFlowOnSpark (π₯25 Β· β 3.4K) - TensorFlowOnSpark brings TensorFlow programs to Apache Spark.. Apache-2
BigDL (π₯24 Β· β 3.2K) - BigDL: Distributed Deep Learning Library for Apache Spark. Apache-2
- GitHub (👨‍💻 69 · 🔀 820 · 📦 9 · 📋 860 - 18% open · ⏱️ 08.01.2020):
  git clone https://github.com/intel-analytics/BigDL
- PyPi (📥 820 / month · 📦 2 · ⏱️ 04.11.2019):
  pip install bigdl
- Maven:
  <dependency>
    <groupId>com.intel.analytics.bigdl</groupId>
    <artifactId>bigdl-SPARK_2.4</artifactId>
    <version>[VERSION]</version>
  </dependency>
MMLSpark (🏅19 · ⭐ 1.8K) - Microsoft Machine Learning for Apache Spark. MIT
- GitHub (👨‍💻 53 · 🔀 400 · 📋 300 - 42% open · ⏱️ 23.01.2020):
  git clone https://github.com/Azure/mmlspark
TensorFrames (🏅19 · ⭐ 770) - [DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark. Apache-2
Apache Singa (🏅17 · ⭐ 1.9K) - Mirror of Apache Singa. Apache-2
BytePS (🏅16 · ⭐ 2K) - A high performance and generic framework for distributed DNN training. Apache-2
Show 2 hidden projects...
DEAP (🏅28 · ⭐ 3.4K) - Distributed Evolutionary Algorithms in Python. ❗️LGPL-3.0
LazyCluster (🏅7 · ⭐ 16 · 🐣) - Distributed machine learning made simple. Apache-2
Libraries for hyperparameter optimization, automl and neural architecture search.
Hyperopt (🏅29 · ⭐ 4.2K) - Distributed Asynchronous Hyperparameter Optimization in Python. BSD-3
featuretools (🏅27 · ⭐ 4.5K) - An open source python library for automated feature engineering. BSD-3
scikit-optimize (🏅27 · ⭐ 1.6K) - Sequential model-based optimization with a `scipy.optimize` interface. BSD-3
Bayesian Optimization (🏅26 · ⭐ 3.8K) - A Python implementation of global optimization with gaussian processes. MIT
auto-sklearn (🏅25 · ⭐ 4.2K) - Automated Machine Learning with scikit-learn. BSD-3
Hyperas (🏅25 · ⭐ 1.9K) - Keras + Hyperopt: A very simple wrapper for convenient hyperparameter optimization. MIT
NNI (🏅24 · ⭐ 5.2K) - An open source AutoML toolkit for neural architecture search, model compression and.. MIT
Keras Tuner (🏅22 · ⭐ 1.7K) - Hyperparameter tuning for humans. Apache-2
Test Tube (🏅21 · ⭐ 530) - Python library to easily log experiments and parallelize hyperparameter search for.. MIT
HpBandSter (🏅19 · ⭐ 320 · 💤) - a distributed Hyperband implementation on Steroids. BSD-3
Advisor (🏅18 · ⭐ 1.1K) - Open-source implementation of Google Vizier for hyper parameters tuning. Apache-2
HyperparameterHunter (🏅18 · ⭐ 570) - Easy hyperparameter optimization and automatic result saving across.. MIT
Dragonfly (🏅18 · ⭐ 460) - An open source python library for scalable Bayesian optimisation. MIT
Auto Tune Models (🏅17 · ⭐ 470) - Auto Tune Models - A multi-tenant, multi-data system for automated machine.. MIT
AlphaPy (🏅17 · ⭐ 360) - Automated Machine Learning [AutoML] with Python, scikit-learn, and Keras. Apache-2
automl-gs (🏅15 · ⭐ 1.6K · 💤) - Provide an input CSV and a target field to predict, generate a model + code to.. MIT
ENAS (🏅13 · ⭐ 2.1K · 💤) - PyTorch implementation of Efficient Neural Architecture Search via Parameters.. Apache-2
- GitHub (👨‍💻 5 · 🔀 390 · 📋 41 - 82% open · ⏱️ 18.06.2019):
  git clone https://github.com/carpedm20/ENAS-pytorch
Devol (🏅11 · ⭐ 890) - Genetic neural architecture search with Keras. MIT
- GitHub (👨‍💻 17 · 🔀 100 · 📋 26 - 23% open · ⏱️ 19.08.2019):
  git clone https://github.com/joeddav/devol
Show 9 hidden projects...
TPOT (🏅28 · ⭐ 6.7K) - A Python Automated Machine Learning tool that optimizes machine learning.. ❗️LGPL-3.0
auto_ml (🏅20 · ⭐ 1.4K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production. MIT
MLBox (🏅19 · ⭐ 990) - MLBox is a powerful Automated Machine Learning python library. ❗️BSD-1-Clause
Sherpa (🏅19 · ⭐ 180) - Hyperparameter optimization that enables researchers to experiment, visualize,.. ❗️GPL-3.0
Xcessiv (🏅16 · ⭐ 1.2K · 💀) - A web-based application for quick, scalable, and automated hyperparameter.. Apache-2
Parfit (🏅16 · ⭐ 200 · 💀) - A package for parallelizing the fit and flexibly scoring of sklearn machine.. MIT
Hypertunity (🏅13 · ⭐ 110) - A toolset for black-box hyperparameter optimisation. Apache-2
Auptimizer (🏅12 · ⭐ 120 · 🐣) - An automatic ML model optimization tool. ❗️GPL-3.0
Libraries for building and evaluating reinforcement learning & agent-based systems.
OpenAI Gym (🏅34 · ⭐ 19K) - A toolkit for developing and comparing reinforcement learning algorithms. MIT
TensorLayer (🏅27 · ⭐ 5.9K) - Deep Learning and Reinforcement Learning Library for Scientists. Apache-2
Dopamine (🏅25 · ⭐ 8.6K) - Dopamine is a research framework for fast prototyping of reinforcement.. Apache-2
TensorForce (🏅25 · ⭐ 2.6K) - Tensorforce: a TensorFlow library for applied reinforcement learning. Apache-2
Stable Baselines (🏅25 · ⭐ 1.6K) - A fork of OpenAI Baselines, implementations of reinforcement learning.. MIT
TF-Agents (🏅25 · ⭐ 1.1K) - TF-Agents is a library for Reinforcement Learning in TensorFlow. Apache-2
ViZDoom (🏅23 · ⭐ 990) - Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information. MIT
ChainerRL (🏅23 · ⭐ 790) - ChainerRL is a deep reinforcement learning library built on top of Chainer. MIT
Coach (🏅21 · ⭐ 1.6K) - Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with.. Apache-2
TensorTrade (🏅19 · ⭐ 1.1K · 🐣) - An open source reinforcement learning framework for training,.. Apache-2
ReAgent (🏅16 · ⭐ 2.4K) - A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits,.. BSD-3
- GitHub (👨‍💻 46 · 🔀 300 · 📋 69 - 23% open · ⏱️ 13.01.2020):
  git clone https://github.com/facebookresearch/ReAgent
PARL (🏅15 · ⭐ 570) - PARL: A high-performance distributed training framework for Reinforcement Learning. Apache-2
Show 1 hidden projects...
DeepMind Lab (🏅17 · ⭐ 6K) - A customisable 3D platform for agent-based AI research. ❗️GPL-2.0
- GitHub (👨‍💻 7 · 🔀 1.2K · 📋 170 - 18% open · ⏱️ 07.01.2020):
  git clone https://github.com/deepmind/lab
Libraries for building and evaluating recommendation systems.
scikit-surprise (🏅28 · ⭐ 3.7K) - A Python scikit for building and analyzing recommender systems. BSD-3
lightfm (🏅25 · ⭐ 2.9K) - A Python implementation of LightFM, a hybrid recommendation algorithm. Apache-2
implicit (🏅25 · ⭐ 1.8K) - Fast Python Collaborative Filtering for Implicit Feedback Datasets. MIT
TensorFlow Ranking (🏅21 · ⭐ 1.5K) - Learning to Rank in TensorFlow. Apache-2
Recommenders (🏅19 · ⭐ 6.9K) - Best Practices on Recommendation Systems. MIT
- GitHub (👨‍💻 56 · 🔀 940 · 📦 1 · 📋 430 - 19% open · ⏱️ 17.01.2020):
  git clone https://github.com/microsoft/recommenders
tensorrec (🏅19 · ⭐ 1K) - A TensorFlow recommendation algorithm and framework in Python. Apache-2
Case Recommender (🏅17 · ⭐ 220) - Case Recommender: A Flexible and Extensible Python Framework for.. MIT
OpenRec (🏅16 · ⭐ 290) - OpenRec is an open-source and modular library for neural network-inspired.. Apache-2
Show 1 hidden projects...
recmetrics (🏅17 · ⭐ 120) - A library of metrics for evaluating recommender systems. MIT
Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.
PySyft (🏅25 · ⭐ 4.6K) - A library for encrypted, privacy preserving machine learning. Apache-2
TensorFlow Privacy (🏅21 · ⭐ 990) - Library for training machine learning models with privacy for.. Apache-2
TFEncrypted (🏅21 · ⭐ 590) - A Framework for Machine Learning on Encrypted Data. Apache-2
CrypTen (🏅11 · ⭐ 420 · 🐣) - A framework for Privacy Preserving Machine Learning. MIT
- GitHub (👨‍💻 11 · 🔀 56 · 📋 9 - 33% open · ⏱️ 16.01.2020):
  git clone https://github.com/facebookresearch/CrypTen
Libraries to organize, track, and visualize machine learning experiments.
Tensorboard (🏅36 · ⭐ 4.4K) - TensorFlow's Visualization Toolkit. Apache-2
tensorboardX (🏅29 · ⭐ 6K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
sacred (🏅28 · ⭐ 2.6K) - Sacred is a tool to help you configure, organize, log and reproduce experiments.. MIT
SageMaker SDK (🏅27 · ⭐ 960) - A library for training and deploying machine learning models on.. Apache-2
AzureML SDK (🏅26 · ⭐ 1.3K) - Python notebooks with ML and deep learning examples with Azure Machine Learning.. MIT
TensorWatch (🏅22 · ⭐ 2.6K) - Debugging, monitoring and visualization for Python Machine Learning and Data.. MIT
TRAINS (🏅22 · ⭐ 1.1K) - TRAINS - Auto-Magical Experiment Manager & Version Control for AI - NOW WITH.. Apache-2
Metaflow (🏅21 · ⭐ 2.8K · 🐣) - Build and manage real-life data science projects with ease. Apache-2
knockknock (🏅20 · ⭐ 1.2K) - Knock Knock: Get notified when your training ends with only two additional lines.. MIT
Guild AI (🏅20 · ⭐ 240) - Open source experiment tracking and optimization for machine learning. Apache-2
VisualDL (🏅18 · ⭐ 2.4K) - A platform to visualize the deep learning process and result. Apache-2
TensorBoard Logger (🏅18 · ⭐ 600) - Log TensorBoard events without touching TensorFlow. MIT
Show 3 hidden projects...
SKLL (🏅20 · ⭐ 500) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning.. ❗️BSD-1-Clause
steppy (🏅15 · ⭐ 110 · 💀) - Lightweight, Python library for fast and reproducible experimentation. MIT
ModelChimp (🏅14 · ⭐ 120) - Experiment tracking for machine and deep learning projects. BSD-2
Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.
Core ML Tools (🏅27 · ⭐ 1.6K) - Core ML Community Tools. BSD-3
mmdnn (🏅23 · ⭐ 4.4K) - MMdnn is a set of tools to help users inter-operate among different deep learning.. MIT
model-optimization (🏅23 · ⭐ 630) - A suite of tools that users, both novice and advanced, can use to.. Apache-2
pytorch2keras (🏅18 · ⭐ 460) - PyTorch to Keras model converter. MIT
sklearn-porter (🏅17 · ⭐ 770) - Transpile trained scikit-learn estimators to C, Java, JavaScript and others. MIT
Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.
shap (🏅29 · ⭐ 7.7K) - A game theoretic approach to explain the output of any machine learning model. MIT
eli5 (🏅28 · ⭐ 1.8K) - A library for debugging/inspecting machine learning classifiers and explaining their.. MIT
Lime (🏅27 · ⭐ 7K) - Lime: Explaining the predictions of any machine learning classifier. BSD-2
Model Analysis (🏅26 · ⭐ 920) - Model analysis tools for TensorFlow. Apache-2
yellowbrick (🏅25 · ⭐ 2.6K) - Visual analysis and diagnostic tools to facilitate machine learning.. Apache-2
pyLDAvis (🏅25 · ⭐ 1.1K · 💤) - Python library for interactive topic model visualization. Port of the R.. BSD-3
Lucid (🏅23 · ⭐ 3.4K) - A collection of infrastructure and tools for research in neural network.. Apache-2
InterpretML (🏅23 · ⭐ 2.3K) - Fit interpretable machine learning models. Explain blackbox machine learning. MIT
Fairness 360 (🏅22 · ⭐ 820) - A comprehensive set of fairness metrics for datasets and machine learning.. Apache-2
TreeInterpreter (🏅20 · ⭐ 550 · 💤) - Package for interpreting scikit-learn's decision tree and random.. BSD-3
tf-explain (🏅20 · ⭐ 520 · 🐣) - Interpretability Methods for tf.keras models with Tensorflow 2.0. MIT
Alibi (🏅19 · ⭐ 420) - Algorithms for monitoring and explaining machine learning models. Apache-2
iNNvestigate (🏅18 · ⭐ 500 · 💤) - A toolbox to iNNvestigate neural networks' predictions! BSD-2
Explainability 360 (🏅18 · ⭐ 450 · 🐣) - Interpretability and explainability of data and machine learning.. Apache-2
sklearn-evaluation (🏅17 · ⭐ 280 · 💤) - scikit-learn model evaluation made easy: plots, tables and markdown.. MIT
Show 3 hidden projects...
scikit-plot (🏅23 · ⭐ 1.8K · 💀) - An intuitive library to add plotting functionality to scikit-learn objects. MIT
Skater (🏅17 · ⭐ 880) - Python Library for Model Interpretation/Explanations. ❗️UPL-1.0
Attribution Priors (🏅12 · ⭐ 51) - Tools for training explainable models using attribution priors. MIT
Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search. For an awesome comparison, refer to this benchmark.
Annoy (🏅29 · ⭐ 6.5K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage and.. Apache-2
Faiss (🏅27 · ⭐ 8.8K) - A library for efficient similarity search and clustering of dense vectors. MIT
NMSLIB (🏅27 · ⭐ 1.8K) - Non-Metric Space Library (NMSLIB): An efficient similarity search library and a.. Apache-2
hnswlib (🏅20 · ⭐ 730) - Header-only C++/python library for fast approximate nearest neighbors. Apache-2
PyNNDescent (🏅20 · ⭐ 200) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
Show 3 hidden projects...
Magnitude (🏅21 · ⭐ 1.2K · 💀) - A fast, efficient universal vector embedding utility package. MIT
Libraries providing capabilities for probabilistic programming/reasoning, bayesian inference, and gaussian processes.
PyMC3 (🏅30 · ⭐ 4.8K) - Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine.. Apache-2
tensorflow-probability (🏅30 · ⭐ 2.6K) - Probabilistic reasoning and statistical analysis in TensorFlow. Apache-2
Pyro (🏅28 · ⭐ 5.9K) - Deep universal probabilistic programming with Python and PyTorch. Apache-2
pomegranate (🏅27 · ⭐ 2.2K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
GPyTorch (🏅26 · ⭐ 1.8K) - A highly efficient and modular implementation of Gaussian Processes in PyTorch. MIT
ZhuSuan (🏅15 · ⭐ 1.8K) - A library for Bayesian deep learning, generative models, based on Tensorflow. MIT
- GitHub (👨‍💻 20 · 🔀 330 · 📋 60 - 13% open · ⏱️ 05.08.2019):
  git clone https://github.com/thu-ml/zhusuan
Show 3 hidden projects...
PyStan (🏅26 · ⭐ 760) - PyStan, the Python interface to Stan. ❗️GPL-3.0
Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.
Foolbox (🏅25 · ⭐ 1.3K) - Python toolbox to create adversarial examples that fool neural networks in PyTorch,.. MIT
ART (🏅25 · ⭐ 1.2K) - Python library for adversarial machine learning (evasion, extraction, poisoning,.. MIT
CleverHans (🏅24 · ⭐ 4.4K) - An adversarial example library for constructing attacks, building defenses,.. MIT
AdvBox (🏅17 · ⭐ 790) - Advbox is a toolbox to generate adversarial examples that fool neural networks in.. Apache-2
Show 2 hidden projects...
advertorch (🏅19 · ⭐ 490) - A Toolbox for Adversarial Robustness Research. ❗️GPL-3.0
Libraries that require and make use of CUDA/GPU system capabilities to optimize data handling and machine learning tasks.
jax (🏅27 · ⭐ 6.3K) - Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT.. Apache-2
scikit-cuda (🏅22 · ⭐ 730) - Python interface to GPU-powered libraries. BSD-3
Apex (🏅19 · ⭐ 3.3K) - A PyTorch Extension: Tools for easy mixed precision and distributed training in.. BSD-3
DALI (🏅18 · ⭐ 2.3K) - A library containing both highly optimized building blocks and an execution engine.. Apache-2
- GitHub (👨‍💻 41 · 🔀 270 · 📋 490 - 29% open · ⏱️ 23.01.2020):
  git clone https://github.com/NVIDIA/DALI
SpeedTorch (🏅18 · ⭐ 530 · 🐣) - Library for faster pinned CPU - GPU transfer in Pytorch. MIT
BlazingSQL (🏅16 · ⭐ 1.1K) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built.. Apache-2
cuSignal (🏅11 · ⭐ 210 · 🐣) - GPU accelerated signal processing. Apache-2
- GitHub (👨‍💻 6 · 🔀 19 · 📋 9 - 77% open · ⏱️ 10.01.2020):
  git clone https://github.com/rapidsai/cusignal
Show 3 hidden projects...
py3nvml (🏅20 · ⭐ 130) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside your program. BSD-3
nvidia-ml-py3 (🏅16 · ⭐ 40 · 💤) - Python 3 Bindings for the NVIDIA Management Library. BSD-3
ipyexperiments (🏅13 · ⭐ 97 · 💤) - jupyter/ipython experiment containers for GPU and general RAM re-use. Apache-2
Libraries that extend TensorFlow with general-purpose capabilities.
tensor2tensor (🏅31 · ⭐ 9.5K) - Library of deep learning models and datasets designed to make deep.. Apache-2
tensorflow-hub (🏅29 · ⭐ 2.1K) - A library for transfer learning by reusing parts of TensorFlow models. Apache-2
TensorFlow Transform (🏅28 · ⭐ 790) - Input pipeline framework. Apache-2
TensorNets (🏅21 · ⭐ 890) - High level network definitions with pre-trained weights in TensorFlow. MIT
Neural Structured Learning (🏅18 · ⭐ 560 · 🐣) - Training neural models with structured signals. Apache-2
Libraries that extend Pytorch with general-purpose capabilities.
SRU (🏅22 · ⭐ 1.8K) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755). MIT
torch-scatter (🏅22 · ⭐ 290) - PyTorch Extension Library of Optimized Scatter Operations. MIT
AdaBound (🏅20 · ⭐ 2.7K · 💤) - An optimizer that trains as fast as Adam and as good as SGD. Apache-2
EfficientNets (🏅18 · ⭐ 750) - Pretrained EfficientNet, MixNet, MobileNetV3, MNASNet A1 and B1, FBNet,.. Apache-2
Torchmeta (🏅18 · ⭐ 520) - A collection of extensions and data-loaders for few-shot learning & meta-.. MIT
PyTorch Sparse (🏅18 · ⭐ 170) - PyTorch Extension Library of Optimized Autograd Sparse Matrix Operations. MIT
Higher (🏅10 · ⭐ 540 · 🐣) - higher is a pytorch library allowing users to obtain higher order.. Apache-2
- GitHub (👨‍💻 3 · 🔀 35 · 📋 20 - 55% open · ⏱️ 16.01.2020):
  git clone https://github.com/facebookresearch/higher
Libraries for connecting to, operating, and querying databases.
google-cloud-storage (🏅36 · ⭐ 3.1K) - Google Cloud Client Library for Python. Apache-2
SQLAlchemy (🏅36 · ⭐ 1.8K) - The Database Toolkit for Python. MIT
elasticsearch (🏅34 · ⭐ 2.6K) - Official Python low-level client for Elasticsearch. Apache-2
kafka-python (🏅33 · ⭐ 3.6K) - Python client for Apache Kafka. Apache-2
Cassandra Driver (🏅31 · ⭐ 1.1K) - DataStax Python Driver for Apache Cassandra. Apache-2
confluent-kafka-python (🏅30 · ⭐ 1.6K) - Confluent's Kafka Python Client. Apache-2
Prometheus Client (🏅30 · ⭐ 1.4K) - Prometheus instrumentation library for Python applications. Apache-2
Motor (🏅29 · ⭐ 1.4K) - Motor - the async Python driver for MongoDB and Tornado or asyncio. Apache-2
dataset (🏅28 · ⭐ 3.7K) - Easy-to-use data handling for SQL data stores with support for implicit table.. MIT
libcloud (🏅28 · ⭐ 1.6K) - Apache Libcloud is a Python library which hides differences between different.. Apache-2
pandas-gbq (🏅28 · ⭐ 180) - Pandas Google BigQuery. BSD-3
cx-Oracle (🏅26 · ⭐ 570) - Python interface to Oracle Database conforming to the Python DB API 2.0.. BSD-3
Ibis (🏅24 · ⭐ 1.3K) - A pandas-like deferred expression system, with first-class SQL support. Apache-2
HappyBase (🏅24 · ⭐ 510) - A developer-friendly Python library to interact with Apache HBase. MIT
neo4j-driver (🏅23 · ⭐ 450) - Neo4j Bolt driver for Python. Apache-2
AWS Data Wrangler (🏅20 · ⭐ 450) - DataFrames on AWS. Apache-2
SuperSQLite (🏅15 · ⭐ 610) - A supercharged SQLite library for Python. MIT
Show 8 hidden projects...
psycopg2 (🏅34 · ⭐ 1.6K) - PostgreSQL database adapter for the Python programming.. ❗️BSD-3-Clause-Attribution
mysqlclient (🏅30 · ⭐ 1.6K) - MySQL database connector for Python (with Python 3 support). ❗️GPL-2.0
mongo-connector (🏅25 · ⭐ 1.6K · 💀) - MongoDB data stream pipeline tools by YouGov (adopted from MongoDB). Apache-2
Libraries to monitor, collect, and visualize system metrics (e.g., memory, CPU, GPU).
memory-profiler (🏅27 · ⭐ 2.1K) - Monitor Memory usage of Python code. BSD-3
pyinstrument (🏅25 · ⭐ 1.5K) - Call stack profiler for Python. Shows you why your code is slow! BSD-3
line_profiler (🏅24 · ⭐ 3.3K · 💤) - Line-by-line profiling for Python. BSD-3
gpustat (🏅23 · ⭐ 1.6K) - A simple command-line utility for querying and monitoring GPU status. MIT
GPUtil (🏅22 · ⭐ 460) - A Python module for getting the GPU status from NVIDIA GPUs using nvidia-smi.. MIT
Scalene (🏅19 · ⭐ 1.5K · 🐣) - a high-performance, high-precision CPU and memory profiler for Python. Apache-2
StackImpact (🏅14 · ⭐ 660 · 💤) - StackImpact Python Profiler - Production-Grade Performance Profiler: CPU,.. BSD-3
Show 2 hidden projects...
Diamond (🏅28 · ⭐ 1.5K · 💀) - Diamond is a python daemon that collects system metrics and publishes them to.. MIT
Glances (🏅27 · ⭐ 15K) - Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac.. ❗️LGPL-3.0
imbalanced-learn (🏅29 · ⭐ 4.1K) - A Python Package to Tackle the Curse of Imbalanced Datasets in Machine.. MIT
Mlxtend (🏅27 · ⭐ 2.7K) - A library of extension and helper modules for Python's data analysis and.. BSD-3
PyOD (🏅26 · ⭐ 2.8K) - A Python Toolbox for Scalable Outlier Detection (Anomaly Detection). BSD-2
Streamlit (🏅25 · ⭐ 6.5K · 🐣) - Streamlit: The fastest way to build custom ML tools. Apache-2
category_encoders (🏅25 · ⭐ 1.2K) - A library of sklearn compatible categorical variable encoders. BSD-3
DeepChem (🏅23 · ⭐ 1.9K) - Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials.. MIT
kmodes (🏅23 · ⭐ 660) - Python implementations of the k-modes and k-prototypes clustering algorithms, for.. MIT
metric-learn (🏅22 · ⭐ 930) - Metric learning algorithms in Python. MIT
AstroML (🏅22 · ⭐ 590) - Machine learning, statistics, and data mining for astronomy and astrophysics. BSD-2
pyjanitor (🏅22 · ⭐ 450) - Clean APIs for data cleaning. Python implementation of R package Janitor. MIT
scikit-multilearn (🏅21 · ⭐ 470 · 💤) - A scikit-learn based module for multi-label et. al. classification. BSD-2
StreamAlert (🏅20 · ⭐ 2.1K) - StreamAlert is a serverless, realtime data analysis framework which.. Apache-2
- GitHub (👨‍💻 30 · 🔀 240 · 📋 280 - 25% open · ⏱️ 30.08.2019):
  git clone https://github.com/airbnb/streamalert
Mars (🏅20 · ⭐ 1.7K) - Mars is a tensor-based unified framework for large-scale data computation which.. Apache-2
sklearn-contrib-lightning (🏅20 · ⭐ 1.3K · 💤) - Large-scale linear classification, regression and ranking.. BSD-3
gplearn (🏅19 · ⭐ 750 · 💤) - Genetic Programming in Python, with a scikit-learn inspired API. BSD-3
apricot (🏅16 · ⭐ 240) - apricot implements submodular optimization for the purpose of selecting subsets of.. MIT
baikal (🏅15 · ⭐ 380) - A graph-based functional API for building complex scikit-learn pipelines. BSD-3
Show 2 hidden projects...
Cython BLIS (🏅25 · ⭐ 140) - Fast matrix-multiplication as a self-contained Python library, no system.. BSD-3
- Papers With Code: Discover ML papers, code, and evaluation tables.
- Sotabench: Discover & compare open-source ML models.
- Google Dataset Search: Dataset search engine by Google.
- Dataset List: List of the biggest ML datasets from across the web.
- Awesome Public Datasets: A topic-centric list of open datasets.
TBD
Contributions are encouraged and always welcome! If you would like to add or update projects, choose one of the following ways:
- Open an issue by selecting one of the provided categories from the issue page and filling in the requested information.
- Edit the projects.yaml file directly on Github and create a pull request with your additions or changes.
- Fork this repository, clone it to your computer, modify the projects.yaml file, and submit a pull request.
If you would like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository.
For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.
Created and maintained with ❤️ by developers from SAP in Berlin.