Skip to content
@neuralmagic

Neural Magic

Neural Magic helps developers in accelerating deep learning performance using automated model sparsification technologies and a CPU inference engine.

Pinned

  1. nm-vllm nm-vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 131 5

  2. deepsparse deepsparse Public

    Sparsity-aware deep learning inference runtime for CPUs

    Python 2.8k 162

  3. sparseml sparseml Public

    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

    Python 2k 138

  4. sparsezoo sparsezoo Public

    Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes

    Python 353 23

  5. examples examples Public

    Notebooks using the Neural Magic libraries 📓

    Jupyter Notebook 36 5

  6. docs docs Public

    Top-level directory for documentation and general content

    MDX 118 6

Repositories

Showing 10 of 33 repositories