Skip to content
@NolanoOrg

Nolano.ai

Compressing Foundation models for deployment on clouds, phones and laptops

Popular repositories

  1. cformers cformers Public

    SoTA Transformers with C-backend for fast inference on your CPU.

    C 313 29

  2. smol-gpt smol-gpt Public

    Smol but mighty language model

    C 61 3

  3. sparse_quant_llms sparse_quant_llms Public

    SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia

    Python 36 3

  4. llama-int4-quant llama-int4-quant Public archive

    C 25 2

  5. InstructLLaMa.cpp InstructLLaMa.cpp Public

    Fast inference of Instruct tuned LLaMa on your personal devices.

    C 22 1

  6. pydalai pydalai Public

    Python 9 2

Repositories

Showing 9 of 9 repositories

Top languages

Loading…

Most used topics

Loading…