ModelTC

Model Infra

Pinned

  1. MQBench Public

    Model Quantization Benchmark

    Shell, 724 stars, 136 forks

  2. United-Perception Public

    United Perception

    Python, 423 stars, 65 forks

  3. NNLQP Public

    Python, 32 stars, 3 forks

  4. Dipoorlet Public

    Offline quantization tools for deployment.

    Python, 102 stars, 13 forks

  5. lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python, 1.9k stars, 165 forks

Repositories

Showing 10 of 34 repositories
  • statecs Public
    Rust 1 Apache-2.0 1 0 0 Updated May 10, 2024
  • lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python 1,856 Apache-2.0 165 50 4 Updated May 10, 2024
  • mtc-token-healing Public

    Token healing implementation in Rust

    Rust 0 Apache-2.0 0 0 1 Updated May 10, 2024
  • llmc Public

    llmc is an efficient LLM compression tool with various advanced compression methods, supporting multiple inference backends.

    Python 56 Apache-2.0 4 0 0 Updated Apr 30, 2024
  • Python 10 Apache-2.0 0 1 0 Updated Apr 27, 2024
  • general-sam Public

    A general suffix automaton implementation in Rust with Python bindings

    Rust 2 Apache-2.0 0 0 0 Updated Apr 25, 2024
  • MQBench Public

    Model Quantization Benchmark

    Shell 724 Apache-2.0 136 2 5 Updated Apr 24, 2024
  • general-sam-py Public

    Python bindings for general-sam and some utilities

    Python 1 Apache-2.0 0 0 3 Updated Apr 22, 2024
  • TFMQ-DM Public

    [CVPR 2024 Highlight] TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models

    Jupyter Notebook 21 Apache-2.0 3 0 0 Updated Apr 11, 2024
  • DeepSpeed Public Forked from microsoft/DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    Python 0 Apache-2.0 3,999 0 0 Updated Mar 28, 2024
