TechxGenus
🎯
Focusing
  • USTC

Highlights

  • Pro

Pinned

  1. vllm-project/vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python, 20.5k stars, 2.8k forks

  2. AutoGPTQ/AutoGPTQ Public

    An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

    Python, 4k stars, 400 forks

  3. casper-hansen/AutoAWQ Public

    AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

    Python, 1.3k stars, 145 forks

  4. Deepseek-Coder-MoE Public

    Sparse Deepseek-Coder.

    Python, 4 stars

  5. deepseek-ai/DeepSeek-VL Public

    DeepSeek-VL: Towards Real-World Vision-Language Understanding

    Python, 1.8k stars, 172 forks

  6. huggingface/transformers Public

    🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

    Python, 127k stars, 25.2k forks