@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned

  1. streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python · 6.3k stars · 355 forks

  2. smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python · 1.1k stars · 123 forks

  3. llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python · 2k stars · 140 forks

  4. bevfusion Public

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python · 2.1k stars · 377 forks

  5. once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python · 1.8k stars · 333 forks

  6. temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python · 2k stars · 417 forks

Repositories

Showing 10 of 50 repositories
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    Jupyter Notebook · 1,218 stars · MIT license · 179 forks · 61 issues (5 need help) · 12 pull requests · Updated Jun 7, 2024
  • litepose Public

    [CVPR'22] Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation

    Python · 296 stars · MIT license · 35 forks · 18 issues · 1 pull request · Updated Jun 5, 2024
  • gan-compression Public

    [CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs

    Python · 1,096 stars · 147 forks · 3 issues · 6 pull requests · Updated Jun 5, 2024
  • TinyChatEngine Public

    TinyChatEngine: On-Device LLM Inference Library

    C++ · 581 stars · MIT license · 56 forks · 24 issues · 2 pull requests · Updated Jun 5, 2024
  • llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python · 1,995 stars · MIT license · 140 forks · 108 issues · 6 pull requests · Updated Jun 5, 2024
  • efficientvit Public

    EfficientViT is a new family of vision models for efficient high-resolution vision.

    Python · 1,524 stars · Apache-2.0 license · 136 forks · 74 issues · 0 pull requests · Updated Jun 4, 2024
  • distrifuser Public

    [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

    Python · 459 stars · MIT license · 12 forks · 3 issues · 0 pull requests · Updated Jun 2, 2024
  • torchsparse Public

    [MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.

    Cuda · 1,136 stars · MIT license · 128 forks · 23 issues · 0 pull requests · Updated May 31, 2024
  • patch_conv Public

    Patch convolution to avoid large GPU memory usage of Conv2D

    Python · 63 stars · MIT license · 4 forks · 1 issue · 1 pull request · Updated May 26, 2024
  • qserve Public

    QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

    Python · 277 stars · Apache-2.0 license · 6 forks · 14 issues · 1 pull request · Updated May 14, 2024
