Skip to content
@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned

  1. streaming-llm streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python 6.2k 349

  2. smoothquant smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python 1k 110

  3. llm-awq llm-awq Public

    AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 1.8k 125

  4. bevfusion bevfusion Public

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python 2k 364

  5. once-for-all once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python 1.8k 332

  6. temporal-shift-module temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python 2k 417

Repositories

Showing 10 of 48 repositories
  • llm-awq Public

    AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 1,804 MIT 125 95 5 Updated Apr 29, 2024
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    Jupyter Notebook 1,191 MIT 168 53 (5 issues need help) 4 Updated Apr 28, 2024
  • smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python 1,020 MIT 110 53 1 Updated Apr 29, 2024
  • efficientvit Public

    EfficientViT is a new family of vision models for efficient high-resolution vision.

    Python 1,321 Apache-2.0 119 65 0 Updated Apr 27, 2024
  • TinyChatEngine Public

    TinyChatEngine: On-Device LLM Inference Library

    C++ 541 MIT 52 23 2 Updated Apr 26, 2024
  • distrifuser Public

    [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

    Python 429 MIT 10 2 0 Updated Apr 26, 2024
  • sparsevit Public

    [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

    Python 51 Apache-2.0 2 1 0 Updated Apr 24, 2024
  • patch_conv Public

    Patch convolution to avoid large GPU memory usage of Conv2D

    Python 46 MIT 2 0 0 Updated Apr 3, 2024
  • mcunet Public

    [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning

    Python 401 MIT 77 20 2 Updated Mar 29, 2024
  • tinyengine Public

    [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory

    C 738 MIT 125 31 1 Updated Mar 29, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…