Skip to content
@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned Loading

  1. streaming-llm streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python 7.2k 395

  2. llm-awq llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 3.5k 302

  3. efficientvit efficientvit Public

    Efficient vision foundation models for high-resolution generation and perception.

    Python 3.3k 235

  4. bevfusion bevfusion Public archive

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python 3k 561

  5. temporal-shift-module temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python 2.2k 425

  6. once-for-all once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python 1.9k 345

Repositories

Showing 10 of 66 repositories
  • lpd Public

    [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

    mit-han-lab/lpd’s past year of commit activity
    Python 91 MIT 7 1 0 Updated Mar 12, 2026
  • fouroversix Public

    Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”

    mit-han-lab/fouroversix’s past year of commit activity
    Python 136 MIT 11 1 0 Updated Mar 7, 2026
  • vlash Public

    Real-Time VLAs via Future-state-aware Asynchronous Inference.

    mit-han-lab/vlash’s past year of commit activity
    Python 341 Apache-2.0 20 14 1 Updated Mar 6, 2026
  • vcpo Public

    Code for the paper “Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs”

    mit-han-lab/vcpo’s past year of commit activity
    Python 14 Apache-2.0 1 0 0 Updated Mar 3, 2026
  • fastrl Public

    [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

    mit-han-lab/fastrl’s past year of commit activity
    Python 154 Apache-2.0 14 6 0 Updated Feb 27, 2026
  • foreact Public

    [CVPR 2026] ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

    mit-han-lab/foreact’s past year of commit activity
    Python 45 1 0 0 Updated Feb 26, 2026
  • Block-Sparse-Attention Public

    A sparse attention kernel supporting mix sparse patterns

    mit-han-lab/Block-Sparse-Attention’s past year of commit activity
    C++ 477 BSD-3-Clause 45 11 0 Updated Jan 18, 2026
  • flash-moba Public
    mit-han-lab/flash-moba’s past year of commit activity
    C++ 229 BSD-3-Clause 7 2 0 Updated Nov 20, 2025
  • radial-attention Public

    [NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

    mit-han-lab/radial-attention’s past year of commit activity
    Python 587 Apache-2.0 32 17 1 Updated Nov 12, 2025
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    mit-han-lab/torchquantum’s past year of commit activity
    Jupyter Notebook 1,607 MIT 245 69 (4 issues need help) 24 Updated Oct 28, 2025