Skip to content
@LaVi-Lab

LaVi Lab

We are the Language and Vision (LaVi) Lab in CSE@CUHK led by Prof. Liwei Wang.

Popular repositories Loading

  1. Video-3D-LLM Video-3D-LLM Public

    [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.

    Python 137 10

  2. VG-LLM VG-LLM Public

    The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

    Jupyter Notebook 89 2

  3. CLEVA CLEVA Public

    [EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform"

    Shell 63 3

  4. NaviLLM NaviLLM Public

    Forked from zd11024/NaviLLM

    [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'

    Python 45 3

  5. AIM AIM Public

    [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"

    Python 34 2

  6. Visual-Table Visual-Table Public

    [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"

    Python 20 1

Repositories

Showing 10 of 13 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…