LaVi Lab

Video-3D-LLM Public

[CVPR 2025] The code for the paper "Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding".

Python 137 10

VG-LLM Public

The code for the paper "Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors"

Jupyter Notebook 89 2

CLEVA Public

[EMNLP 2023 Demo] "CLEVA: Chinese Language Models EVAluation Platform"

Shell 63 3

NaviLLM Public

[CVPR 2024] The code for the paper "Towards Learning a Generalist Model for Embodied Navigation"

Python 45 3

AIM Public

[ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"

Python 34 2

Visual-Table Public

[EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"

Python 20 1
