🎯
Focusing
PhD student @dvlab-research, CSE@CUHK. Multimodal Large Language Models
-
The Chinese University of Hong Kong
- Hong Kong SAR
- https://wcy1122.github.io/
Block or Report
Block or report wcy1122
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
dvlab-research/MGM
dvlab-research/MGM PublicOfficial repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
-
dvlab-research/LLaMA-VID
dvlab-research/LLaMA-VID PublicOfficial Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
-
dvlab-research/GroupContrast
dvlab-research/GroupContrast Public[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.