Research assistant in SUSTech. My current focus is vision-language learning.
I am looking for a PhD position in multimodal learning currently,
-
SUSTech & XMU & CQU
- https://feielysia.github.io/
Block or Report
Block or report FeiElysia
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned
-
ttengwang/Caption-Anything
ttengwang/Caption-Anything PublicCaption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.