Official code for Paper "Mantis: Multi-Image Instruction Tuning"
Tools and Statistical Procedures in Plant Science
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle enables agents to ace any computer task through strong reasoning abilities, self-improvement, and skill curation, in a standardized general environment with minimal requirements.
LLaVA inference with multiple images at once for cross-image analysis.
😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
A glossary of terms in AI and their corresponding papers.
Financial Engineering in IRFX in C++
An AI Chatbot that Interacts With the Solana Blockchain
Effective prompting for Large Multimodal Models like GPT-4 Vision, LLaVA or CogVLM. 🔥
A Streamlit web application powered by the Gemini API for question answering, chat, and image generation.
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
Use Gemini to auto-label images for use with Autodistill.