autoupdate paper list
-
Updated
May 14, 2024 - Python
autoupdate paper list
Implementation for the different ML tasks on Kaggle platform with GPUs.
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Data Infrastructure for Multimodal AI: Data, models, and orchestration in a unified declarative interface.
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
React component library for crafting user-friendly and engaging conversational experiences
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.
The World's Largest Decentralized AGI Multimodal Dataset
A web UI Project In order to learn the large language model. This project includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
This is a simple application that generates scripts for the user to read. Based on the audio, the application would provide a score for their pronunciation and suggest possible methods to improve it.
[NLPCC 2024] Shared Task 10: Regulating Large Language Models
Real-Time Multimodal Pipelines for GenAI
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
A configurable engine for analysing multi-lingual and multi-modal content.
NSMusicS,Multi platform Multi mode Music Software ,Electron(Vue3+Vite+TypeScript)+.net core+AI
Add a description, image, and links to the multimodal topic page so that developers can more easily learn about it.
To associate your repository with the multimodal topic, visit your repo's landing page and select "manage topics."