Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding New Features to LLMUnity #149

Open
TKTSWalker opened this issue May 12, 2024 · 2 comments
Open

Adding New Features to LLMUnity #149

TKTSWalker opened this issue May 12, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@TKTSWalker
Copy link

Describe the feature

I made a few features I am going to open source during the week for LLMUnity under the name Project Replicant, this includes

  • A system which automatically saves and load conversations in ChatML format
  • A way to easily turn a folder of chatml info into the AI
  • A shogtounge encoder to compress text (Still WIP)
  • RVC based voice output (WIP)
  • Whisper based voice input

I also wanted to know if you might add support for multimodal LMs like

https://huggingface.co/NousResearch/Obsidian-3B-V0.5

I'm willing to actively help with development to push LLMUnity further, regardless of if this is a near, far or not plan at all!

@TKTSWalker TKTSWalker added the enhancement New feature or request label May 12, 2024
@amakropoulos
Copy link
Collaborator

Sounds amazing, looking forward to seeing your work!
Feel free to also create a PR at any point to add anything missing from LLMUnity.
How would you envision the support of multimodal LLMs?
For instance having functionality to input/output images?

@TKTSWalker
Copy link
Author

Thank you! As we speak i'm working on finishing the Shogtounge encoder!

I'm hoping that something like (https://huggingface.co/nisten/obsidian-3b-multimodal-q6-gguf) to run locally!.
For now it would start with text + image outputs (With the image having the options as being sent as a path or bytes) with the output being text and a texture. From there it would move to audio and video as well with better texture!

Personally, i'm interested in also seeing if pose animation is possible too (although at a later date)!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants