Llmon-py is a local Streamlit webui for Large Language Models.
Using the llama-cpp-python backend, it supports the GGUF format.
Inference LLMs with support for STT/TTS and function calling!
Currently using SDXL Turbo and Moondream2 for image generation/vision.
Can be deployed using ngrok in minutes. Easily modified.