Ollama API: A UI and Backend Server to interact with Ollama and Stable Diffusion

Ollama is fantastic software that lets you get open-source LLM models up and running quickly. Paired with Stable Diffusion, this repository is the quickest way to chat with multiple LLMs, generate images, and perform VLM analysis. The compiled code deploys a Flask server on your choice of hardware.

It would be great if you could support the project and give it a ⭐️.

[demo]

Roadmap

  • Apache Support: Support for a production service API using WSGI
  • RESTful Support: A quick RESTful deployment to query your favorite models with ease
  • Docker: Seamless, simple deployment using Docker
  • API Route Documentation: Documentation for building your own interfaces and interactions with the backend service

How to Run

  1. Complete all the prerequisite steps
  2. Run the program: python3 app.py
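
Once the server is up, you can sanity-check it from another terminal. A minimal check, assuming the Flask server listens on the default port 5000 (the same port opened in the prerequisites):

# Confirm the Flask server is responding on port 5000
curl -i http://localhost:5000/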

[run]

Hardware Specs

Ensure that your machine meets the following specifications:

  1. Ubuntu Linux or macOS (Windows is not supported)
  2. 32 GB of RAM
  3. 6 CPUs/vCPUs
  4. 50 GB of Storage
  5. NVIDIA GPU
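
On Linux, you can verify that a machine meets these specs with a few standard commands:

# Check RAM, CPU count, free storage, and GPU visibility
free -h
nproc
df -h /
nvidia-smi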

Prerequisites

  1. To run Ollama together with Stable Diffusion models, you must create a read-only HuggingFace API key: Creation of API Key
  2. After generating an API key, edit the config.json file located at ./app/config.json, as sketched below.
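
For illustration only, a minimal sketch of what that file might contain; the field name is an assumption, so check the repository's actual config.json for the real schema:

# Hypothetical field name; verify against the real ./app/config.json
# (this overwrites the file; edit it in place if it already holds other settings)
cat > ./app/config.json <<'EOF'
{
  "hf_api_key": "hf_your_read_only_key_here"
}
EOF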

  3. Install necessary dependencies and requirements:

# Update your machine (Linux only)
sudo apt-get update
# Install pip
sudo apt-get install python3-pip
# Navigate to the directory containing requirements.txt
cd ./app
# Install the Python dependencies
pip install -r requirements.txt

# Open port 5000 for the Flask server (ufw)
sudo ufw allow 5000
sudo ufw status

# Install/update NVIDIA CUDA drivers
sudo apt-get install -y cuda-drivers

Front End Features

  • Dynamic Model Selection: Users can select from a range of installed language models to interact with.
  • Installation Management: Users can install or uninstall models by dragging them between lists.
  • Chat Interface: Interactive chat area for users to communicate with the chosen language model.
  • Text-to-Image Generation: Sends requests to a Stable Diffusion endpoint for text-to-image creation.
  • Image Uploads for LLaVA: Allows image uploads when interacting with the LLaVA model.

Frontend

  • HTML: templates/index.html provides the structure of the chat interface and model management area.
  • JavaScript: static/js/script.js contains all the interactive logic, including event listeners, fetch requests, and functions for managing models.
  • CSS: static/css/style.css presents the styling for the web interface.

Proxy-Backend

  • Python with Flask: main.py acts as the server, handling the various API endpoints, forwarding requests to the VALDI endpoint, and serving the frontend files. Although it is written in Python, it behaves more like a frontend file than a backend, similar to Cloud Functions on Firebase: it functions as a serverless-style endpoint, but is really a proxy to your real backend.

API Endpoints

These routes interact directly with the Backend Server hosted on VALDI.

  • /: Serves the main chat interface.
  • /api/chat: Handles chat messages sent to different language models.
  • /api/llava: Specialized chat handler for the LLaVA model that includes image data.
  • /txt2img: Endpoint for handling text-to-image generation requests.
  • /list-models: Returns the list of available models installed on the server.
  • /install-model: Installs a given model.
  • /uninstall-model: Uninstalls a given model.
  • /install: Endpoint used for initial setup, installing necessary components.
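
For illustration, here are sketches of what requests to these routes might look like. The JSON field names are assumptions rather than a documented schema; check main.py for the real request shapes:

# Hypothetical payloads; field names are assumptions, see main.py for the real schema
curl -X POST http://localhost:5000/api/chat \
  -H "Content-Type: application/json" \
  -d '{"model": "llama2", "prompt": "Hello!"}'

# List the models installed on the server
curl http://localhost:5000/list-models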

Credits ✨

This project would not be possible without continuous contributions from the Open Source Community.

Ollama

Ollama Github

Ollama Website

@cantrell

Cantrell Github

Stable Diffusion API Server

Valdi

Our preferred HPC partner 🖥️

Valdi

Support us

Replit

Our preferred IDE and deployment platform 🚀

Replit


Created by Dublit - Delivering Ollama to the masses

Troubleshooting

Our preferred HPC provider is Valdi. We access machines securely by generating a private and public SSH key pair. You will need to ensure the permissions are correct before accessing any machine.

chmod 600 Ojama.pem
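
With the permissions fixed, you can connect over SSH as usual; the username and host below are placeholders for your machine's actual login details:

# Placeholder username and host; substitute your machine's real details
ssh -i Ojama.pem ubuntu@your-machine-ip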

Python versions

We support Python versions 3.8 and above; however, the code runs more efficiently on recent stable versions of Python such as 3.10 or 3.11. Here is a helpful guide on updating your Python version:

https://cloudbytes.dev/snippets/upgrade-python-to-latest-version-on-ubuntu-linux
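
As a quick sketch, one common way to get a newer Python on Ubuntu is the deadsnakes PPA (the guide above walks through this in more detail):

# Check the current version first
python3 --version
# Install Python 3.11 from the deadsnakes PPA (Ubuntu)
sudo add-apt-repository ppa:deadsnakes/ppa
sudo apt-get update
sudo apt-get install -y python3.11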