Horace: LLM Chatbot Server with ChatGPT Plugins

Important: The war in Ukraine is still ongoing. Every day, Russian soldiers rape, murder, torture and deport Ukrainian civilians. Visit this page to see how you can support Ukraine today.

Horace lets you implement a custom chatbot with your choice of an LLM and a set of ChatGPT plugins. It runs as a WebSocket server, allowing you to easily add an LLM-powered chatbot to your web or mobile app.

Features:

WebSocket chatbot server
Docker-friendly
Sample web and CLI clients provided
Pluggable LLM backends (currently OpenAI API with ChatGPT or GPT-4)
Out-of-the-box support for ChatGPT plugins
ChatGPT plugin auth methods supported: none, user_http, service_http
Request validation for the plugins according to their OpenAPI specifications
Prompt customization for fine-grained instructions.

Horace builds upon and extends GRACE, my original LLM-powered chatbot. Here is Horace running in debug mode with the Klarna ChatGPT plugin, accessed via a web client:

demo_b7G7ztbb.mp4

Quick Start

Install Python 3.10, if not already installed.
Clone this repository.
Navigate to the cloned repository directory: cd horace
Create a new virtual environment: python3 -m venv ./venv
Activate the virtual environment: source ./venv/bin/activate
Install project requirements: pip3 install -r requirements.txt
Navigate to the app directory: cd app
Start the server: OPENAI_API_KEY=openai-api-key python3 main.py (replace openai-api-key with your OpenAI API key - get it here)
Launch a client:

For the CLI client, run this in another terminal window:

cd horace
source ./venv/bin/activate
python3 app/horace-cli.py

For the web client, double-click client-demo/index.html in Explorer/Finder etc. to open it in your browser.

Running via Docker

Clone this repository.
Navigate to the cloned repository directory: cd horace
Build the Docker image: docker build -t horace:latest .
Start the server:

docker run --rm \
  -e OPENAI_API_KEY=openai-api-key \
  -p 8001:8001 \
  --name horace \
  horace:latest

Launch a client:

For the CLI client, run this in another terminal window: docker exec -it horace python3 /app/horace-cli.py
For the web client, double-click client-demo/index.html in Explorer/Finder etc. to open it in your browser.

Command-Line Arguments

main.py and horace-cli.py support a few command-line arguments:

python3 main.py --help
usage: main.py [-h] [--host HOST] [--port PORT] [--debug]

optional arguments:
  -h, --help   show this help message and exit
  --host HOST  bind host name
  --port PORT  bind port number
  --debug      enable debug mode

python3 app/horace-cli.py --help
usage: horace-cli.py [-h] [--host HOST] [--port PORT]

optional arguments:
  -h, --help   show this help message and exit
  --host HOST  server host name
  --port PORT  server port number

Working with the LLM Backends

At the moment, only the OpenAI API backend is supported. (I am not aware of non-OpenAI LLMs with a level of instruction-following sufficient for plugin invocation right now.) However, adding a custom LLM backend with Horace is quite straightforward:

Inherit a new backend class from the Backend base located in backends/backend.py and implement the complete() method
Place the new class module in the backends directory
Add an import of the new module to main.py
Adjust the BACKENDS mapping in main.py to include your new backend's alias and class name
Switch to the new backend in config.yaml:

backend:
  name: my_llm_backend

Refer to the OpenAI API backend (backends/openai_backend.py) as an example.

Model Switching with the OpenAI API Backend

The OpenAI API backend lets you switch between the following models:

ChatGPT (default): A cheap-to-access model that works reasonably well.
GPT-4: Much more expensive than ChatGPT but should be more bullet-proof.

See config.yaml for switching between the OpenAI API models.

Using ChatGPT Plugins

Horace works with ChatGPT plugins out of the box. To enable a plugin, add its hostname with an optional port number to the plugins section in config.yaml:

router:
  plugins:
    # For https://github.com/artmatsak/chatgpt-todo-plugin/
    # - localhost:5002
    - www.klarna.com

Upon starting, the server requests http://[hostname]/.well-known/ai-plugin.json for each plugin and takes it from there to load them.

Horace currently supports the none, user_http and server_http auth methods for ChatGPT plugins. If an auth token is required for a plugin, Horace asks you for one during server startup. At the moment, auth tokens are saved unencrypted in .plugin_auth.json.

Providing Extra Prompt Instructions

The default LLM prompt for Horace is designed to make the bot neutral. The bot is neither limited to plugin-facilitated user requests (like a restaurant booking bot would be, for example), nor does it proactively push the plugin-enabled functionality onto the user. In other words, you can chat with the bot like you normally would with ChatGPT; if the bot feels that invoking a plugin method is needed, it will do so.

In real-world scenarios, you may want to limit the bot to a particular scope like booking a table (see GRACE's prompt for inspiration), or perhaps provide it with a unique voice/personality. To do this, you can add instructions to the LLM prompt using the extra_instructions property in config.yaml:

horace:
  # Any extra instructions to prepend the prompt with
  extra_instructions: >-
    You are a helpful and polite AI assistant.
  # extra_instructions: >-
  #   In your speech, you impersonate Jedi Master Yoda and crack jokes in response
  #   to mentions of other Star Wars characters.

Try uncommenting the Yoda block above to see how the voice of the chatbot changes accordingly. (Don't forget to restart the server after making any changes to the config.)

Running Tests

Tests have not yet been updated since forking from GRACE. To be fixed.

Name		Name	Last commit message	Last commit date
Latest commit History 166 Commits
app		app
client-demo		client-demo
tests		tests
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

app

app

client-demo

client-demo

tests

tests

.gitignore

.gitignore

Dockerfile

Dockerfile

LICENSE

LICENSE

README.md

README.md

requirements.txt

requirements.txt

Repository files navigation

Horace: LLM Chatbot Server with ChatGPT Plugins

Quick Start

Running via Docker

Command-Line Arguments

Working with the LLM Backends

Model Switching with the OpenAI API Backend

Using ChatGPT Plugins

Providing Extra Prompt Instructions

Running Tests

About

Languages

License

artmatsak/horace

Folders and files

Latest commit

History

Repository files navigation

Horace: LLM Chatbot Server with ChatGPT Plugins

Quick Start

Running via Docker

Command-Line Arguments

Working with the LLM Backends

Model Switching with the OpenAI API Backend

Using ChatGPT Plugins

Providing Extra Prompt Instructions

Running Tests

About

Topics

Resources

License

Stars

Watchers

Forks

Languages