GePpeTto - GPT-Whisper-based Voice Assistant for Home Assistant (Experimental)

This is a small experiment to create a fully functional Voice Assistant for Home Assistant in the least time possible. This uses Porcupine's wake word engine for wake word recognition, OpenAI's Whisper for Speech-To-Text transcription, GPT-4 chat completion model and Google Cloud TTS API for answers. Most of the code, in the form of snippets that were later adapted, was also generated by asking ChatGPT.

Features:

Simple Websocket server to sync assistant state with a UI
Control lights, switches, locks, covers, climate and mediaplayers ** Simultaneous commands, granular
Generic assistant questions – including time
Text-to-speech responses with Google Cloud TTS

Not supported:

Weather
Timers

This was developed for personal use and I have no intention on making this a project for general use. I might add features for myself (like Local TTS and better dialog management) but that's only to scratch an itch out of desperation with existing data-collecting incompetent assistants from the big three.

This fetches your switches, buttons, fans, covers and mediaplayers from Home Assistant. OpenAI has a limit for 4096 tokens, so if you have lots and lots of devices you might be out of luck with this approach and perhaps you'll have to fine tune a model. If you make this run with an Stanford Alpaca local model, I'll buy you few beers!

Setup

DISCLAIMER: Tested with Python 3.10.9 - it does not work with 3.11 yet due to the dependencies

$ pip install -r requirements.txt

Create a .env file based on the .env.example file and add your API keys, Wake Word models and Home Assistant host and Long-lived Access Token

.env file:

HASS_TOKEN="" # Long-lived Access Token
HASS_HOST="" # Your Home Assistant server (with :port)
OPENAI_API_KEY=""
PORCUPINE_ACCESS_KEY=""
PORCUPINE_KEYWORD="" # keyword of your model – The name of the assistant
PORCUPINE_KEYWORD_PATH="" # full path to wake word model file – if you leave both KEYWORD AND KEYWORD_PATH will use default porcupine models and wake words
GOOGLE_APPLICATION_CREDENTIALS="" # A service account file with Cloud TTS API access
USER_NAME="Pinocchio" # How you want the assistant to call you

Then just:

$ python assistant_server.py

Notes:

If you want to record the prompts and answer for fine-tuning set the environment var GEPPETTO_TRAINING=True prior to run the script. This creates .yml files that can be processed for creating the payload for the OpenAI fine-tuning endpoint

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
sounds		sounds
training		training
.env.example		.env.example
LICENSE.md		LICENSE.md
README.md		README.md
asr.py		asr.py
assistant.py		assistant.py
assistant_server.py		assistant_server.py
example_client.py		example_client.py
gpt.py		gpt.py
porcupine.py		porcupine.py
recorder.py		recorder.py
requirements.txt		requirements.txt
soundboard.py		soundboard.py
soundboard.yml		soundboard.yml
tts.py		tts.py

License

moimart/geppetto

Folders and files

Latest commit

History

Repository files navigation

GePpeTto - GPT-Whisper-based Voice Assistant for Home Assistant (Experimental)

Features:

Not supported:

Setup

Notes:

About

Topics

Resources

License

Stars

Watchers

Forks

Languages