xtts_live

This project aims to provide a simple wrapper around XTTS-v2 that allows for low-latency streaming output.

Requirements

numpy
librosa
TTS
An audio stream backend such as pyaudio or sounddevice

Getting The Model

If you do not already have the xtts_v2 model, you will need to download it. Follow the instructions at https://huggingface.co/coqui/XTTS-v2, then specify the path to the model when running the script.
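Before pointing the wrapper at a download, it can help to confirm the files are actually in place. The following is only a sketch: it assumes the model path is a directory, and the file names in EXPECTED_FILES (config.json, model.pth, vocab.json) are an assumption about what the download contains, so check them against your own copy. The helper name missing_model_files is hypothetical and not part of xtts_live.

```python
from pathlib import Path

# Assumed file names; verify against the files you actually downloaded.
EXPECTED_FILES = ("config.json", "model.pth", "vocab.json")


def missing_model_files(model_path, expected=EXPECTED_FILES):
    """Return the expected file names that are absent from model_path."""
    model_dir = Path(model_path)
    return [name for name in expected if not (model_dir / name).is_file()]
```

An empty list means every expected file was found; anything else names what still needs to be downloaded.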

Usage

# Import the wrapper
from xtts_live import TextToSpeech

# Initialize an instance of the TextToSpeech class
# (model_path and speaker_wavs must be defined by the caller)
tts = TextToSpeech(model_path, speaker_wavs)

# Add text to the processing queue
tts.speak("Text to be spoken.")

# Read frames from the audio buffer
num_samples = 1024  # number of samples to retrieve (example value)
samples = tts.audio_buffer.get_samples(num_samples)

# Clean up the tts buffers and threads
tts.stop()

See demo.py for example stream setup and integration.
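
To illustrate the read step, here is a rough sketch of how a pull-style sample buffer might serve an audio callback. It is not the xtts_live implementation: SampleBuffer, add_samples, and fill_output_block are hypothetical names, and padding with silence when the buffer runs dry is an assumed behavior chosen so that the output stream never stalls while synthesis catches up.

```python
import threading
from collections import deque


class SampleBuffer:
    """Hypothetical stand-in for an audio buffer; not the xtts_live class."""

    def __init__(self):
        self._samples = deque()
        self._lock = threading.Lock()

    def add_samples(self, samples):
        # Producer side: a synthesis thread appends samples as they are generated.
        with self._lock:
            self._samples.extend(samples)

    def get_samples(self, num_samples):
        # Consumer side: always return exactly num_samples values,
        # padding with silence (0.0) if fewer samples are available (assumed behavior).
        with self._lock:
            available = min(num_samples, len(self._samples))
            out = [self._samples.popleft() for _ in range(available)]
        out.extend([0.0] * (num_samples - available))
        return out


def fill_output_block(buffer, block_size):
    """The work an audio callback would do for each output block."""
    return buffer.get_samples(block_size)


buffer = SampleBuffer()
buffer.add_samples([0.1, 0.2, 0.3])
first = fill_output_block(buffer, 2)   # two real samples
second = fill_output_block(buffer, 2)  # one real sample, then silence
```

In a real stream, the backend (pyaudio or sounddevice) would call something like fill_output_block once per block and copy the result into its output array.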

Jcwscience/xtts_live
