Virtual-Voice-Assisant

Model uses

This project uses three different technologies to transcribe speech to text, search text query using GPT and give out output and then convert the output to speech:

Whisper: Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

CHATGPT: GPT-3 is a state-of-the-art language model developed by OpenAI.

gTTS (Google Text-to-Speech) library gTTS is a Python library and CLI tool that uses Google Text-to-Speech API to convert text to speech. This library supports several languages and offers various customization options such as voice, speed, and pitch. For this project, we used gTTS to convert the transcribed text to speech and saved the audio as an MP3 file.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
assistant.ipynb		assistant.ipynb
gpt3+whisper+gtts.py		gpt3+whisper+gtts.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

assistant.ipynb

assistant.ipynb

gpt3+whisper+gtts.py

gpt3+whisper+gtts.py

Repository files navigation

Virtual-Voice-Assisant

About

Releases

Packages

Languages

IanDublew/Virtual-Voice-Assisant

Folders and files

Latest commit

History

Repository files navigation

Virtual-Voice-Assisant

About

Topics

Resources

Stars

Watchers

Forks

Languages