Skip to content
#

gtts-api

Here are 52 public repositories matching this topic...

An open-source project that uses cutting-edge NLP models and real-time web search to provide dynamic voice query responses. Features include speech-to-text with Nemo, text generation with Mistral-7B, DuckDuckGo search integration, and text-to-speech with edge-tts, all in a user-friendly Gradio interface.

  • Updated May 24, 2024
  • Python

This project aims to assist visually impaired individuals by providing a solution to convert images into spoken language. Leveraging deep learning and natural language processing, the system processes images, generates descriptive captions, and converts these captions into audio output.

  • Updated Oct 16, 2023
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the gtts-api topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gtts-api topic, visit your repo's landing page and select "manage topics."

Learn more