Skip to content

RyanBaig/AI-Assistant

Repository files navigation

AI Assistant

LOGO

Description

It is an AI Assistant application that can recognize speech, interact with users, and provide text-to-speech responses. It uses Google's Gemini-Pro model for generating text responses.

Features

  • Speech recognition using the Google Speech Recognition API.
  • Interaction with the Falcon model for generating responses.
  • Text-to-speech conversion using the gTTS (Google Text-to-Speech) library.

Installation

Windows:

  1. Download the Packages EXE's ZIP file
  2. Unzip the file.
  3. Run the main.exe file. It will open a console/terminal window.

Linux:

  1. Clone this repository to your system.
  2. cd into the repository files.
  3. Run this:
./build/exe.linux-x86_64-3.10/main

Usage

  1. Run the unzipped executable or the main.py script.

  2. When the application is running, speak to the AI Assistant. The Assistant will send your speech to the Gemini model for generating a response.

  3. The generated text response will be displayed, and the AI Assistant will also convert it to speech and play it.

Configuration

You can customize the behavior of the AI Assistant by modifying the code in main.py and the options in setup.py.

Contributing

If you'd like to contribute to this project, please follow these steps:

  1. Fork the repository.
  2. Create a new branch for your feature or bug fix.
  3. Make your changes.
  4. Test your changes to ensure they work as expected.
  5. Submit a pull request with your changes.

License

This project is licensed under the MIT License. You are free to use, modify, and distribute it as you see fit.