Open Translator: a speech-to-speech and speech-to-text translator with voice cloning and other features

overcrash66/OpenTranslator

It can be run in one of three modes:

Online mode (connected to the internet, making API calls to Google Translate; no API keys are needed)

Fully local mode (using local LLMs)

Hybrid mode (a mix of online services and a single local LLM)

Features

  • Translate from and to 17 languages:

    • Supported languages: English, Spanish, French, German, Dutch, Japanese, Korean, Turkish, Arabic, Russian, Hebrew, Hindi, Italian, Portuguese, Chinese, Czech and Hungarian.

Options

  • File menu options:

    • Convert Audio file to MP3

    • Extract audio from Video

    • YouTube Downloader

    • Replace Audio in Video

    • Video Text Adder

    • Voice Recorder

    • PyTranscriber (shortcut)

    • Exit
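
As an illustration of what a menu entry such as "Extract audio from Video" involves, here is a minimal sketch that builds an FFmpeg command line from Python. It is not taken from OpenTranslator's source: the function names `build_extract_audio_command` and `extract_audio` are hypothetical, and the choice of the `libmp3lame` encoder with quality level 2 is an assumption. It only relies on FFmpeg being on the PATH, as the requirements ask.

```python
import subprocess
from pathlib import Path


def build_extract_audio_command(video_path, audio_path=None):
    """Build an ffmpeg command that writes a video's audio track to an MP3 file.

    Hypothetical helper for illustration; the project may do this differently.
    """
    video = Path(video_path)
    # Default output: same name as the video, with an .mp3 extension.
    audio = Path(audio_path) if audio_path else video.with_suffix(".mp3")
    return [
        "ffmpeg",
        "-y",                      # overwrite the output file if it exists
        "-i", str(video),          # input file
        "-vn",                     # ignore the video stream
        "-codec:a", "libmp3lame",  # encode audio as MP3 (assumed encoder choice)
        "-q:a", "2",               # variable bitrate quality (assumed setting)
        str(audio),
    ]


def extract_audio(video_path, audio_path=None):
    """Run ffmpeg and return the output path; raises CalledProcessError on failure."""
    command = build_extract_audio_command(video_path, audio_path)
    subprocess.run(command, check=True)
    return command[-1]
```

Calling `extract_audio("clip.mp4")` would write `clip.mp3` next to the input. Keeping the command construction separate from the `subprocess.run` call makes it possible to inspect the command without FFmpeg installed.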

Requirements

Make sure you have the following dependencies installed:

  • Python >= 3.10
  • Pip (Python package installer)
  • FFmpeg (must be installed manually and added to the system PATH environment variable)
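
A quick way to confirm these prerequisites before installing anything is a small check script. The sketch below is not part of the repository; `check_requirements` is a hypothetical helper that only uses the Python standard library to compare the interpreter version against 3.10 and to look for `ffmpeg` on the PATH.

```python
import shutil
import sys

MIN_PYTHON = (3, 10)  # minimum version from the requirements list


def check_requirements(min_python=MIN_PYTHON, tools=("ffmpeg",)):
    """Return a list of problems found; an empty list means every check passed."""
    problems = []
    if sys.version_info[:2] < min_python:
        found = ".".join(str(part) for part in sys.version_info[:2])
        wanted = ".".join(str(part) for part in min_python)
        problems.append(f"Python {wanted} or newer is required, found {found}")
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(tool) is None:
            problems.append(f"{tool} was not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_requirements():
        print("Missing requirement:", problem)
```

If the script prints nothing, both checks passed. Pip is not checked here because it can also be invoked as `python -m pip` without being on the PATH.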

Usage

1- Clone the repository:

git clone https://github.com/overcrash66/OpenTranslator.git

2- Navigate to the project folder:

cd OpenTranslator

3- Create and activate a virtual environment (the commands below are for Windows):

py -3.10 -m venv venv
venv\Scripts\activate

4- Install the required Python packages:

To use a GPU with CUDA 11.8:

pip install torch==2.1.2+cu118 torchaudio==2.1.2+cu118 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt

Or, for the default CPU-only setup:

pip install -r requirements.txt
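
After either install path, it can be useful to check which device PyTorch will actually use. The following is a minimal sketch, not part of the project; `describe_torch_device` is a hypothetical helper, and it assumes that `torch` is among the packages installed by the commands above.

```python
import importlib.util


def describe_torch_device():
    """Report whether PyTorch is installed and whether it can see a CUDA GPU."""
    if importlib.util.find_spec("torch") is None:
        return "torch-not-installed"
    import torch  # imported lazily so the check works without torch present

    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print("PyTorch device:", describe_torch_device())
```

A result of `cpu` after the CUDA 11.8 install usually points to a driver problem or to a CPU-only wheel having been installed instead.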

5- Run the script:

python OpenTranslator.py

GUI Preview

Redesigned (Custom)

Configuration

  • You can customize the translation models and other settings by modifying the script.

License

This project is licensed under the GPL License - see the LICENSE file for details.

Acknowledgements

Special thanks to:

  • XTTS_V2
  • whisper v3 Large
  • Llama2-13b-Language-translate
  • autosub
  • gTTS