olllayor/whisper-terminal

Speech Recognition and OpenAI Whisper

This Python project demonstrates speech recognition with the speech_recognition library, using OpenAI's Whisper model to transcribe the recorded speech.

Installation

  1. Clone the repository or download the project files.
  2. Create a virtual environment (optional but recommended):
    python -m venv venv
    source venv/bin/activate  # On Windows, use: venv\Scripts\activate
  3. Install the required packages:
    pip install -r requirements.txt
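After installing, it can help to confirm that the packages are importable from the active environment. The contents of requirements.txt are not shown here, so the module names below (speech_recognition, whisper, pyaudio) are assumptions about what the project needs, and missing_modules is a hypothetical helper rather than part of the project. A minimal sketch:

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Assumed module names; adjust them to match the actual requirements.txt.
ASSUMED_MODULES = ["speech_recognition", "whisper", "pyaudio"]

missing = missing_modules(ASSUMED_MODULES)
if missing:
    print("Not importable:", ", ".join(missing))
else:
    print("All assumed dependencies are importable.")
```

If a module is reported as not importable, check that the virtual environment is activated and rerun the pip install step.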

Usage

  1. Ensure you have a microphone connected to your system.
  2. Run the speech.py script:
    python speech.py
    The script listens for speech input for up to 120 seconds, transcribes it with OpenAI's Whisper model, and prints the transcribed text.
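The flow described above can be sketched roughly as follows. This is not the actual content of speech.py, only an illustration of one way to implement it with the speech_recognition library. The function name record_and_transcribe, the "base" model size, and the one-second ambient-noise calibration are assumptions. The 120-second limit and the audio.wav file name come from this README.

```python
def record_and_transcribe(max_seconds=120, wav_path="audio.wav", model="base"):
    """Record from the default microphone, save the audio, and return Whisper's transcript."""
    # Imported lazily so the function can be defined without the package installed.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        # Assumption: calibrate briefly against background noise before listening.
        recognizer.adjust_for_ambient_noise(source, duration=1)
        # Stop recording after at most max_seconds of speech.
        audio = recognizer.listen(source, phrase_time_limit=max_seconds)

    # Save the recording to the temporary WAV file.
    with open(wav_path, "wb") as wav_file:
        wav_file.write(audio.get_wav_data())

    # Transcribe locally with Whisper; the model size is an assumed default.
    return recognizer.recognize_whisper(audio, model=model)
```

Calling print(record_and_transcribe()) would then record, transcribe, and display the text. Larger Whisper models are generally more accurate but slower to load and run.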

File Structure

  • speech.py: Main Python script for speech recognition and transcription using the Whisper model.
  • requirements.txt: Contains the required Python packages and their versions.
  • audio.wav: Temporary file where the recorded audio is saved.

Contributing

Contributions are welcome! Feel free to submit issues or pull requests.

License

This project is licensed under the MIT License - see the LICENSE file for details.
