Skip to content

A simple voice recognition that turns words into keystrokes

Notifications You must be signed in to change notification settings

MaxAFriedrich/deepkey

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Deepkey

This is a modified version of the example project from mozila here. This version is designed to allow you to dictate passages with automatic punctuation.

Note that this is alpha software designed as a proof of concept.

Stream from microphone to DeepSpeech, using VAD (voice activity detection). A fairly simple example demonstrating the DeepSpeech streaming API in Python. Also useful for quick, real-time testing of models and decoding parameters.

Installation

Uses portaudio for microphone access, so on Linux, you may need to install its header files to compile the pyaudio package and you need xdotool to get key inject to work.

You will so need to install the pip packages in the requirements.txt.

You also needs to download scoring and the model files from mozilla.

You can install the entire application using the following commands.

git clone https://github.com/MaxAFriedrich/deepkey
cd deepkey
sudo apt install portaudio19-dev xdotool
pip install -r requirements.txt
wget -O moz.pbmm https://github.com/mozilla/DeepSpeech/releases/download/v0.9.3/deepspeech-0.9.3-models.pbmm
wget -O moz.scorer https://github.com/mozilla/DeepSpeech/releases/download/v0.9.3/deepspeech-0.9.3-models.scorer

Usage

python deepkey.py