Local, private voice controlled notepad using Next.js, whisper.cpp, ModelFusion, and @ricky0123/vad.
Voice activity detection (VAD) and speech-to-text (STT) are run locally on your machine. WhisperWriter always listens while it's running, and automatically transcribes what you say when it's active.
Besides transcription, you can use the following voice commands:
- "Command: clear": Clear the text in the panel
- "Command: stop": Stop transcribing
- "Command: start": Start transcribing
- Clone and compile whisper.cpp
- Start the whisper.cpp server
- Tip: you can download stronger models for better speech recognition. Usually they take longer to respond though. See the ModelFusion Whisper.cpp guide for details.
- Run
npm install
- Run
npm run build
- Run
npm start
- Go to http://localhost:3000/
Note
There is a bug that can lead to double initialize when using npm run dev
. You need to use npm run build
and npm start
instead.
Note
Transcriptions are better than what's shown in the video if you use stronger models in whisper.cpp