Auto speech recognition realtime software #176

Open

bakustarver opened this issue Aug 15, 2023 · 1 comment

bakustarver commented Aug 15, 2023 (edited)

What are your thoughts on real-time speech-to-text conversion as a feature in Memento?

There are relatively lightweight open-source toolkits like Kaldi, whose models use little memory when running in real time.

There's also Whisper, which is heavier and more accurate.

There are already lightweight examples, such as LiveCaptions and the Vosk API. Vosk also has a Japanese model.

There are also Google and Azure STT, but they only work online.
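
To give a rough idea of what the Vosk route involves, here is a minimal sketch using Vosk's Python bindings. It is only an illustration under assumptions: a 16-bit mono WAV file stands in for the player's audio stream, `model_dir` is a hypothetical path to an unpacked Vosk model (for example the Japanese one), and `iter_chunks` and `transcribe` are names made up for this example. How Memento would actually obtain audio is not covered here.

```python
import json
import wave


def iter_chunks(wav, frames_per_chunk=4000):
    """Yield raw PCM byte chunks from an open wave reader until it is exhausted."""
    while True:
        data = wav.readframes(frames_per_chunk)
        if not data:
            break
        yield data


def transcribe(wav_path, model_dir):
    """Feed a 16-bit mono WAV file to Vosk chunk by chunk, yielding finished utterances."""
    # Third-party import kept local so the chunking helper works without Vosk installed.
    from vosk import Model, KaldiRecognizer

    # model_dir is an assumed path to an unpacked Vosk model directory.
    model = Model(model_dir)
    with wave.open(wav_path, "rb") as wav:
        rec = KaldiRecognizer(model, wav.getframerate())
        for chunk in iter_chunks(wav):
            # AcceptWaveform returns true once Vosk detects the end of an utterance.
            if rec.AcceptWaveform(chunk):
                text = json.loads(rec.Result()).get("text", "")
                if text:
                    yield text
        # Flush whatever is still buffered after the last chunk.
        tail = json.loads(rec.FinalResult()).get("text", "")
        if tail:
            yield tail
```

Calling `for line in transcribe("clip.wav", "path/to/model"): print(line)` would print each recognized utterance as it completes. A live subtitle display would probably also poll `rec.PartialResult()` between utterances to show text while someone is still speaking.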

bakustarver changed the title ~~Auto speech recognition realtime~~ Auto speech recognition realtime software

ripose-jp added the enhancement label

Owner

ripose-jp commented Aug 16, 2023

I'm not against it. I'm not too familiar with this stuff, so it will take research.
