Auto speech recognition realtime software #176

Open
bakustarver opened this issue Aug 15, 2023 · 1 comment
Labels
enhancement New feature or request

Comments

bakustarver commented Aug 15, 2023

What are your thoughts on real-time speech-to-text conversion as a feature in Memento?

There are relatively lightweight open-source toolkits like Kaldi, whose models use little memory when running in real time.

There's also Whisper, which is heavier and more accurate.

There are already lightweight programs that do this, such as LiveCaptions and the Vosk API.
Vosk also has a Japanese model.

There are also Google and Azure STT, but they only work online.

bakustarver changed the title from "Auto speech recognition realtime" to "Auto speech recognition realtime software" Aug 15, 2023
ripose-jp added the enhancement (New feature or request) label Aug 16, 2023
@ripose-jp
Owner

I'm not against it. I'm not too familiar with this stuff, so it will take research.
