Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add other languages #34

Open
ujagaga opened this issue Aug 11, 2021 · 2 comments
Open

Add other languages #34

ujagaga opened this issue Aug 11, 2021 · 2 comments
Labels

Comments

@ujagaga
Copy link

ujagaga commented Aug 11, 2021

Hi! Would you consider adding Serbian language to the dataset? I am interesetd to contribute my voice and as many as I can gather. I suppose this would also be simpler to accomplish if we could gather audio online using an automated website.

@Jakobovski
Copy link
Owner

Why do you want to use the serbian language?

@ujagaga
Copy link
Author

ujagaga commented Aug 12, 2021

Why do you want to use the serbian language?

Because it is my native language and my older relatives do not speak english well. I intend to collect my own samples, so I just deployed a website to collect the samples in serbian. So far I shared it with a specific group of facebook friends, but soon I will ask others to join, so I hope to gather a decent sample.

https://audiosampler.herokuapp.com/

I adjusted the website code so it can be used in any language and uploaded it to github:

https://github.com/ujagaga/audioSampler

so if you reference it here, perhaps the audio repository can grow in other languages too.
The goal for me is to train a personal assistant for offline speach to text and custom command execution based on serbian language.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants