Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing translation/voice recogniton/ability to choose language #13

Open
FliiFe opened this issue Nov 9, 2015 · 9 comments
Open

Missing translation/voice recogniton/ability to choose language #13

FliiFe opened this issue Nov 9, 2015 · 9 comments

Comments

@FliiFe
Copy link

FliiFe commented Nov 9, 2015

Hi.

I noticed the app understands the language your computer is in.
For example, if I am on a chinese computer, I will not be able to speak english if I am not chinese.
However, this is a good thing if you speak the language your computer understands.

It's not annoying once everything is reconfigured, but maybe adding some languages templates would be great.

@igorantun
Copy link

Instead of having "language templates" or specific strings for specific languages, we could use some module to translate the result of the recognition, and compare it to the english equivalent string?

Example:
The computer language is detected as Portuguese, therefore the recognition API is set to recognize pt-BR speech. User says "Meu nome é Igor", which then gets translated to "My name is Igor" and analyzed.

tl;dr: Use a module to translate the speech to en-US text

@FliiFe
Copy link
Author

FliiFe commented Nov 9, 2015

Yeah, that's a better idea, as it does not require to make an exauhstive list of supported languages. If such module can be used, it can be used to traduce output too.

Example (French talking to MySam):
User : "Je m'appelle John"
Software translates : "My name is John"
Software analyses...
Pre-Output : "Hi John."
Final output: "Bonjour John."

@daffl
Copy link
Member

daffl commented Nov 9, 2015

The problem is that the tagging mechanism when training something new won't work using translations. It also might be fairly slow.

In theory the classification should still work with other languages, there just aren't any non-english training sets yet. I can give it a shot with German and see how well it does.

There definitely should be an option to change your input language.

@igorantun
Copy link

True that. Well, it could use both. Training sets/language templates if avaiblable, and use translation as a fallback.

Yes, there should be an option to change the input language.

@FliiFe
Copy link
Author

FliiFe commented Nov 14, 2015

@daffl Why wouldn't tagging work using translation ? Mysam receives input in english, and outputs english, with some kind of wrapper that do the translation

(I'm not saying it's false, I just don't understand)

@daffl
Copy link
Member

daffl commented Nov 14, 2015

Well let's say you have a German input like

Ich heisse David

With the name tag at position 2 but it would translate to

My name is David

Now the name tag is at position 3.
There is still a lot of room for improvement for a smarter the tagging mechanism but like I mentioned I think everything may already work with other languages.

@FliiFe
Copy link
Author

FliiFe commented Nov 14, 2015

But this is if MySam takes the original input in account.
If it gets translated, and _then_ mysam analyses, It will just receive

My name is David

Won't it ?

@daffl
Copy link
Member

daffl commented Nov 14, 2015

But a user has to tag their own input when teaching (see https://youtu.be/VxFtSsCM_bo?t=1m46s). So the only way with using a translation would be to select it from the translated text.

@igorantun
Copy link

True that, makes sense.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants