Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Detect wrongly-set caption language (LLM?) #5709

Open
nicolas-raoul opened this issue May 2, 2024 · 4 comments
Open

Detect wrongly-set caption language (LLM?) #5709

nicolas-raoul opened this issue May 2, 2024 · 4 comments

Comments

@nicolas-raoul
Copy link
Member

Many Commons contributors contribute in various languages, for instance in Urdu when posting a picture of a local dish then in English when posting a picture showing a technology. That's great, but they often forget to select the right language for the caption:

Screenshot_20240502-095800.png

The app should try to detect when there seems to be a language mismatch, and show a popup such as:

Your caption seems to be in Japanese, but English is declared as the caption language. Do you want to declare the caption language as being Japanese?

Implementation: Per our privacy policy we can not call third-party APIs. This task is probably not important enough to justify specific ML/LLM model training, but it is a great use case for device-embedded LLM on devices where that feature is available.

@shankarpriyank
Copy link
Contributor

The idea is great, but I dont know about any device embedded LLMs @nicolas-raoul can you point me to some LLMs that you may know about?

@nicolas-raoul
Copy link
Member Author

@shankarpriyank I think Pixel 8+ and recent Samsung devices are planned to get that soon. Hopefully the emulator will get it soon too.

https://developer.android.com/ai/aicore

https://blog.google/products/pixel/pixel-feature-drop-december-2023/

https://www.samsung.com/us/galaxy-ai/

@karyotakisg
Copy link
Contributor

We consider working on it with @vtalos. Maybe tools like Apache Tika can be suitable for language recognition. Αlternatively, there are Github repositories that offer similar functionality, like https://github.com/shuyo/language-detection or https://github.com/optimaize/language-detector. What do you think? @nicolas-raoul

@nicolas-raoul
Copy link
Member Author

@karyotakisg These projects would add weight to the APK, and as I said it is low-priority. Also, embedded LLM sounds like a fun thing to try. :-)
If anyone has a device with AiCore (I think that currently means Samsung S23 Ultra or Pixel 8 Pro) please let us know.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants
@nicolas-raoul @shankarpriyank @karyotakisg and others