
[QUESTION] Disambiguation using unfactored bert model does not yield same results as using the Camelira Web Interface #130

Open
amsu2 opened this issue Dec 16, 2023 · 1 comment

amsu2 commented Dec 16, 2023

I installed the project and completed the full setup.

I used the example code from https://camel-tools.readthedocs.io/en/stable/api/disambig/bert.html#examples.

I tried out various input sentences. In almost every sentence, and particularly on verbs, the last letter is left without diacritization.

More importantly, every so often a word gets disambiguated completely differently from what the Camelira website produces, and the weightings are also different.

Example:
Input: وهي مدرسة
Output: وَهِيَ مَدْرَسَةٌ
Camelira Website Output: وَهِيَ مُدَرِّسَةٌ

For some words, not only are the weightings different (or the choice between two results that both score 1.0), but the analysis itself is completely different.
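The tie situation is easy to reproduce in isolation. Below is a minimal sketch using mock (score, diac) pairs in place of the real ScoredAnalysis objects that the BERT disambiguator returns (the mock scores and the helper function are my own illustration, not camel-tools API); it shows how a naive argmax silently hides the second reading when both analyses score 1.0:

```python
def top_diac(scored_analyses):
    """Return the diacritized form with the highest score.

    `scored_analyses` is a list of (score, diac) tuples standing in for
    the (score, analysis) pairs the disambiguator returns per word.
    """
    best = max(scored_analyses, key=lambda sa: sa[0])
    return best[1]

# Mock scores for the ambiguous word مدرسة:
# madrasah 'school' vs. mudarrisah 'teacher' (real scores come from the model).
mock = [(1.0, 'مَدْرَسَةٌ'), (1.0, 'مُدَرِّسَةٌ')]

# max() resolves ties to the first element, so the second 1.0 reading
# is never surfaced unless you inspect the full list.
print(top_diac(mock))
```

If the web interface breaks such ties differently (or re-ranks the full analysis list), that alone could explain some of the divergence I see.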

Example:
Input: مهمة
Output: مَهَمَّةً
Camelira Website Output: 30 variants of مُهِمَّةٌ; mahammah is not among them even once.

Thanks in advance for your help. I'm a CS student and have been interested in linguistics and Arabic for a few years now; I'm a big fan of your work. This would really help me.

Environment: Windows 10, Python 3.9

@Hamed1Hamed
I have the same issue!
