Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

官方标签分布统计错误 #65

Open
heyoma opened this issue Oct 11, 2021 · 2 comments
Open

官方标签分布统计错误 #65

heyoma opened this issue Oct 11, 2021 · 2 comments

Comments

@heyoma
Copy link

heyoma commented Oct 11, 2021

当一个句子中出现多次同一词语被标注为同一实体类型时,官方只计数了一次。
比如
{"text": "两队上季曾在足总杯中相遇,纽卡客场0比0,主场4比1过关。不过纽卡本季的表现实在糟糕,", "label": {"organization": {"足总杯": [[6, 8]], "纽卡": [[13, 14], [31, 32]]}}}
官方计数为2,实际应为3

@heyoma
Copy link
Author

heyoma commented Oct 11, 2021

问题不大,只是想提醒一下。写论文或发表时引用该数字会不太严谨

@ocean33
Copy link

ocean33 commented Jan 14, 2022

我也发现了这个问题,转成我们平台的格式,发现两边统计分类数不一致,实际应为:
{'name': 3755, 'company': 2980, 'game': 2376, 'organization': 3210, 'movie': 1123, 'address': 2891, 'position': 3141, 'government': 1835, 'scene': 1521, 'book': 1139}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants