Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

跑测试case的时候有bug,切词切不开 #64

Open
wq343580510 opened this issue Apr 20, 2022 · 0 comments
Open

跑测试case的时候有bug,切词切不开 #64

wq343580510 opened this issue Apr 20, 2022 · 0 comments

Comments

@wq343580510
Copy link

大部分样本是正常的,少量样本异常
输入:
lines = ["小熊公仔布娃娃抱枕女生毛绒玩具可爱礼物抱女孩公主儿童睡觉玩偶", "【二静姐专享】小黄鸭男女童棒球服春季拼接撞色夹克韩版潮GDC016 "]

输出:
{'word': ['小熊公仔布娃娃抱枕女生毛绒玩具可爱礼物抱女孩公主', '儿童', '睡觉', '玩偶'], 'postag': ['nw', 'n', 'v', 'n'], 'head': [4, 3, 4, 0], 'deprel': ['ATT', 'SBV', 'ATT', 'HED']}
{'word': ['【', '二静', '姐', '专享', '】', '小黄鸭男女童棒球服春季拼接撞色夹克韩版潮GDC016', ' '], 'postag': ['w', 'PER', 'n', 'v', 'w', 'nw', 'w'], 'head': [4, 4, 4, 7, 4, 7, 0], 'deprel': ['MT', 'ATT', 'ATT', 'ADV', 'MT', 'SBV', 'HED']}

执行代码
ddp = DDParser(encoding_model='transformer', batch_size=64, use_cuda=False, use_pos=True)
为什么会有这么长的词?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant