pos-tags, "muss" #2

mxi-hug · 2018-12-06T09:07:36Z

small observation on pos-tags:

count("GERMAPARL", query = '"muss"', cqp = T, breakdown = T, p_attribute = "pos")

results in ~30k pos = "NN", which is about 20% of all hits.

As I'm not familiar with the pos-tagger, i've no idea, whether it is possible or feasible to optimize the results...

The text was updated successfully, but these errors were encountered:

ablaette · 2021-02-22T21:27:23Z

This issue is old ... but still relevant. I just inspected the the NN-tagged "muss" matches using this snippet:

k <- kwic("GERMAPARL", query = '[word = "muss" & pos = "NN"]', cqp = T)

I would hope that our pos tagger performed better than that. We have started to use StanfordNLP - and need to inspect these POS tags.

ablaette closed this as completed Feb 22, 2021

ablaette reopened this Feb 22, 2021

Provide feedback