Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adverse effect of processors v9 on eidos #783

Open
kwalcock opened this issue Feb 13, 2024 · 5 comments
Open

Adverse effect of processors v9 on eidos #783

kwalcock opened this issue Feb 13, 2024 · 5 comments

Comments

@kwalcock
Copy link
Member

I'm not sure where (or if) these were recorded before. I'll try to get to the bottom of them here.

[info] *** 226 TESTS FAILED ***
[error] Failed tests:
[error]         org.clulab.wm.eidos.text.english.raps.TestRaps
[error]         org.clulab.wm.eidos.text.english.eval6.TestDoc5
[error]         org.clulab.wm.eidos.serialization.jsonld.TestJLDSerializer
[error]         org.clulab.wm.eidos.text.english.raps.TestRaps1
[error]         org.clulab.wm.eidos.text.english.eval6.TestDoc8
[error]         org.clulab.wm.eidos.text.englishGrounding.TestGrounding
[error]         org.clulab.wm.eidos.text.english.cag.TestCagP1
[error]         org.clulab.wm.eidos.text.english.cag.TestExtraText
[error]         org.clulab.wm.eidos.serialization.TestDocSerialization
[error]         org.clulab.wm.eidos.text.english.cag.TestCagP0
[error]         org.clulab.wm.eidos.text.englishGrounding.TestSpecificGroundings
[error]         org.clulab.wm.eidos.utils.TestLauncher
[error]         org.clulab.wm.eidos.text.english.eval6.TestDoc2
[error]         org.clulab.wm.eidos.text.english.cag.TestCagP4
[error]         org.clulab.wm.eidos.system.TestCrLf
[error]         org.clulab.wm.eidos.serialization.jsonld.TestJLDDeserializer
[error]         org.clulab.wm.eidos.text.english.eval6.TestDoc3
[error]         org.clulab.wm.eidos.text.english.eval6.TestDoc6
[error]         org.clulab.wm.eidos.text.english.cag.TestCagP3
[error]         org.clulab.wm.eidos.text.english.eval6.TestDoc1
[error]         org.clulab.wm.eidos.text.english.eval6.TestDoc4
[error]         org.clulab.wm.eidos.text.englishGrounding.TestGrounderStability
[error]         org.clulab.wm.eidos.system.TestEidosMention
[error]         org.clulab.wm.eidos.text.english.cag.TestCagP2
[error]         org.clulab.wm.eidos.document.TestSentenceClassifier
[error]         org.clulab.wm.eidos.text.english.eval6.TestDoc7
[error] (Test / test) sbt.TestsFailedException: Tests unsuccessful
[error] Total time: 2235 s (37:15), completed Feb 13, 2024 10:09:21 AM
sbt:eidos>
@MihaiSurdeanu
Copy link
Contributor

Thank you @kwalcock !!

@kwalcock
Copy link
Member Author

@MihaiSurdeanu, TestJLDSerializer is failing because one date does not get turned into an attachment. This seems to be because an entity in a sentence is expected to be DATE in eidos and it was so using the old version of processors, but it is B-DATE in the new version. Does this ring any bells?

@MihaiSurdeanu
Copy link
Contributor

Ah, I see. This happens because we use the BIO notation for named and numeric entities, whereas CoreNLP does not.
This is a small change that does not matter, so I think we should adjust the unit tests!

@kwalcock
Copy link
Member Author

It doesn't matter much, but the particular tests would be difficult to change. Instead, for now I've converted B-DATE and I-DATE to DATE and errors for two unit tests went away.

The next problem is that eidos is seeing empty strings for norms where earlier it had seen O. I'm patching that up as well. Is it an expected change?

@MihaiSurdeanu
Copy link
Contributor

No, that's another instance of me forgetting what I did before :)

I'm now thinking perhaps it's simpler to path things up directly in processors, to:

  • Remove B- and I- from labels;
  • Add "O" for empty norms.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants