Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ResilentSparqlQueryModuleTest gets stuck #287

Open
KonradHoeffner opened this issue Mar 8, 2022 · 4 comments
Open

ResilentSparqlQueryModuleTest gets stuck #287

KonradHoeffner opened this issue Mar 8, 2022 · 4 comments

Comments

@KonradHoeffner
Copy link
Contributor

After several minutes, there is no progress on

[INFO] Running org.aksw.limes.core.io.query.ResilientSparqlQueryModuleTest

Tested on the current master branch with fork count 1.

@Aklakan
Copy link
Contributor

Aklakan commented Mar 8, 2022

Oh my, there is a good chance that I already fixed that issue with nondeterministic hangs due to bugs in the range cache. The irony of having made the resilient module brittle. I'll check today.

@KonradHoeffner
Copy link
Contributor Author

It seems to hang every time we tried it however, not sure if that is another issue in addition to nondeterministic problems.

@Aklakan
Copy link
Contributor

Aklakan commented Mar 8, 2022

So there are multiple issues here:

  • The unit test queries a remote endpoint (dbpedia) - instead, there should be a test dataset that gets loaded into an RDF file on the test classpath
  • It seems that DBpedia was recently updated to a buggy version - I am pretty sure that this query worked last week but right now it gives Virtuoso 22023 Error SR350: TOP parameter < 0 .
    (Last time I checked 4000 was > 0 - but maybe I am just not up to date with the "math is racist" debate XD)
  • My slightly hasty release of the range cache doesn't have error handling completed when the backend fails - right now it causes the cache read to wait for data which due to the failing backend and the consequently exceptionally terminated worker thread never arrives.

@Aklakan
Copy link
Contributor

Aklakan commented Mar 8, 2022

So I'll change the test case to use local data and add an version of the range cache with improved error handling (Actually maybe its not only the range cache causing the hang but the retry-system which retries too often / indefinitely) - the server returns HTTP 500 so we'd just assume some temporary server outage - without classifying the response we would not expect a problem with the query.
In any case, a local dataset would make the test work again.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants