Disable source build cache on pocl/nvidia #749

Open
wants to merge 2 commits into
base: main

Conversation

matthiasdiener
Contributor

@matthiasdiener matthiasdiener commented May 7, 2024

Follow-up to #738.

@matthiasdiener matthiasdiener changed the title Disabled source build cache on pocl/nvidia Disable source build cache on pocl/nvidia May 9, 2024
@matthiasdiener matthiasdiener force-pushed the disable-build-cache-v2 branch 3 times, most recently from a226c17 to 1a9ad0d Compare May 9, 2024 23:06
@matthiasdiener
Contributor Author

1a9ad0d outlines a way to ensure pocl uses PID-local caches. This is ready for a first look @inducer @majosm

@matthiasdiener matthiasdiener marked this pull request as ready for review May 10, 2024 14:29
Comment on lines +1551 to +1555
# This must be done before PoCL is actually loaded
from pyopencl.characterize import setup_pocl_cache_dirs
setup_pocl_cache_dirs()
Owner

Could you explain why this is here? This seems to defeat inter-run caching, which, in my book, is definitely desirable.

Contributor Author

This is a workaround for the intermittent failures we have seen as part of #738.

I couldn't think of a way to ensure PID-local caches that also support inter-run caching, without resorting to using MPI rank numbers (which wouldn't even help for the CI errors seen in #738).
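
For readers following along, here is a minimal sketch of the trade-off described above. It is not the implementation in this PR (the body of `setup_pocl_cache_dirs` is not shown in this thread); the helper names, the temp-directory layout, and the optional `key` parameter are all hypothetical. The only PoCL-specific piece is the `POCL_CACHE_DIR` environment variable, which PoCL reads to locate its cache directory and which has to be set before PoCL is loaded.

```python
import os
import tempfile


def cache_dir_for(key=None, base=None):
    """Build a cache path that is private to this process (or to `key`).

    Hypothetical helper: with key=None the path depends on the PID, so it
    changes on every run and nothing is reused between runs. Passing a
    stable key (for example an MPI rank) keeps the path stable across runs.
    """
    if base is None:
        # Assumed layout; any writable directory would do.
        base = os.path.join(tempfile.gettempdir(), "pocl-cache-sketch")
    suffix = f"pid{os.getpid()}" if key is None else str(key)
    return os.path.join(base, suffix)


def setup_cache_dir_sketch(key=None, base=None):
    """Point PoCL at a private cache directory via POCL_CACHE_DIR.

    Must run before PoCL is loaded, otherwise the setting may be ignored.
    """
    path = cache_dir_for(key=key, base=base)
    # Create the directory up front so PoCL finds an existing path.
    os.makedirs(path, exist_ok=True)
    os.environ["POCL_CACHE_DIR"] = path
    return path
```

Calling `setup_cache_dir_sketch()` gives each process its own directory and therefore avoids concurrent access, at the cost of inter-run caching. Calling it with a stable key such as `"rank3"` would preserve inter-run caching, but two concurrent jobs using the same key would share a directory again, which is why it would not help with the CI failures.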

Edit: some sources regarding caching issues in pocl: pocl/pocl#430, pocl/pocl#466

Owner

Before we go in this direction, I think we should engage with the pocl folks and at least file an issue to see what they think about fixing concurrent access to their caches.

Contributor Author

Reported here: pocl/pocl#1480, including a small reproducer.

2 participants