Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

S3FS takes time to sync up #21

Open
2 tasks
calebwin opened this issue Aug 13, 2021 · 0 comments
Open
2 tasks

S3FS takes time to sync up #21

calebwin opened this issue Aug 13, 2021 · 0 comments
Labels
banyan-jl Concerning Banyan.jl bug Something isn't working jl-executor Concerning the Julia code executor

Comments

@calebwin
Copy link
Contributor

Sometimes S3FS takes time to sync up files or file metadata. On one occassion with the Black Scholes tests, we would have HDF5.ishdf5 cause an error about the file that we were trying to write to being invalid because the file still existed according to S3FS. The file had recently been deleted by some code that ran on the client side.

On another occasion, we got a ERROR: LoadError: ArgumentError: "/home/ec2-user/s3fs/banyan-cluster-data-pumpkincluster02-f47c1c35/iris_large.csv" is not a valid file when trying to load a file that definitely got written (by some code on the client side).

This issue could affect usage of S3FS on the client side or on the cluster. There are several action items to take:

  • Ensure that we are not configuring S3FS to allow caching.
  • Use fsync to sync up files that are written to.
@calebwin calebwin added banyan-jl Concerning Banyan.jl bug Something isn't working jl-executor Concerning the Julia code executor labels Aug 13, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
banyan-jl Concerning Banyan.jl bug Something isn't working jl-executor Concerning the Julia code executor
Projects
None yet
Development

No branches or pull requests

1 participant