Unable to use StorageManager to cache files on NFS storage #1237

Open
mkurczew opened this issue Mar 29, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@mkurczew

Describe the bug

I am unable to use StorageManager to download and cache data from mounted NFS storage.

My use case:
I have a lot of data stored on fairly slow NFS storage mounted under /mnt/xyz.
I keep the datasets on NFS because I manage them with our in-house tools and need them to be accessible on a per-file basis (I can't use ClearML Datasets because it stores files in chunks).

I would like to leverage local dataset caching by using StorageManager.download_folder. However, it doesn't seem to download anything, even though it returns the path to the local cache where the files should have been downloaded.

When I use StorageManager.download_files(), it just returns the NFS path back, because it treats the files as local and skips the download.

To reproduce

  1. In my clearml.conf, remove or comment out the line `{ url: "file://*" }  # file-urls are always directly referenced` under sdk.storage.direct_access.
  2. Open a Python terminal and try to download the directory (it is 700 MiB):
       from clearml import StorageManager
       StorageManager.download_folder("/mnt/xyz/dataset_y")
  3. download_folder() returns my local cache path `~/.clearml/cache/storage_manager/global`, but no data is there; nothing was downloaded.
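
To check step 3 independently of ClearML, the source directory and the returned cache path can be compared by file count and total size. The following is a minimal sketch using only the standard library; `summarize_tree` is a hypothetical helper name introduced here for illustration and is not part of the ClearML SDK.

```python
import os
from pathlib import Path


def summarize_tree(root):
    """Return (file_count, total_bytes) for all regular files under root.

    A missing or non-directory root yields (0, 0), which is what an
    empty or never-populated cache folder would look like.
    """
    root = Path(root).expanduser()
    count = 0
    total = 0
    if not root.is_dir():
        return count, total
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = Path(dirpath) / name
            if path.is_file():
                count += 1
                total += path.stat().st_size
    return count, total
```

Calling `summarize_tree("/mnt/xyz/dataset_y")` and `summarize_tree(...)` on the path returned by download_folder() should give matching numbers if the folder had actually been cached.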

Expected behaviour

I expected the files to be copied from NFS share and locally cached.
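
For illustration, the expected behaviour could be approximated outside of ClearML roughly as follows. This is only a sketch of a possible interim workaround under stated assumptions, not how StorageManager works internally: `cache_folder_locally` and its freshness rule (same size, cached copy not older than the source) are hypothetical choices made here, and the cache location is whatever directory the caller passes in.

```python
import shutil
from pathlib import Path


def cache_folder_locally(source_dir, cache_root):
    """Copy source_dir into cache_root/<source name>, skipping cached files.

    Hypothetical helper, not a ClearML API. Empty directories are not
    recreated; only regular files are copied. Returns the local copy's path.
    """
    source = Path(source_dir).resolve()
    if not source.is_dir():
        raise NotADirectoryError(f"{source} is not a directory")
    target = Path(cache_root).expanduser() / source.name
    for src in source.rglob("*"):
        if not src.is_file():
            continue
        dst = target / src.relative_to(source)
        if dst.is_file():
            src_stat, dst_stat = src.stat(), dst.stat()
            # Assumed freshness rule: same size and cached copy not older.
            if (dst_stat.st_size == src_stat.st_size
                    and dst_stat.st_mtime >= src_stat.st_mtime):
                continue
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(src, dst)  # copy2 also preserves the modification time
    return target
```

With the paths from this report, a call might look like `cache_folder_locally("/mnt/xyz/dataset_y", "~/.clearml/cache/nfs_copy")`, where the second path is an arbitrary example directory. The first call pays the full NFS read cost; later calls only stat the files and copy those that changed.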

Environment

  • Server type - self hosted
  • ClearML SDK Version - 1.14.4
  • ClearML Server Version - 1.15.0-472
  • Python Version - 3.11.8
  • OS - Linux (Ubuntu 22.04)


@mkurczew added the bug label on Mar 29, 2024
@ainoam
Collaborator

ainoam commented Apr 1, 2024

@mkurczew Thanks for pointing this out - We'll take a look.
