Local chunk error when listing/reading files after restarting Docker Desktop #8142

Open
smalyala opened this issue Sep 2, 2022 · 1 comment

smalyala commented Sep 2, 2022

What happened?:
When attempting to read data from Pachyderm after quitting and reopening Docker Desktop, I get a chunk error. This occurs both when listing files in a repo and when getting a file from a repo:

❯ pachctl list repo
NAME  CREATED    SIZE (MASTER) DESCRIPTION
repo2 2 days ago ≤ 0B
repo1 3 days ago ≤ 10.09KiB
❯ pachctl list file repo2@dev
local://.pach/ does not contain item: (chunk/2731330a0cc4b75b8ff71d8e4b484e3665e6a80663b5faa61883b5499f24b549.000000000000000f)
❯ pachctl get file repo2@dev:/file4
couldn't download /file4 from repo2@dev=: local://.pach/ does not contain item: (chunk/2731330a0cc4b75b8ff71d8e4b484e3665e6a80663b5faa61883b5499f24b549.000000000000000f)

What you expected to happen?:
To successfully read data from Pachyderm.

How to reproduce it (as minimally and precisely as possible)?:
Create a repo and add files to it. Quit Docker Desktop. Open Docker Desktop. Attempt to list or read a file from the repo.
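
The steps above can be sketched as two shell functions, one to run before restarting Docker Desktop and one after. This is only a sketch under assumptions: the function names, the sample file contents, and the defaults (`repo2`, `dev`, `/file4`, taken from the output above) are illustrative, and the restart itself is left as a manual step between the two calls.

```shell
# Hypothetical helper: create a repo, add one file, and confirm it is listed.
# Run this BEFORE quitting Docker Desktop.
repro_before_restart() {
  local repo="${1:-repo2}" branch="${2:-dev}" tmp rc
  tmp="$(mktemp)" || return 1
  echo "sample contents" > "$tmp"   # assumed contents, not from the report

  pachctl create repo "$repo" &&
    pachctl put file "${repo}@${branch}:/file4" -f "$tmp" &&
    pachctl list file "${repo}@${branch}"
  rc=$?

  rm -f "$tmp"
  return "$rc"
}

# Hypothetical helper: repeat the reads that fail in the report.
# Run this AFTER Docker Desktop and its Kubernetes cluster are back up.
repro_after_restart() {
  local repo="${1:-repo2}" branch="${2:-dev}"
  pachctl list file "${repo}@${branch}"
  pachctl get file "${repo}@${branch}:/file4"
}
```

Usage: call `repro_before_restart`, quit and reopen Docker Desktop, wait for Kubernetes to report ready, then call `repro_after_restart`. If the bug is present, the second call should fail with the "does not contain item" chunk error.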

Anything else we need to know?:

Environment?:

  • Kubernetes version (use kubectl version): 1.24.2
  • Pachyderm CLI and pachd server version (use pachctl version): 2.2.7
  • Cloud provider (e.g. aws, azure, gke) or local deployment (e.g. minikube vs dockerized k8s): Local (Docker Desktop)
  • If you deployed with helm, the values you used (helm get values pachyderm): default values + local install
  • OS (e.g. from /etc/os-release): macOS 12.5
  • Others: Docker Desktop 4.11.1, Docker Engine 20.10.17

smalyala added the bug label Sep 2, 2022

jrockway commented Sep 2, 2022

Brody and I discussed this a little bit. With deployTarget=LOCAL, we use a hostPath for chunks rather than anything that pretends to be persistent, and yeah, your chunk storage is going away. I'd deploy Minio and use that if you want something that works right now. I think we'll change the chart to provision a PV for local cases. (There might be some history behind this where some tests could run on your workstation when the k8s pachd's files were shared with it, but CI isn't using that; it deploys minio and eschews pachd.storage=LOCAL.)

A template of values that I personally recommend is here: https://github.com/pachyderm/jonathan-install-scripts/blob/main/values.yaml and the minio manifest that CI uses is here: https://github.com/pachyderm/pachyderm/blob/master/etc/testing/minio.yaml
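
One way to check whether a local install is affected is to list the volumes on the pachd pod and see which ones are backed by a hostPath. The following is a minimal sketch, not something from the thread: the function name is hypothetical, and both the `app=pachd` label selector and the `default` namespace are assumptions about how the chart labels and places the pod, so adjust them to match your deployment.

```shell
# Hypothetical helper: print each volume name on the pachd pod(s), followed by
# its hostPath (blank when the volume is not a hostPath volume).
# Assumes the pods carry the label app=pachd; pass a namespace as $1 if needed.
show_pachd_volumes() {
  local ns="${1:-default}"
  kubectl get pods -n "$ns" -l app=pachd \
    -o jsonpath='{range .items[*].spec.volumes[*]}{.name}{"\t"}{.hostPath.path}{"\n"}{end}'
}
```

Usage: `show_pachd_volumes` (or `show_pachd_volumes my-namespace`). A non-empty path next to the storage volume suggests chunks are being written to a hostPath, which is the situation described above.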
