Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Storage: kubeflow kubernetes pvc (persistent volumes) and mounted storage in Azure #215

Open
cpereir1 opened this issue Feb 8, 2023 · 1 comment

Comments

@cpereir1
Copy link

cpereir1 commented Feb 8, 2023

Hi,
I have a Azure deployment of Kubeflow, in a Kubernetes cluster.
I would like to use Kedro, and kedro kubeflow to abstract from the difficulty of creating kubeflow pipelines.
I am wondering a few things:

  • Hoes kedro, or kedro kubeflow, interact with, for example, a locally downloaded dataset that kubeflow mounted into a local volume?
  • How can a kedro node use GPUs to run, for example, a training operation?
  • How can a kedro node execute, for example, a training pipeline, that consists of but multiple nodes and dependencies within it?
  • Where/how does kedro kubeflow generate the kubeflow pipeline.yaml necessary by Kubeflow to execute pipelines?

Thank you so much!

@marrrcin
Copy link
Contributor

Hi @cpereir1 , sorry for a late response:

Hoes kedro, or kedro kubeflow, interact with, for example, a locally downloaded dataset that kubeflow mounted into a local volume?

You can just read it from disk. Our plugin also mounts data volume under /home/kedro/datavolume as defined here:

def _setup_volumes(self, volume_name, image, image_pull_policy):

How can a kedro node use GPUs to run, for example, a training operation?

See #202 - we're going to release it soon. You can build the plugin from develop branch to use it right away.

How can a kedro node execute, for example, a training pipeline, that consists of but multiple nodes and dependencies within it?

Node dependencies from Kedro are automatically translated to dependencies between nodes in KFP.

Where/how does kedro kubeflow generate the kubeflow pipeline.yaml necessary by Kubeflow to execute pipelines?

It's generated in the directory from which you're running your kedro pipeline. You can check out kedro kubeflow compile command.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants