The trainer step of the pipeline times out with ImagePullBackOff as it cannot find the image produced. Problem: According to the builder step log, the builder does not seem to build (and push) the image, because of a kaniko issue:
requesting list . done
invalid API response status 500
mlflow/trainer:754795333 not found in registry.fuseml-registry, building...
kaniko should only be run inside of a container, run with the --force flag if you are sure you want to continue
Step completed
Perhaps a further secondary issue is that the builder step status is shown as Completed in the tekton status tab, while the image has not actually been built. Let me know if a separate bug should be created for this.
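The secondary issue (a step reported as Completed although no image was built) could in principle be caught by scanning the builder log for the kaniko refusal message. The following is only a minimal sketch of that idea, not how FuseML or Tekton actually determine step status; the function names and the pattern list are hypothetical, and the single pattern is copied from the log quoted above.

```python
import re
from typing import Iterable, List

# Illustrative pattern taken from the builder log in this issue.
# Assumption: other kaniko failures would need their own patterns.
FAILURE_PATTERNS = [
    re.compile(r"kaniko should only be run inside of a container"),
]


def find_build_failures(log_lines: Iterable[str]) -> List[str]:
    """Return the log lines that match a known build-failure pattern."""
    return [
        line.rstrip("\n")
        for line in log_lines
        if any(pattern.search(line) for pattern in FAILURE_PATTERNS)
    ]


def build_succeeded(log_lines: Iterable[str]) -> bool:
    """Report success only if no known failure pattern appears in the log."""
    return not find_build_failures(log_lines)


if __name__ == "__main__":
    builder_log = [
        "requesting list . done",
        "invalid API response status 500",
        "mlflow/trainer:754795333 not found in registry.fuseml-registry, building...",
        "kaniko should only be run inside of a container, run with the --force flag "
        "if you are sure you want to continue",
        "Step completed",
    ]
    for failure in find_build_failures(builder_log):
        print("build failure:", failure)
```

A check like this only recognises messages it already knows about, so propagating the exit code of the build command would be the more robust fix if the builder script allows it.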
This is a hack/workaround for the currently unsolved kaniko issue of refusing
to build images on non-container/bare-metal environments, see:
GoogleContainerTools/kaniko#1542
Workaround for fuseml/fuseml#252
I have set the environment variable container to the value kube in the builder step of mlflow-e2e as a workaround for this issue. A proper resolution requires the kaniko issue to be fixed upstream.
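As a rough illustration of the workaround, the sketch below runs a build command with the variable added to its environment. It assumes the variable is literally named container with the value kube, as the comment above describes; the helper names and the build command are hypothetical placeholders, and the actual builder step may set the variable in its Tekton definition instead.

```python
import os
import subprocess
from typing import Dict, Mapping, Optional, Sequence


def workaround_env(base: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Return a copy of the environment with container=kube added.

    An existing value of the variable is left untouched, so a real
    container runtime that already sets it is not overridden.
    """
    env = dict(os.environ if base is None else base)
    env.setdefault("container", "kube")
    return env


def run_builder(cmd: Sequence[str]) -> int:
    """Run a (hypothetical) build command with the workaround applied.

    The exit code is returned so the caller can fail the step on error.
    """
    completed = subprocess.run(list(cmd), env=workaround_env(), check=False)
    return completed.returncode
```

Returning the exit code rather than discarding it also helps with the secondary issue, since the calling step can then report a failure instead of Completed.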
To reproduce:
FuseML (HEAD f09f8679) was installed successfully using fuseml-installer install on k3s, running on an x86_64 bare-metal machine.
The mlflow-e2e tutorial https://fuseml.github.io/docs/v0.2/tutorials/ was followed, with no problems up to and including step 6.
Actual behavior:
The trainer step of the pipeline times out with ImagePullBackOff as it cannot find the image produced.
Problem: According to the builder step log (quoted above), the builder does not seem to build (and push) the image, because of a kaniko issue.
Possibly related upstream kaniko issue: GoogleContainerTools/kaniko#1542
Expected behavior:
All steps of the pipeline run successfully.
Perhaps a further secondary issue is that the builder step status is shown as Completed in the Tekton status tab, while the image has not actually been built. Let me know if a separate bug should be created for this.