Kubeflow Contributor Guide

Welcome to the Kubeflow project! We'd love to accept your patches and contributions to this project. Please read the contributor's guide in our docs.

The contributor's guide

shows you where to find the Contributor License Agreement (CLA) that you need to sign,
helps you get started with your first contribution to Kubeflow,
and describes the pull request and review workflow in detail, including the OWNERS files and automated workflow tool.

This document focus on technical aspects while contributing to the Model Registry project

Contributing to Model Registry using Apple-silicon/ARM-based computers

Some limitations apply when developing on this project, specifically using Apple-silicon and Mac OSX. The content from this guide might also be applicable in part for general ARM-based developers/users, beyond Mac OSX.

Makefile

The make command shipped with Mac OSX (at the time of writing) is a bit old:

% make --version
GNU Make 3.81
Copyright (C) 2006  Free Software Foundation, Inc.
This is free software; see the source for copying conditions.
There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A
PARTICULAR PURPOSE.

This program built for i386-apple-darwin11.3.0

and known to cause problems when using this project's Makefile:

% make build
openapi-generator-cli validate -i api/openapi/model-registry.yaml
make: openapi-generator-cli: No such file or directory
make: *** [openapi/validate] Error 1

i.e. failing to locate the bin/ executables managed in the Makefile.

The solution is to use and updated version of make.

You can install it with Homebrew:

% brew install make
...
==> Pouring make--4.4.1.arm64_ventura.bottle.tar.gz
==> Caveats
GNU "make" has been installed as "gmake".
If you need to use it as "make", you can add a "gnubin" directory
to your PATH from your bashrc like:

    PATH="/opt/homebrew/opt/make/libexec/gnubin:$PATH"
...

and now you can substitute gmake every time the make command is mentioned in guides (or perform the path management per the caveat).

Docker engine

Several options of docker engines are available for Mac. Having Docker installed is also helpful for Testcontainers.

Colima

Colima offers Rosetta (Apple specific) emulation which is handy since the Google MLMD project dependency is x86 specific. You can install Colima (and Docker) with Homebrew.

You can create a Colima "docker context" focusing on x86 emulation with:

colima start --vz-rosetta --vm-type vz --arch x86_64 --cpu 4 --memory 8

To use with Testcontainers for Go you can use these commands:

export DOCKER_HOST="unix://${HOME}/.colima/default/docker.sock"
export TESTCONTAINERS_DOCKER_SOCKET_OVERRIDE="/var/run/docker.sock"

as instructed in this guide.

This colima setups allows to:

launch Integration tests in Go (used in Core go layer) with Testcontainers for Go
launch DevContainer to be able to install MLMD python wheel dependency (which is x86 specific)

Remote-only MLMD Python library

The Model Registry Python client wraps ml-metadata python dependency (which is x86 specific).

When developing and contributing to the Model Registry Python client using Apple-silicon/ARM-based computers, it can be helpful to evaluate using locally this soft-fork of the upstream MLMD library supporting only remote gRPC connections: https://github.com/opendatahub-io/ml-metadata/releases/tag/v1.14.0%2Bremote.1

This dependency can be substituted in the clients/python/pyproject.toml and installed locally with poetry lock. It is recommended not to check-in this substitution, only helpful for local development while using Apple-silicon/ARM-based computers.

DevContainer

Using a DevContainer is helpful to develop with the Model Registry Python client, since it needs to wrap MLMD python dependency (which is x86 specific).

This allows for instance with VSCode DevContainer extension to re-open VSCode window "inside" an x86 emulated docker container. The experience is very similar to when on GitHub.com you press dot . and you get a VSCode "inside something", except it is local to your machine. It's not super fast because x86 is emulated via Rosetta, but works "good enough" to complete most tasks without requiring remote connection to a real x86 server.

To use DevContainer as a framework directly, a command-line tool and an SDK is available as well on the upstream project: https://containers.dev.

Don't forget you will need a Docker context for x86 emulation, for instance with colima (see previous step) this can be achieved with:

colima start --vz-rosetta --vm-type vz --arch x86_64 --cpu 4 --memory 8

Define this .devcontainer/devcontainer.json file :

{
	"name": "Python 3",
	// "image": "mcr.microsoft.com/devcontainers/python:1-3.10-bullseye"
	"build": {
		"dockerfile": "Dockerfile"
	},
	"runArgs": [
		"--network=host"
	],
	"customizations": {
		"vscode": {
			"extensions": [
				// does not work well in DevContainer: "robocorp.robotframework-lsp",
				"d-biehl.robotcode"
			]
		}
	},
//...
}

The network=host allow from inside the devcontainer to reach any "service" exposed on your computer (host). This is helpful if other containers are started on your computer (eg: a PostgreSQL or DB in another container).

The customizations.vscode.extensions pre-loads additional extensions needed in VSCode to be executing from inside the DevContainer.

Define this .devcontainer/Dockerfile file:

FROM mcr.microsoft.com/devcontainers/python:1-3.10-bullseye

# Here I use the USER from the FROM image
ARG USERNAME=vscode
ARG GROUPNAME=vscode

# Here I use the UID/GID from _my_ computer, as I'm _not_ using Docker Desktop
ARG USER_UID=501
ARG USER_GID=20

RUN groupmod --gid $USER_GID -o $GROUPNAME \
    && usermod --uid $USER_UID --gid $USER_GID $USERNAME \
    && chown -R $USER_UID:$USER_GID /home/$USERNAME

# General setup which is "cached" for convenience
RUN pip install -U pip setuptools
RUN pip install -U poetry
RUN pip install -U "ml-metadata==1.14.0"
RUN pip install -U robotframework
RUN pip install -U robotframework-requests
RUN pip install -U PyYAML

The group/user is needed as on Mac anything but Docker Desktop will need to set correct FS permissions. (more details here: https://github.com/devcontainers/spec/issues/325).

The RUN pip install "caches" the local installation (inside the DevContainer) of some wheels which are almost always needed, for convenience.

Please notice the line RUN pip install -U "ml-metadata==1.14.0": that pip install would otherwise fail on Apple-silicon/ARM-based machines.

E.g. when issued on bare-metal-Apple-silicon fails with:

% pip install -U "ml-metadata==1.14.0"
ERROR: Could not find a version that satisfies the requirement ml-metadata==1.14.0 (from versions: 0.12.0.dev0, 0.13.0.dev0, 0.13.1.dev0)
ERROR: No matching distribution found for ml-metadata==1.14.0

As all the wheels are x86 specific: https://pypi.org/project/ml-metadata/1.14.0/#files

So it's not even possible to receive code assists. However, after clicking to re-open the project inside an (emulated) DevContainer:

Then with the given setup MLMD is already installed inside the DevContainer:

At this point Poetry is already installed as well and can be used to build and run test of the Model Registry Python client.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CONTRIBUTING.md

CONTRIBUTING.md

Kubeflow Contributor Guide

Contributing to Model Registry using Apple-silicon/ARM-based computers

Makefile

Docker engine

Colima

Remote-only MLMD Python library

DevContainer

Files

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Kubeflow Contributor Guide

Contributing to Model Registry using Apple-silicon/ARM-based computers

Makefile

Docker engine

Colima

Remote-only MLMD Python library

DevContainer