
[WIP] Implement MlflowClient.log_model #11906

Open · wants to merge 4 commits into master

Conversation

rachthree commented May 4, 2024

🛠 DevTools 🛠

Open in GitHub Codespaces

Install mlflow from this PR

pip install git+https://github.com/mlflow/mlflow.git@refs/pull/11906/merge

Checkout with GitHub CLI

gh pr checkout 11906

Related Issues/PRs

Resolve #7392

What changes are proposed in this pull request?

This adds the log_model method to MlflowClient as requested in the above issue.

I believe this PR is the path of least resistance for providing this API, but this is my first PR to the mlflow repo, so I may have missed a better approach. Tests will be added once the implementation is agreed upon. I hope a cleaner path exists, because this implementation has caveats I'm not happy with:

  • Due to mlflow's default reliance on global tracking and registry URIs, internal functions are updated to accept an optional MlflowClient and to use its tracking URI when one is provided.
    • mlflow.tracking._tracking_service.utils._resolve_tracking_uri is updated to return the client's tracking URI when a client is provided. The precedence is now: an explicitly specified tracking URI, then the client's tracking URI, then the global tracking URI (see the sketch after this list).
    • mlflow.store.artifact.runs_artifact_repo.RunsArtifactRepository's get_underlying_uri is updated to use the client's tracking URI when a client is provided.
    • Model.log calls in flavors use the global URIs by default. To use the client's URIs, the client must be passed as a kwarg.
      • For now, I've only updated the PyTorch, ONNX, and PyFunc flavors. If this approach is the way forward, I will update all flavors.
      • How do we communicate this to those implementing their own custom flavor?
  • Many of the fluent APIs create their own instance of MlflowClient using the global defaults. This PR updates the ones needed to log a model so they use the provided client when applicable.
    • Updating the other fluent APIs to use a provided client may be out of scope for this PR. I can create a follow-up PR if needed, or include it in this PR if preferred.
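
To make the precedence above concrete, here is a minimal sketch of the resolution logic, assuming the client exposes its tracking URI (the attribute access below is illustrative, not the exact implementation):

import mlflow

def _resolve_tracking_uri(tracking_uri=None, client=None):
    # 1. An explicitly specified tracking URI always wins.
    if tracking_uri is not None:
        return tracking_uri
    # 2. Otherwise fall back to the provided client's tracking URI
    #    (attribute name illustrative; the client stores its own URI).
    if client is not None:
        return client.tracking_uri
    # 3. Finally fall back to the global tracking URI.
    return mlflow.get_tracking_uri()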

Possible follow-ups:

  • For this PR, if the approach has consensus: update all flavors to accept a client for log_model.
  • Update all fluent APIs to use a provided client when possible.
  • The natural next API to implement would be MlflowClient.load_model.

How is this PR tested?

  • Existing unit/integration tests - Will test pending discussion
  • New unit/integration tests - Will create pending discussion
  • Manual tests

With PostgreSQL and MLflow servers deployed as Docker containers, I ran:

import mlflow
from mlflow import MlflowClient
from pathlib import Path

from torchvision.models import resnet50

model = resnet50(pretrained=True)
model.cpu()
model.eval()

server_uri = "http://localhost:5000"
experiment_name = "r50"
local_data_dir = str(Path("./local_runs").expanduser().resolve())

# Default client: uses the global tracking/registry URIs (local file store here)
local_client = MlflowClient()
# Client pointed at the remote MLflow server for both tracking and registry
server_client = MlflowClient(tracking_uri=server_uri, registry_uri=server_uri)

print("Registering using local client...")
local_exp_info = local_client.get_experiment_by_name(experiment_name)
if local_exp_info:
    local_exp_id = local_exp_info.experiment_id
else:
    local_exp_id = local_client.create_experiment(experiment_name)
local_run_info = local_client.create_run(local_exp_id)
local_run_id = local_run_info.info.run_id
# Proposed API: log_model(run_id, model, artifact_path, flavor_module, **kwargs)
local_client.log_model(local_run_id, model, "torch-model", mlflow.pytorch, registered_model_name="r50-torch")

print("Registering using server client...")
server_exp_info = server_client.get_experiment_by_name(experiment_name)
if server_exp_info:
    server_exp_id = server_exp_info.experiment_id
else:
    server_exp_id = server_client.create_experiment(experiment_name)
server_run_info = server_client.create_run(server_exp_id)
server_run_id = server_run_info.info.run_id
server_client.log_model(server_run_id, model, "torch-model", mlflow.pytorch, registered_model_name="r50-torch")

Results:
The local client's behavior is unchanged, showing that usage of the default local file store has been maintained.

MLflow UI shows successful logging of the model (note: I ran the above example twice, showing two versions of the model in the server below):

[Screenshots: MLflow UI showing the logged run and the two registered versions of the r50-torch model]

Does this PR require documentation update?

  • No. You can skip the rest of this section.
  • Yes. I've updated:
    • Examples
    • API references
    • Instructions
  • Maybe? This adds a new requirement when using Model.log: the client must be passed so that its tracking / registry URIs are used (see the sketch below).
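
As a rough illustration of that requirement, here is a minimal sketch of how a flavor's log_model might forward the client to Model.log under this PR's proposed change (my_flavor, my_model, and the exact kwarg names are illustrative assumptions, not the final API):

from mlflow.models import Model

import my_flavor  # hypothetical custom flavor module implementing save_model()

def log_model(my_model, artifact_path, client=None, registered_model_name=None):
    # Forward the client so Model.log resolves the tracking/registry URIs from it
    # instead of the global defaults (the optional kwarg proposed by this PR).
    return Model.log(
        artifact_path=artifact_path,
        flavor=my_flavor,
        registered_model_name=registered_model_name,
        my_model=my_model,  # forwarded to my_flavor.save_model()
        client=client,
    )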

Release Notes

Is this a user-facing change?

  • No. You can skip the rest of this section.
  • Yes. Give a description of this change to be included in the release notes for MLflow users.

MlflowClient.log_model is added, allowing users to log models through the client instead of setting the global tracking and registry URIs.

What component(s), interfaces, languages, and integrations does this PR affect?

Components

  • area/artifacts: Artifact stores and artifact logging
  • area/build: Build and test infrastructure for MLflow
  • area/deployments: MLflow Deployments client APIs, server, and third-party Deployments integrations
  • area/docs: MLflow documentation pages
  • area/examples: Example code
  • area/model-registry: Model Registry service, APIs, and the fluent client calls for Model Registry
  • area/models: MLmodel format, model serialization/deserialization, flavors
  • area/recipes: Recipes, Recipe APIs, Recipe configs, Recipe Templates
  • area/projects: MLproject format, project running backends
  • area/scoring: MLflow Model server, model deployment tools, Spark UDFs
  • area/server-infra: MLflow Tracking server backend
  • area/tracking: Tracking Service, tracking client APIs, autologging

Interface

  • area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
  • area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
  • area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
  • area/windows: Windows support

Language

  • language/r: R APIs and clients
  • language/java: Java APIs and clients
  • language/new: Proposals for new client languages

Integrations

  • integrations/azure: Azure and Azure ML integrations
  • integrations/sagemaker: SageMaker integrations
  • integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

  • rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
  • rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
  • rn/feature - A new user-facing feature worth mentioning in the release notes
  • rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
  • rn/documentation - A user-facing documentation change worth mentioning in the release notes

Should this PR be included in the next patch release?

Yes should be selected for bug fixes, documentation updates, and other small changes. No should be selected for new features and larger changes. If you're unsure about the release classification of this PR, leave this unchecked to let the maintainers decide.

What is a minor/patch release?
  • Minor release: a release that increments the second part of the version number (e.g., 1.2.0 -> 1.3.0).
    Bug fixes, doc updates and new features usually go into minor releases.
  • Patch release: a release that increments the third part of the version number (e.g., 1.2.0 -> 1.2.1).
    Bug fixes and doc updates usually go into patch releases.
  • Yes (this PR will be cherry-picked and included in the next patch release)
  • No (this PR will be included in the next minor release)

github-actions bot added labels area/artifacts, area/model-registry, area/models, area/tracking, and rn/feature on May 4, 2024

github-actions bot commented May 4, 2024

Documentation preview for 96ce6cc will be available when this CircleCI job
completes successfully.


rachthree (Author) commented:
@WeichenXu123 @harupy @dbczumar tagging you here since you commented on the linked issue... not sure why I can't add you as reviewers. How does this implementation look? Thank you in advance!

Successfully merging this pull request may close the following issue:

[FR] Why is there no 'log_model()' function in mlflow.client? (#7392)