
Upgrade to Cloud Hypervisor v37.0 (LTS) #8695

Open
wants to merge 2 commits into main
Conversation

likebreath
Contributor

This release has been tracked in our roadmap project as iteration
v37.0. The following user-visible changes have been made:

Long Term Support (LTS) Release

This release is an LTS release. Point releases for bug fixes will be made
for the next 18 months; live migration and live upgrade will be
supported between the point releases of the LTS.

Multiple PCI segments Support for 32-bit VFIO Devices

Now VFIO devices with 32-bit memory BARs can be attached to non-zero PCI
segments on the guest, allowing users to have more 32-bit devices and
assign such devices to appropriate NUMA nodes for better performance.

Configurable Named TAP Devices

Named TAP devices now accept network configuration from users, such as
IP and MAC addresses, as long as the named TAP device is created by
Cloud Hypervisor (i.e. not a pre-existing TAP device).

TTY Output from Both Serial Device and Virtio Console

The legacy serial device and the virtio console can now both be set to
TTY mode at the same time. This allows users to capture early boot logs
with the legacy serial device without losing the performance benefits of
virtio-console, when an appropriate kernel configuration is used (such as
the kernel command line console=hvc0 earlyprintk=ttyS0 on x86).

Faster VM Restoration from Snapshots

The speed of VM restoration from snapshots has been improved with a
better implementation of JSON file deserialization.

Notable Bug Fixes

  • Fix aio backend behavior for block devices when the writeback cache
    is disabled
  • Fix PvPanic device PCI BAR alignment
  • Bug fix to OpenAPI specification file
  • Error out early for live migration when TDX is enabled

Fixes: #8694

@katacontainersbot katacontainersbot added the size/large Task of significant size label Dec 18, 2023
@likebreath
Contributor Author

Given it is an LTS release, please let me know if you want me to back-port it to our stable branch. Thanks.

@likebreath
Contributor Author

/test

@amshinde
Member

@likebreath Unless there are some critical security fixes that have gone in as well, I would not backport a version update to a past release. If you feel there are some security fixes that are worth backporting, then let us know.

@skaegi
Contributor

skaegi commented Dec 18, 2023

We would very much also like this back-ported to 3.2.x, as we are already in the process of doing this in our own tree and intend to follow the LTS release.

@likebreath
Contributor Author

@amshinde @skaegi Thanks for the quick response. Sounds like backporting only to stable-3.2 will be the best way to go. We can do that together after landing this one.

@likebreath
Contributor Author

likebreath commented Dec 18, 2023

I believe there are some unrelated failures (say clh/qemu-tracing/metrics, etc.), while the error from the worker run-nerdctl-tests (cloud-hypervisor) [1] looks to be a real catch (and might also be the reason for the other worker failures). The reported error is:

time="2023-12-18T22:31:35Z" level=warning msg="cannot set cgroup manager to \"systemd\" for runtime \"io.containerd.kata-cloud-hypervisor.v2\""
time="2023-12-18T22:31:36Z" level=fatal msg="failed to create shim task: Others(\"failed to handle message try init runtime instance\\n\\nCaused by:\\n    0: init runtime handler\\n    1: start sandbox\\n    2: set up device after start vm\\n    3: failed to set up network\\n    4: setup network\\n    5: attach\\n    6: do handle network Veth endpoint device failed.\\n    7: failed to add deivce\\n    8: add network device.\\n    9: Server responded with an error: InternalServerError: ApiError(VmAddNet(DeviceManager(CreateVirtioNet(OpenTap(TapSetNetmask(IoctlError(35100, Os { code: 99, kind: AddrNotAvailable, message: \\\"Address not available\\\" })))))))\"): unknown"

It could be related to cloud-hypervisor/cloud-hypervisor#5924, which changed the behavior of using a named tap device with Cloud Hypervisor. For example, with --net tap=newTap,ip=,mac=,mask=, the new release v37.0 of Cloud Hypervisor will create the newTap TAP device on the host and configure it with a default IP (192.168.249.1), MAC (random value), and netmask (255.255.255.0), while the original behavior was to leave the TAP device unconfigured.
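The behavior change can be illustrated with a small Go sketch (the NetConfig type and applyV37Defaults function are hypothetical illustrations, not Cloud Hypervisor code, which is Rust; the default values are the ones quoted above):

```go
package main

import "fmt"

// NetConfig is a hypothetical mirror of Cloud Hypervisor's --net parameters.
type NetConfig struct {
	Tap  string
	IP   string
	Mask string
}

// applyV37Defaults sketches the new v37.0 behavior: empty fields on a
// named TAP device are filled with defaults instead of being left alone.
func applyV37Defaults(c NetConfig) NetConfig {
	if c.IP == "" {
		c.IP = "192.168.249.1" // default IP quoted above
	}
	if c.Mask == "" {
		c.Mask = "255.255.255.0" // default netmask quoted above
	}
	return c
}

func main() {
	// Before v37.0, "--net tap=newTap,ip=,mask=" left the device unconfigured;
	// with v37.0, empty fields are replaced by defaults.
	cfg := applyV37Defaults(NetConfig{Tap: "newTap"})
	fmt.Println(cfg.IP, cfg.Mask) // prints "192.168.249.1 255.255.255.0"
}
```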

Will such behavior change from Cloud Hypervisor cause the test failure above?

@amshinde
Member

@likebreath That's the failure seen with the rust runtime. Looks like tests are failing with the go runtime as well: https://github.com/kata-containers/kata-containers/actions/runs/7252876624/job/19796037426?pr=8695

time="2023-12-18T22:31:10Z" level=fatal msg="failed to create shim task: \"update interface: Link not found (Address: 1e:f3:1f:68:b7:4c)\": unknown"

With the go runtime, we open the tap interface and pass the file descriptor to clh, as seen here:
https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/clh.go#L184
With the rust runtime, we pass the tap interface name instead. (There are plans to add multi-queue support and pass the fd eventually, but we will get to this in the future.)
Will clh try to create another tap if it already exists in that case? We also rely on the hypervisor not changing the MAC address, as the kata-agent uses the MAC address to identify the network device inside the guest and configure its name and IP address later.

@likebreath
Contributor Author

@likebreath That's the failure seen with the rust runtime. Looks like tests are failing with the go runtime as well: https://github.com/kata-containers/kata-containers/actions/runs/7252876624/job/19796037426?pr=8695

time="2023-12-18T22:31:10Z" level=fatal msg="failed to create shim task: \"update interface: Link not found (Address: 1e:f3:1f:68:b7:4c)\": unknown"

With the go runtime, we open the tap interface and pass the file descriptor to clh, as seen here: https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/clh.go#L184 With the rust runtime, we pass the tap interface name instead. (There are plans to add multi-queue support and pass the fd eventually, but we will get to this in the future.)

Thank you for the context. For the case where an fd is used for creating a virtio-net device, there is no behavior change on the Cloud Hypervisor side. So for the case with the go runtime, the error is caused by something else. Do you have any thoughts?

Will clh try to create another tap if it already exists in that case? We also rely on the hypervisor not changing the MAC address, as the kata-agent uses the MAC address to identify the network device inside the guest and configure its name and IP address later.

No, Cloud Hypervisor won't create another tap device if the given tap device already exists, based on the input tap device name; see the code here [1]. Does kata create the tap device and configure its MAC for runtime-rs? If that's the case, runtime-rs should not see any behavior change from Cloud Hypervisor either.

[1] https://github.com/cloud-hypervisor/cloud-hypervisor/blob/24f384d2397a93ca32b7efcda2105e67bdac7b3c/net_util/src/open_tap.rs#L76-L78
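The existence check referenced in [1] can be approximated with Go's standard library (a runtime-side sketch; the actual Cloud Hypervisor code does this in Rust, and the interface name below is hypothetical):

```go
package main

import (
	"fmt"
	"net"
)

// tapExists reports whether a network interface with the given name is
// already present on the host. A hypervisor following the logic in [1]
// would only create the TAP device when this returns false, and would
// reuse the existing device otherwise.
func tapExists(name string) bool {
	_, err := net.InterfaceByName(name)
	return err == nil
}

func main() {
	// A name longer than Linux's 15-character IFNAMSIZ limit cannot exist,
	// so this lookup always fails.
	fmt.Println(tapExists("surely-missing-tap0")) // prints "false"
}
```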

@amshinde
Member

Does kata create the tap device and configure its MAC for runtime-rs?

Yes, Kata creates the tap device as seen here:
https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/network_linux.go#L829

It then configures the mac address as seen here:
https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/network_linux.go#L858

I ran the CI again to see if this was a one-off error due to a race condition, but the CI seems to fail consistently on not being able to find the MAC address. So maybe clh is updating the MAC address at some point.
Will have to reproduce this locally to see if that's the case.

@likebreath
Contributor Author

Does kata create the tap device and configure its MAC for runtime-rs?

Yes, Kata creates the tap device as seen here: https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/network_linux.go#L829

It then configures the mac address as seen here: https://github.com/kata-containers/kata-containers/blob/main/src/runtime/virtcontainers/network_linux.go#L858

I ran the CI again to see if this was a one-off error due to a race condition, but the CI seems to fail consistently on not being able to find the MAC address. So maybe clh is updating the MAC address at some point. Will have to reproduce this locally to see if that's the case.

Agree. Would you please help set up an environment to reproduce the error locally so that we can look into it in detail? Thanks a lot. @amshinde

@likebreath
Contributor Author

/test

@likebreath
Contributor Author

For future reference, @amshinde and I have been following this issue recently, and here is a summary of this long-standing PR. There are essentially two different issues involved:

  1. Issue runtime-rs: ch: runtime crashes with Docker when creating network tap device with newer Cloud Hypervisor versions #9254, which is related to this PR and impacts only the rust runtime (fixed via runtime-rs: ch: Provide valid default value for NetConfig #9295);
  2. Issue nerdctl tests not working with cloud hypervisor runtime-rs #8831, which is a totally separate issue (nothing to do with the changes here) that impacts both the golang and rust runtimes.

With that, this PR is ready for review and to be landed. Note that the following CI failures (non-required jobs) are not related to the changes here:

 run-k8s-tests (qemu, kubeadm)
 run-k8s-tests-on-sev (qemu-sev, nydus, guest-pull)
 run-k8s-tests-on-tdx (qemu-tdx, nydus, guest-pull)
 run-k8s-tests-sev-snp (qemu-snp, nydus, guest-pull)
 run-monitor (qemu, containerd)

@likebreath
Contributor Author

likebreath commented Apr 24, 2024

@GabyCT @amshinde Not sure why the Jenkins-based CI workers are not running after being triggered for 2 hours. Would you please help and take a look? Thanks.

Looks like it is a matter of availability of the underlying aarch64 system, which was backlogged. I guess it just needs that much time to pick up the job and run it.

@skaegi
Contributor

skaegi commented Apr 24, 2024

We've been using this in our kata release since December. Would suggest bumping this to 37.1 though for a few fixes.

@likebreath
Contributor Author

We've been using this in our kata release since December. Would suggest bumping this to 37.1 though for a few fixes.

That's the plan, but we will upgrade to the latest release v38.0 (instead of v37.1). I will follow up with another PR for that.

@likebreath
Contributor Author

Rebased the PR to have newly added tests on runtime-rs + CH: #9525

/cc @jodh-intel

/test

Details of this release can be found in our roadmap project as iteration
v37.0: https://github.com/orgs/cloud-hypervisor/projects/6.

Fixes: kata-containers#8694

Signed-off-by: Bo Chen <chen.bo@intel.com>
This patch re-generates the client code for Cloud Hypervisor v37.0.
Note: The client code of cloud-hypervisor's OpenAPI is automatically
generated by openapi-generator.

Fixes: kata-containers#8694

Signed-off-by: Bo Chen <chen.bo@intel.com>
@likebreath
Contributor Author

Rebased to include #9562 for fixing unrelated CI worker failures.

@likebreath
Contributor Author

/test
