Update Dockerfile with template for breaking build cache #204

Closed
wants to merge 2 commits

Conversation

ruffsl
Member

@ruffsl ruffsl commented Nov 10, 2018

Fixes: #112

@ruffsl
Member Author

ruffsl commented Nov 16, 2018

@tianon or @yosifkit , is this approach reasonable?
Context: #112 (comment)

@@ -16,6 +16,10 @@ RUN apt-key adv --keyserver hkp://keyserver.ubuntu.com:80 --recv-keys D2486D2DD8
RUN . /etc/os-release \
&& echo "deb http://packages.osrfoundation.org/gazebo/$ID-stable `lsb_release -sc` main" > /etc/apt/sources.list.d/gazebo-latest.list

# break build cache for sync
RUN echo "Release: Fri, 09 Nov 2018 14:51:50 UTC" \
&& echo "Digest: 1b1a9870d531fcfee20ae1ab3df60d499d6b206c11fe30e3c89dd111ae31c756"

Instead of just a useless echo to do a cache bust, here's what I had in mind:

# This is an auto generated Dockerfile for gazebo:gzserver7
# generated from docker_images/create_gzserver_image.Dockerfile.em
FROM ubuntu:xenial

# install packages
RUN apt-get update && apt-get install -q -y \
    dirmngr \
    gnupg2 \
    lsb-release \
    && rm -rf /var/lib/apt/lists/*

# setup keys
RUN apt-key adv --keyserver hkp://keyserver.ubuntu.com:80 --recv-keys D2486D2DD83DB69272AFE98867170598AF249743

# setup sources.list
RUN . /etc/os-release \
    && echo "deb http://packages.osrfoundation.org/gazebo/$ID-stable `lsb_release -sc` main" > /etc/apt/sources.list.d/gazebo-latest.list

# install gazebo packages
RUN apt-get update \
    && . /etc/os-release \
    && echo "021bafbd47748fa31505efcf66f499f8f9d7601c18f6d8456ec76686b76711df */var/lib/apt/lists/packages.osrfoundation.org_gazebo_$ID-stable_dists_$(lsb_release -sc)_InRelease" | sha256sum -c - \
    && apt-get install -q -y \
        gazebo7=7.14.0-1* \
    && rm -rf /var/lib/apt/lists/*

# setup environment
EXPOSE 11345

# setup entrypoint
COPY ./gzserver_entrypoint.sh /

ENTRYPOINT ["/gzserver_entrypoint.sh"]
CMD ["gzserver"]

(Actually verifying the hash of the interesting file in the appropriate place.)

Member Author

Ok @tianon, I've adjusted the break point to check the digest of the InRelease file.
I didn't insert it inline with the last install step, as I wanted the break point to be early, just after the repo is added, so that the rest of the repo's packages have a better chance of staying in sync.
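For illustration, the digest templated into that break point can be computed from a downloaded copy of the InRelease file. A minimal Python sketch, assuming the file has already been fetched locally (the function names and paths here are placeholders, not the actual template code):

```python
import hashlib


def inrelease_digest(path):
    """sha256 hex digest of a local copy of the repo's InRelease file."""
    with open(path, "rb") as f:
        return hashlib.sha256(f.read()).hexdigest()


def break_cache_run_line(digest, in_image_path):
    """Render a Dockerfile RUN line that asserts the digest inside the image."""
    return 'RUN echo "%s *%s" | sha256sum -c -' % (digest, in_image_path)
```

At build time the `sha256sum -c -` check fails (and the build aborts) whenever the repo has synced since the Dockerfile was templated.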

@130s

130s commented Nov 21, 2018

Is there anywhere I can look at the result of this change?
I'm not super knowledgeable about this sort of stuff, so I can't give a code review, but I'm still willing to contribute regarding the result of this change (I came from #197).

@tianon tianon left a comment

👍

@ruffsl
Member Author

ruffsl commented Nov 21, 2018

Is there anywhere I can look at the result of this change?

@130s, could you clarify what you mean by "result"?

Do you mean to ask about:

  • the change to the resulting images produced?
  • the change to the status quo of the PR pipeline wrt this repo?

The Docker images produced would include a new layer carrying the history of this InRelease check, so the metadata about the release it was built against could be inferred. It wouldn't be as obvious as reading an environment variable or checking the image tag. I briefly considered having the template set an ENV with the time and/or digest to break the cache, but I'm hesitant to pollute the downstream user environment (@tianon, if there is a convention or precedent for this already in the library, I'd take a look).

The other thing that 8e4a537 does in addition to 03b69b7 is actually assert that the InRelease file in the image matches the digest templated into the Dockerfile, returning non-zero otherwise. So in addition to the version of the relevant pinned package being bumped, a sync of any package in the package repo will break the build, helping to more tightly preserve the repeatability of the Dockerfile.

This will effectively block newer builds of an image until the Dockerfile itself is updated to reflect the latest sync, i.e. until the CI changes are propagated upstream. Still, I'm not sure this is best practice, as the blocking might slow down the propagation of upstream security patches inherited from base images (e.g. ubuntu, debian, etc.). My ideal level of repeatability would be a little looser, in that pinned packages would be consistent across consecutive builds of a Dockerfile, but the unpinned supporting packages could be subject to upstream security updates.

This is why I first opted for just a passive echo in 03b69b7: it makes a change in the Dockerfile to break the build cache, but doesn't necessarily block future builds. I'd say until #125 is resolved, and we have a more streamlined end-to-end PR pipeline, I'd be hesitant to merge 03b69b7, as the demand on maintainers here would increase, given we'd more frequently break the critical build path of security updates with any repo sync, regardless of whether relevant pinned packages were actually bumped.

@nuclearsandwich is there an easy way now of having bots trigger new PRs by being triggered from other PRs being merged? #125 would benefit from having something ensure the upstream manifest files always stay in sync with the copy of the manifest we generate locally in this repo.

@nuclearsandwich
Member

is there an easy way now of having bots trigger new PRs by being triggered from other PRs being merged?

Can you draw me a little diagram of what you envision here? What upstream repos are you trying to track changes in?

Contributor

@sloretz sloretz left a comment


IIUC we'll need to update the sha256 sum and PR https://github.com/docker-library/official-images/ every time a package is released on packages.osrfoundation.org or a sync happens on packages.ros.org. This is pretty frequent, but at least it's visible when a build on Docker Hub fails.

Just checking that I understand: it looks like updates from the base images would only be blocked if there was also a release or sync.

Approving the concept, I'll check what the current hashes are.

@ruffsl
Member Author

ruffsl commented Feb 28, 2019

This is pretty frequent

Indeed, that's why I'd like to further automate the upstream PR process. I think we could use CircleCI's Manual Job Approval in a Workflow to let us control when upstream PRs are deployed, so that our bots don't submit upstream PRs unsupervised, while still not burdening approvers with having to open them ourselves. Then we would be able to keep up with the relatively frequent syncs.

https://circleci.com/blog/manual-job-approval-and-scheduled-workflow-runs/

@sloretz, I can add the CircleCI logic, but would you be able to add another Python script, like the one currently controlling our PR bot, to make PRs targeting upstream master based on our manifest in local master?

@sloretz
Contributor

sloretz commented Feb 28, 2019

I can add the circleci logic, but would you be able to add another python script like we have currently controlling our PR bot

I can do that. Where's the script powering the PR bot?

to make PRs targeting upstream master based from our manifest in local master?

As in, a script that creates a PR like this one? Is this script also updating the hashes and creating the dockerfiles? How should it get the github token from the context it's run in? Keyring? Env var?

@ruffsl
Member Author

ruffsl commented Feb 28, 2019

I can do that. Where's the script powering the PR bot?

The current code for the PR bot resides in the travis.py file here:

# Create new PR for remote branch
g_upstream_repo.create_pull(
    title=title,
    body=body,
    base=GIT_BRANCH,
    head=pr_head_name)

As in, a script that creates a PR like this one?

Yep, it'll need to be generalized for both the ros and gazebo manifests.
The travis.py script does this by checking the HUB_REPO env var.

Is this script also updating the hashes and creating the dockerfiles?

The script/feature needed doesn't have to update the hashes in the local manifests; that's already achieved by the create_docker_library script run by the existing PR CI. What this script should do is check whether the respective manifest differs from the one upstream, and open a GitHub PR from some kind of bot template if they are out of sync.
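That check could be as small as the following sketch (hypothetical function names; the real script would also have to fetch the upstream manifest, e.g. over HTTP):

```python
import difflib


def manifest_diff(local_text, upstream_text):
    """Unified diff of the upstream manifest vs our local copy, for a PR body."""
    return "".join(difflib.unified_diff(
        upstream_text.splitlines(keepends=True),
        local_text.splitlines(keepends=True),
        fromfile="upstream/manifest", tofile="local/manifest"))


def manifests_out_of_sync(local_text, upstream_text):
    """Only open a bot PR when the manifests actually differ."""
    return manifest_diff(local_text, upstream_text) != ""
```

The diff text itself could be pasted into the generated PR description so reviewers can see exactly which entries drifted.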

How should it get the github token from the context it's run in? Keyring? Env var?

You can read them from env vars like the existing travis.py does, and we'll just configure them in the CI admin panel as private variables.

The idea in general: keep the existing CI as-is, so it continues to open PRs whenever Dockerfiles and manifests are generated out of date. The added CircleCI workflow would additionally trigger on each PR and immediately block, waiting on maintainer confirmation. The maintainer checks that the existing CI can build and pass the generated PR, clicks the GitHub merge button, then clicks the CircleCI approve button to deploy the upstream PR. The CircleCI workflow then continues and runs the new script, which diffs the respective manifest from the local PR just merged against upstream. If they differ, it checks whether an existing template PR is already open; if so, it could error out, and if not, it opens a new one using a template that pings the maintainer group.
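That last "check for an already-open template PR" step might look like this sketch, duck-typed around a PyGithub-style repository object (the title prefix is a made-up marker, not anything in the existing scripts):

```python
BOT_TITLE_PREFIX = "[bot] sync official-images manifest"  # hypothetical marker


def find_open_bot_pr(repo):
    """Return an already-open bot PR, or None if none exists."""
    for pr in repo.get_pulls(state="open"):
        if pr.title.startswith(BOT_TITLE_PREFIX):
            return pr
    return None


def maybe_open_upstream_pr(repo, manifests_differ, open_pr):
    """Error out on a duplicate, open a PR when manifests differ, else no-op."""
    if not manifests_differ:
        return "in-sync"
    if find_open_bot_pr(repo) is not None:
        raise RuntimeError("bot PR already open; refusing to open a duplicate")
    open_pr(repo)
    return "opened"
```

Erroring out on a duplicate keeps the bot from spamming upstream maintainers while a previous sync PR is still under review.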

@mikaelarguedas
Contributor

mikaelarguedas commented Mar 10, 2019

I believe this was discussed in the tickets, but I'd like to highlight that this means many more invalidations and rebuilds than the other suggested approaches. Situations that come to mind:

  • the release of a Python package will invalidate all ROS (1 and 2) images (e.g. the python3-colcon-* packages get new releases pretty often)
  • a sync of a specific distro will invalidate images of other distributions, e.g. a ROS Lunar sync will result in the invalidation of the ROS Kinetic xenial and ROS Melodic stretch images
    - the sync of a ROS 1 distro will invalidate images of ROS 2 distros: e.g. a ROS Melodic sync will invalidate the Bouncy, Crystal and Dashing images (and likely ROS 2 E)
    Edit: ⚠️ this is actually not true, as ROS 1 and ROS 2 packages are hosted on different repositories
  • the release of any leaf package not part of desktop_full will invalidate every image of that ROS distro, and of other ROS distros targeting the same OS.

As ROS 2 now also always installs a single package, it may not be too cluttering to just provide the version number suggested in #112 (comment) and avoid invalidating when not necessary.


I agree that automated builds and PR creation would be very beneficial if this is the strategy adopted 👍.
It may be worth considering if there are ways to reduce the number of automated PRs (both here and upstream) as well as manual edits needed when new distros (ROS or Ubuntu/Debian) come out.

@sloretz
Contributor

sloretz commented Mar 11, 2019

@ruffsl Added a script to create upstream PRs if the library definition files are different, in 8931512 (edit: and a6fcc52)

I pushed it to the branch in #230

@mikaelarguedas
Contributor

Superseded by osrf/docker_templates#90; example of resulting automatic PR: #459

@mikaelarguedas mikaelarguedas deleted the release_hash branch September 27, 2020 15:18
Successfully merging this pull request may close these issues.

[ros] Provide exact version numbers to ensure rebuilds
6 participants