Keppel Operator Guide

In this document:

Terminology and data model
- Validation and garbage collection
Building and running Keppel
Prometheus metrics

In other documents:

Supported drivers

Terminology and data model

This document assumes that you have already read and understood the general README. If not, start reading there.

As outlined in the general README, Keppel stores data in accounts (each of which is tied to exactly one backing storage) that are associated with auth tenants that control access to them.

Accounts are structured into repositories. Each repository contains any number of blobs, manifests and tags:

Manifests collect a number of references to other manifests and blobs plus some metadata. Keppel parses manifests before storing them, so users are limited to the manifest formats supported by Keppel. Those supported formats are Docker images and image lists, OCI image manifests and OCI image indexes. Manifests are identified by their SHA-256 digest.
Blobs are binary objects with arbitrary contents. Keppel does not inspect blobs' contents, but because only blobs referenced by manifests can be stored long-term (all blobs without a manifest reference are garbage-collected, see below), only blob types supported by the aforementioned manifest formats will be seen in practice. Those are mostly image layers and image configurations. Blobs, like manifests, are identified by their SHA-256 digest.
Tags are named references to manifests (similar to how Git tags are just nice names for Git commits).

For example, consider the following image reference appearing in a Docker command:

$ docker pull keppel.example.com/os-images/debian:jessie-slim

This is referring to the tag jessie-slim within the repository debian in the Keppel account os-images. The Docker client (as well as the OCI Distribution API spec) would consider the repository to be os-images/debian in this case since they don't know about Keppel accounts. In general, the Keppel account name is always the first part of the repository name, up to the first slash. In Keppel, we usually refer to debian as the "repository name" and os-images/debian as the "full repository name".

Multiple Keppel deployments can be set up as peers of each other, which enables users to setup replication between accounts of the same name on multiple peers. For example, if keppel-1.example.com and keppel-2.example.com are peers of each other, and the account library is set up as a primary account on keppel-1.example.com and a replica of that primary account on keppel-2.example.com, then the following will work:

$ docker push keppel-1.example.com/library/myimage:mytag
$ docker pull keppel-2.example.com/library/myimage:mytag # same image!

When Keppel instances are configured as peers for each other, they will regularly check in with each other to issue each other service user passwords. This process is known as peering.

There's one more thing you need to know: In Keppel's data model, blobs are actually not sorted into repositories, but one level higher, into accounts. This allows us to deduplicate blobs that are referenced by multiple repositories in the same account. To model which repositories contain which blobs, Keppel's data model has an additional object, the blob mount. A blob mount connects a blob stored within an account with a repo within that account where that blob can be accessed by the user. All in all, our data model looks like this: (Peers and quotas are not pictured for simplicity's sake.)

Validation and garbage collection

The chart above indicates various recurring tasks that need to be run on a regular basis. Keppel has a dedicated server component, the janitor, which is responsible for performing these recurring tasks. The table below explains all the tasks performed by the janitor.

Task	Explanation
Manifest reference validation	Takes a manifest, parses its contents and check that the references to other manifests and blobs included therein are correctly entered in the database. Rhythm: every 24 hours (per manifest) Clock: database field `manifests.validated_at` Signal: Prometheus counter `keppel_manifest_validations` Success signal: database field `manifests.validation_error_message` cleared Failure signal: database field `manifests.validation_error_message` filled
Blob content validation	Takes a blob and computes the digest of its contents to see if it checks the digest stored in the database. Rhythm: every 7 days (per blob) Clock: database field `blobs.validated_at` Success signal: Prometheus counter `keppel_blob_validations` Success signal: database field `blobs.validation_error_message` cleared Failure signal: Prometheus counter `keppel_blob_validations` Failure signal: database field `blobs.validation_error_message` filled
Blob mount GC	Takes a repository and unmounts all blobs that are not referenced by any manifest in this repository. Rhythm: every hour (per repository), BUT not while any manifests in the repository fail validation Clock: database field `repos.next_blob_mount_sweep_at` Signal: Prometheus counter `keppel_mount_sweeps`
Blob GC	Takes an account and deletes all blobs that are not mounted into any repository. Rhythm: every hour (per account) Clock: database field `accounts.next_blob_sweep_at` Signal: Prometheus counter `keppel_blob_sweeps`
Storage GC	Takes an account's backing storage and deletes all blobs and manifests in it that are not referenced in the database. Rhythm: every 6 hours (per account) Clock: database field `accounts.next_storage_sweep_at` Signal: Prometheus counter `keppel_storage_sweeps`
Tag/manifest sync	Takes a repo in a replica account and deletes all manifests stored in it that have been deleted on the primary account. Also moves all replicated tags to point to the same manifest as on the primary account, replicating new manifests as necessary. Rhythm: every hour (per repository) Clock: database field `repos.next_manifest_sync_at` Signal: Prometheus counter `keppel_manifest_syncs`
Image GC	Evaluates all GC policies configured by users on their accounts (see respective section in API spec for details). Rhythm: every hour (per repository) Clock: database field `repos.next_gc_at` Signal: Prometheus counter `keppel_image_garbage_collections`
Cleanup of abandoned uploads	Takes a blob upload that is still technically in progress, but has not been touched by the user in 24 hours, and removes it from the database and backing storage. Rhythm: 24 hours after upload was last touched (per upload) Clock: database field `uploads.updated_at` Signal: Prometheus counter `keppel_abandoned_upload_cleanups`
Account federation announcement	Takes an account and announces its existence to the federation driver. This is a no-op for the simpler federation driver implementations. For federation drivers that track account existence in a global-scoped storage, this validation ensures that all existing accounts are correctly tracked there. This is most useful when switching to a different federation driver and populating its storage. Rhythm: every hour (per account) Clock: database field `accounts.next_federation_announcement_at` Signal: Prometheus counter `keppel_account_federation_announcements`
Security scanning	Only if a Trivy instance has been configured (see below). Takes a manifest and updates its vulnerability status according to the result of its security scan in Trivy. Rhythm: every hour (per manifest) Clock: database field `trivy_security_info.next_check_at` Signal: Prometheus counter `keppel_trivy_security_status_checks`

In this table:

The rhythm is how often each task is executed.
The clock is a timestamp field in the database that indicates when the task was last run (or will next run) for a particular manifest/repository/account. You can manipulate this field if you want to have a task re-run ahead of schedule.
Success signals indicate that a task completed successfully.
Failure signals indicate that a task failed.

Most garbage collection (GC) passes run in a mark-and-sweep pattern: When an unreferenced object is encountered for the first time, it is only marked for deletion. It will be deleted when the next run still finds it unreferenced. This is to avoid inconsistencies arising from write operations running in parallel with a GC pass.

Note that the GC passes chain together: When a manifest is deleted, the blob mount GC will clean up its blob mounts. Then the blob GC will clean up the blobs. Both steps take about 2-3 hours because of the hourly GC rhythm and the mark-and-sweep pattern. Therefore it takes between 4-6 hours from a manifest's deletion to the deletion of the blobs on the backing storage. Add another 1-2 hours if you're deleting on a primary account and want to see blobs deleted in the replica account.

In the SAP Converged Cloud deployments of Keppel, we alert on all the keppel_...{task_outcome="failure"} metrics to be notified when any of these tasks are failing. We also use postgres_exporter custom metrics to track the clock database fields as Prometheus metrics and alert when these get way too old to be notified when the tasks are not running at all for some reason. See here under customMetrics and here for details.

Building and running Keppel

Build Keppel with make, install with make install or docker build. This is the same as in the general README, since the server and client components are all combined in one single binary. For a complete Keppel deployment, you need to run:

as many instances of keppel server api as you want,
exactly one instance of keppel server janitor,
optionally, one instance of keppel server healthmonitor,
optionally, one instance of keppel server anycastmonitor.

All commands take configuration from environment variables, as listed below.

Drivers

Keppel provides several interfaces for pluggable drivers, and it is up to you to choose the appropriate drivers for your environment:

The auth driver accesses an external auth service and translates users and permissions in that auth service into permissions in Keppel. The choice of auth driver therefore defines which auth tenants exist.
The storage driver accesses an external storage service and chooses the backing storage for each Keppel account. The choice of storage driver is usually linked to the choice of auth driver because the auth driver needs to set up permissions for Keppel to access the storage service.
The federation driver decides which account names a given user and auth tenant is allowed to claim. In a single-region deployment, the "trivial" federation driver allows everyone to claim any unused name. In a multi-region deployment, an appropriate federation driver could access a central service that manages account name claims. As for storage drivers, the choice of federation driver may be linked to the choice of auth driver.
The rate limit driver decides how many pull/push operations can be executed per time unit for a given account. This driver is optional. If no rate limit driver is configured, rate limiting will not be enabled. As for storage drivers, the choice of rate limit driver may be linked to the choice of auth driver.
The inbound cache driver adds a caching strategy to manifest pulls from external registries. The simplest implementation is the "trivial" inbound cache driver, which does not cache anything. Every access is a cache miss and goes through to the external registry.

Common configuration options

The following configuration options are understood by both the API server and the janitor:

Variable	Default	Explanation
`KEPPEL_API_PUBLIC_FQDN`	(required)	Full domain name where users reach keppel-api.
`KEPPEL_AUDIT_RABBITMQ_QUEUE_NAME`	(required for enabling audit trail)	Name for the queue that will hold the audit events. The events are published to the default exchange.
`KEPPEL_AUDIT_RABBITMQ_USERNAME`	`guest`	RabbitMQ Username.
`KEPPEL_AUDIT_RABBITMQ_PASSWORD`	`guest`	Password for the specified user.
`KEPPEL_AUDIT_RABBITMQ_HOSTNAME`	`localhost`	Hostname of the RabbitMQ server.
`KEPPEL_AUDIT_RABBITMQ_PORT`	`5672`	Port number to which the underlying connection is made.
`KEPPEL_AUDIT_SILENT`	(optional)	Whether to disable audit event logging to standard output.
`KEPPEL_DB_NAME`	`keppel`	The name of the database.
`KEPPEL_DB_USERNAME`	`postgres`	Username of the user that Keppel should use to connect to the database.
`KEPPEL_DB_PASSWORD`	(optional)	Password for the specified user.
`KEPPEL_DB_HOSTNAME`	`localhost`	Hostname of the database server.
`KEPPEL_DB_PORT`	`5432`	Port on which the PostgreSQL service is running on.
`KEPPEL_DB_CONNECTION_OPTIONS`	(optional)	Database connection options.
`KEPPEL_DRIVER_AUTH`	(required)	The name of an auth driver.
`KEPPEL_DRIVER_FEDERATION`	(required)	The name of a federation driver. For single-region deployments, the correct choice is probably `trivial`.
`KEPPEL_DRIVER_INBOUND_CACHE`	(required)	The name of an inbound cache driver. The driver name `trivial` chooses a zero-sized cache that effectively disables caching entirely.
`KEPPEL_DRIVER_STORAGE`	(required)	The name of a storage driver.
`KEPPEL_ISSUER_KEY`	(required)	The private key (in PEM format, or given as a path to a PEM file) that keppel-api uses to sign auth tokens for Docker clients. Can be generated with `openssl genrsa -out privkey.pem 4096` for RSA (legacy), or `openssl genpkey -algorithm ed25519 -out privkey.pem` for ed25519 (preferred).
`KEPPEL_PREVIOUS_ISSUER_KEY`	(optional)	The previous `KEPPEL_ISSUER_KEY`. If given, tokens signed with this key will still be accepted. This can be used to rotate issuer keys without disrupting the validity of pre-existing tokens.

To choose drivers, refer to the documentation for drivers. Note that some drivers require additional configuration as mentioned in their respective documentation.

API server configuration options

These options are only understood by the API server.

Variable	Default	Explanation
`KEPPEL_ANYCAST_ISSUER_KEY`	(required if `KEPPEL_API_ANYCAST_FQDN` is configured)	Like `KEPPEL_ISSUER_KEY`, but this key is used to sign tokens for access to the anycast-style endpoints. (See below for details.) This key must be the same for all keppel-api instances with the same anycast domain name.
`KEPPEL_ANYCAST_PREVIOUS_ISSUER_KEY`	(optional)	The previous `KEPPEL_ANYCAST_ISSUER_KEY`. If given, anycast tokens signed with this key will still be accepted. This can be used to rotate issuer keys without disrupting the validity of pre-existing tokens.
`KEPPEL_API_ANYCAST_FQDN`	(optional)	Full domain name where users reach any keppel-api from this Keppel's group of peers, usually through some sort of anycast mechanism (hence the name). When this keppel-api receives an API request directed to this URL or a path below, and the respective Keppel account does not exist locally, the request is reverse-proxied to the peer that holds the primary account. The anycast endpoints are limited to anonymous authorization and therefore cannot be used for pushing.
`KEPPEL_API_LISTEN_ADDRESS`	:8080	Listen address for HTTP server.
`KEPPEL_DRIVER_RATELIMIT`	(optional)	The name of a rate limit driver. Leave empty to disable rate limiting.
`KEPPEL_GUI_URI`	(optional)	If true, GET requests coming from a web browser for URLs that look like repositories (e.g. https://registry.example.org/someaccount/somerepo) will be redirected to this URL. The value must be a URL string, which may contain the placeholders `%ACCOUNT_NAME%`, `%REPO_NAME%` and `%AUTH_TENANT_ID%`. These placeholders will be replaced with their respective values if present. To avoid leaking account existence to unauthorized users, the redirect will only be done if the repository in question allowed anonymous pulling.
`KEPPEL_PEERS`	(optional)	A comma-separated list of hostnames where our peer keppel-api instances are running. This is the set of instances that this keppel-api can replicate from.
`KEPPEL_REDIS_ENABLE`	(required if `KEPPEL_DRIVER_RATELIMIT` is configured)	Whether to use Redis as an ephemeral storage by compatible auth drivers and rate limit drivers.
`KEPPEL_REDIS_HOSTNAME`	`localhost`	Hostname of the Redis server.
`KEPPEL_REDIS_PORT`	`6379`	Port on which the Redis server is running on.
`KEPPEL_REDIS_DB_NUM`	`0`	Database number.
`KEPPEL_REDIS_PASSWORD`	(optional)	Password for the authentication.

API server: Domain remapping support

Usually, Keppel exposes its APIs under the hostnames specified in $KEPPEL_API_PUBLIC_FQDN and $KEPPEL_API_ANYCAST_FQDN. However, if you wish, you can also configure your HTTPS reverse-proxy to serve the Keppel API on direct subdomains of these hostnames. In this case, the name of the subdomain will be interpreted as a Keppel account name, and the Registry API will be exposed on these subdomains without requiring the account name in the URL path. This is explained in more detail in the API spec.

Janitor configuration options

These options are only understood by the janitor.

Variable	Default	Explanation
`KEPPEL_JANITOR_LISTEN_ADDRESS`	:8080	Listen address for HTTP server (only provides Prometheus metrics).

Health monitor configuration options

The health monitor takes some configuration options on the commandline:

$ keppel server healthmonitor <account-name> --listen <listen-address>

Option	Default	Explanation
`<account-name>`	(required)	The account where the test image is uploaded to and downloaded from. This account should be reserved for the health monitor and not be used by anyone else.
`<listen-address>`	:8080	Listen address for HTTP server (only provides Prometheus metrics).

Additionally, the environment variables must contain credentials for authenticating with the authentication method used by the target Keppel API. (This is because the health monitor accesses the Keppel API to manage the configuration of its account.) Refer to the documentation of your auth driver for what environment variables are expected.

After the initial setup phase (where the account is created and the test image is uploaded), the test image will be downloaded and validated every 30 seconds. The result of the test is published as a Prometheus metric (see below). If the test fails, a detailed error message is logged in stderr. If the setup phase fails, an error message is logged as well and the program immediately exits with non-zero status.

Trivy Proxy configuration options

These options are only useful when the Trivy proxy is deployed but the Keppel API and janitor are also influenced by them.

Variable	Default	Explanation
`KEPPEL_TRIVY_ADDITIONAL_PULLABLE_REPOS`	(optional)	It adds additional scopes to the token issued by the API and the janitor which is meant to allow the trivy components to pull their DB OCI images from the respective repos.
`KEPPEL_TRIVY_DB_MIRROR_PREFIX`	(required)	Prefix under which trivy can find its database. This might be a mirror or ghcr.io.
`KEPPEL_TRIVY_TOKEN`	(required)	Static secret given out by the Keppel API and janitor to the trivy client to authenticate against the trivy server.
`KEPPEL_TRIVY_URL`	(required)	The URL under which the trivy proxy can be reached.

Prometheus metrics

All server components emit Prometheus metrics on the HTTP endpoint /metrics.

API metrics

Metric	Labels	Explanation
`keppel_pulled_blobs` `keppel_pushed_blobs` `keppel_pulled_manifests` `keppel_pushed_manifests` `keppel_aborted_uploads`	`account`, `auth_tenant_id`, `method`	Counters for various API operations, as identified by the metric name. `keppel_aborted_uploads` counts blob uploads that ran into errors. Successful uploads are counted by `keppel_pushed_blobs` instead. `method` is usually `registry-api`, but can also be `replication` (counting pulls on the primary account and pushes into replica accounts).
`keppel_failed_auditevent_publish` `keppel_successful_auditevent_publish`	none	Counter for failed/successful deliveries of audit events (only if audit event sending is configured).

Janitor metrics

See above for explanations of each operation.

Metric	Labels	Explanation
`keppel_blob_sweeps` `keppel_storage_sweeps`	`task_outcome` set to either `failure` or `success`	Counters for account-level operations. One increment equals one account.
`keppel_blob_mount_sweeps` `keppel_manifest_syncs`		Counters for repository-level operations. One increment equals one repository.
`keppel_blob_validations`	`task_outcome` set to either `failure` or `success`	Counters for blob-level operations. One increment equals one blob.
`keppel_manifest_validations`	`task_outcome` set to either `failure` or `success`	Counters for manifest-level operations. One increment equals one manifest.
`keppel_abandoned_upload_cleanups`	`task_outcome` set to either `failure` or `success`	Counters for upload-level operations. One increment equals one upload.

Health monitor metrics

Metric	Labels	Explanation
`keppel_healthmonitor_result`	none	0 if the last health check failed, 1 if it succeeded.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

operator-guide.md

operator-guide.md

Keppel Operator Guide

Terminology and data model

Validation and garbage collection

Building and running Keppel

Drivers

Common configuration options

API server configuration options

API server: Domain remapping support

Janitor configuration options

Health monitor configuration options

Trivy Proxy configuration options

Prometheus metrics

API metrics

Janitor metrics

Health monitor metrics

Files

operator-guide.md

Latest commit

History

operator-guide.md

File metadata and controls

Keppel Operator Guide

Terminology and data model

Validation and garbage collection

Building and running Keppel

Drivers

Common configuration options

API server configuration options

API server: Domain remapping support

Janitor configuration options

Health monitor configuration options

Trivy Proxy configuration options

Prometheus metrics

API metrics

Janitor metrics

Health monitor metrics