Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Configure registry cache in the shoot cluster (RegistryConfig and secret) #244

Open
pbochynski opened this issue Aug 27, 2024 · 3 comments
Labels
kind/enhancement Enhancement, improvement, extension lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@pbochynski
Copy link

pbochynski commented Aug 27, 2024

How to categorize this issue?

/area usability
/kind enhancement

What would you like to be added:
Possibility to use RegistryConfig and secret from shoot cluster. It can be done similarly to DNSProviders like:

Kind: Shoot
...
spec:
  extensions:
    - type: registry-cache
      providerConfig:
        apiVersion: registry.extensions.gardener.cloud/v1alpha3
        kind: RegistryConfig
        registryConfigReplication:
          enabled: true
...

With replication enabled the RegistryConfig custom resource definition would be created in the shoot cluster and user could create such configs directly in the shoot.

Why is this needed:
Kyma customers do not have access to the control plane (garden project), they create resources in shoot cluster. We could introduce some mechanism to replicate the registry cache configuration from the shoot cluster to the garden project but it doesn't make sense, as the registry cache is running in the shoot cluster. We could keep credentials and configuration only in the shoot cluster and simplify our solution.

@ialidzhikov
Copy link
Member

Hi @pbochynski,

I think there are two feature requests in this issue:

  1. Enable deployment of a registry cache via custom resource in the Shoot cluster.

For a context, I think you have been using so far globally enabled extensions such as shoot-dns-service and shoot-cert-service. There are 2 ways how an extension can be enabled for a Shoot. The first way is the extension to be globally enabled by Gardener administrators (in the ControllerRegistration). A globally enabled extension is enabled for all Shoots on the corresponding landscape. The second way is an extension to be enabled via the Shoot spec, see example.
While extensions like shoot-dns-service and shoot-cert-service are suitable for and can be globally enabled (you don't have to do anything in the Shoot spec to enable them for the Shoot), there are extensions out there which are not suitable for and are not globally enabled. If you build a machinery on top of Gardener, it is not a correct expectation every extension to be globally enabled and nothing to be required to be configured in the Shoot spec. This is not the case for extensions like [registry-cache] and shoot-rsyslog-relp.

The second point I wanted to raise is that the registry-cache extension configuration is coupled to the Shoot cluster and makes sense there. For the shoot-dns-service and shoot-cert-service extensions you don't specify what kind of DNS records or certificates you want in the Shoot spec. It does not make much sense as usually DNS record or certificate is coupled to a Service/Ingress in the cluster, and not to the cluster itself. That's why the authors of these extensions choose to have CRDs like DNSEntry and Certificate available in the Shoot spec. You don't know usually on Shoot creation what kind of DNSEntrys and Certificates you need.
The registry-cache configuration is in the Shoot spec because it is a configuration related to the Shoot cluster (whether you want to cache images, which upstreams you want to cache, etc.). It is also coupled with the Shoot lifecycle. Enablement of a registry-cache does not consist of any deploying a StatefulSet and other K8s resources. The extension makes sure to configure the containerd on the cluster Nodes to make sure that containerd uses the deployed in-cluster registry cache.

tl;dr: I think it is about wrong expectation how extensions are usually deployed and consumed. Additionally, a consumption model based on custom resources does not fit the conceptual model of the registry-cache extension which is coupled to the Shoot cluster lifecycle (and not to a concept like Service/Ingress).

  1. Allow Secrets for the private registries to be specified in the Shoot cluster, not in the Garden cluster.

You might be also wondering why need to specify such credentials for the registry-cache when I already provide these credentials in form of image pull Secrets.

The short answer: This is how it is implemented in the upstream. See distribution/distribution#4281.

The long answer: The used upstream implementation of pull through cache (the Distribution project) supports only configuring single set of credentials per upstream. The pull through cache does not respect the authentication information provided by containerd but requires single set of credentials to be configured for it and it uses these credentials when it does requests to the upstream. If the pull through cache provided to the upstream the authentication info it receives in the request, it wouldn't be needed to configure these additional credentials in the Garden cluster.

We had this request also raised in the past: "Why I have to specify the image pull Secret in the Garden cluster again while I already have it specified in the Shoot cluster?". Tecnically, it should be possible to also specify this Secret in the Shoot cluster as well. However, it is again not possible to eliminate having 2 Secrets - 1 image pull Secret for the Pod (to make sure that containerd can pull the image from the upstream in case the cache is not available) and 1 Secret for registry-cache (to make sure the cache can pull the images from the upstream). I don't know if it improves much the user experience if we require the Secret to be specified in the Shoot cluster instead of in the Garden cluster.

@gardener-ci-robot
Copy link

The Gardener project currently lacks enough active contributors to adequately respond to all issues.
This bot triages issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Mark this issue as rotten with /lifecycle rotten
  • Close this issue with /close

/lifecycle stale

@gardener-prow gardener-prow bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 17, 2024
@gardener-ci-robot
Copy link

The Gardener project currently lacks enough active contributors to adequately respond to all issues.
This bot triages issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle rotten
  • Close this issue with /close

/lifecycle rotten

@gardener-prow gardener-prow bot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 16, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
kind/enhancement Enhancement, improvement, extension lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.
Projects
None yet
Development

No branches or pull requests

3 participants