Migrate from the Redpanda Helm chart

If you are using the Redpanda Helm chart, you can migrate to the Redpanda Operator and use it to manage your Helm deployment.

The Redpanda Operator extends Kubernetes with custom resource definitions (CRDs), which allow Redpanda clusters to be treated as native Kubernetes resources. The primary resource that the Redpanda Operator uses to represent a Redpanda cluster is the Redpanda resource.

Here is an example of a Redpanda custom resource:

apiVersion: cluster.redpanda.com/v1alpha2
kind: Redpanda
metadata:
  name: <cluster-name>
spec:
  chartRef:
    chartVersion:
  clusterSpec:

metadata.name: Name to assign the Redpanda cluster. This name is also assigned to the Helm release.
spec.chartRef: Information about the Helm chart that will be used to deploy Redpanda.
spec.chartRef.chartversion: The version of the Helm chart values that the Redpanda CRD is compatible with.
spec.clusterSpec: This is where you can configure the Redpanda CRD with your values overrides from the Redpanda Helm chart.

Supported migration paths

The following table summarizes which Helm chart versions you can migrate from and which Redpanda Operator versions to install.

Helm Chart Version Operator Version Notes

Helm Chart Version	Operator Version	Notes
`<5.9.x`	`-`	You must first `helm upgrade` your Redpanda cluster to at least version 5.9.x before installing the Redpanda Operator. Migrating directly from 5.8.x or below is not supported.
`5.9.x or 5.10.x`	`v2.4.x`	After installing or upgrading to Helm chart 5.9.x or 5.10.x, you can install the Redpanda Operator v2.4.x.

<5.9.x

-

You must first helm upgrade your Redpanda cluster to at least version 5.9.x before installing the Redpanda Operator. Migrating directly from 5.8.x or below is not supported.

5.9.x or 5.10.x

v2.4.x

After installing or upgrading to Helm chart 5.9.x or 5.10.x, you can install the Redpanda Operator v2.4.x.

Prerequisites

Before migrating to the Redpanda Operator, you must have:

The name of your existing Helm release and the latest version of the Redpanda Helm chart that you have deployed.
```
helm list -A
```
In this example the chart version is 5.9.1 and the release name is redpanda.
```
NAME       CHART
redpanda   redpanda-5.9.1
```
Make a note of your name and version for the next steps. You’ll need to configure your Redpanda custom resource with these details.
Your values overrides.
```
helm get values <cluster-name> --namespace <namespace>
```
You should see your overrides in YAML format. You’ll need to configure your Redpanda custom resource with these details.

Before implementing any changes in your production environment, Redpanda Data recommends testing the migration in a non-production environment.

Migrate to the Redpanda Operator and Helm

To migrate to the latest Redpanda Operator and use it to manage your Helm deployment, follow these steps.

Make sure that you have permission to install custom resource definitions (CRDs):
```
kubectl auth can-i create CustomResourceDefinition --all-namespaces
```
You should see yes in the output.

You need these cluster-level permissions to install the Redpanda Operator CRDs in the next steps.

Install the Redpanda Operator custom resource definitions (CRDs):

kubectl kustomize "https://github.com/redpanda-data/redpanda-operator//operator/config/crd?ref=v2.4.2" \
    | kubectl apply --server-side -f -

Install the Redpanda Operator in the same namespace as your Redpanda Helm chart:

helm repo add redpanda https://charts.redpanda.com
helm repo update
helm upgrade --install redpanda-controller redpanda/operator \
  --namespace <namespace> \
  --set image.tag=v2.4.2 \
  --create-namespace

Ensure that the Deployment is successfully rolled out:

kubectl --namespace <namespace> rollout status -w deployment/redpanda-controller-operator

deployment "redpanda-controller" successfully rolled out

Configure a Redpanda custom resource that Redpanda Operator will use to adopt your Redpanda cluster.

Replace the placeholders with the values identified in the Prerequisites.

redpanda-cluster.yaml

apiVersion: cluster.redpanda.com/v1alpha2
kind: Redpanda
metadata:
  annotations:
    cluster.redpanda.com/managed: "true"
  creationTimestamp: null
  name: <cluster-name> (1)
spec:
  chartRef:
    chartVersion: <chart-version> (2)
  clusterSpec:
    <chart-overrides> (3)

1 Replace with your Helm release name.

Replace with your chart version.

Choose a chartVersion that the current Operator’s CRDs support. For example, 5.9.x or 5.10.x if you’re using Operator v2.4.x. See Kubernetes Compatibility.
If your existing Helm deployment is on version 5.8.x or below, you must first upgrade the chart using Helm before creating the Redpanda resource.

Replace with your chart overrides.

The Redpanda CRD is compatible with the version of the Helm chart defined in the operator. For details on the structure and configuration options of the Redpanda custom resource, refer to the Redpanda Operator CRD reference.

Adopt the Redpanda cluster by creating an instance of the Redpanda custom resource in the same namespace as the Redpanda Operator:
```
kubectl apply -f redpanda-cluster.yaml --namespace <namespace>
```

Wait for the Redpanda resource to successfully reach a deployed state:

kubectl get redpanda --namespace <namespace> --watch

Example output:

NAME       READY   STATUS
redpanda   True    Redpanda reconciliation succeeded

Roll back from Redpanda Operator to Helm

If you migrated to the Redpanda Operator and want to revert to using only Helm, follow these steps:

Uninstall or disable the Redpanda Operator.

You can uninstall the Redpanda Operator using Helm or disable it by changing the image to one that does not exist:
```
kubectl edit pod <operator-name> --namespace <namespace>
```

Delete the resources:

kubectl delete redpanda <cluster-name> --namespace <namespace>
kubectl delete helmrelease <cluster-name> --namespace <namespace>
kubectl delete helmchart <cluster-name> --namespace <namespace>
kubectl delete helmrepository <cluster-name> --namespace <namespace>

After completing these steps, the Redpanda Operator is no longer managing your Helm deployment.

Troubleshooting

While the deployment process can sometimes take a few minutes, a prolonged 'not ready' status may indicate an issue.

HelmRelease is not ready

If you are using the Redpanda Operator, you may see the following message while waiting for a Redpanda custom resource to be deployed:

NAME       READY   STATUS
redpanda   False   HelmRepository 'redpanda/redpanda-repository' is not ready
redpanda   False   HelmRelease 'redpanda/redpanda' is not ready

While the deployment process can sometimes take a few minutes, a prolonged 'not ready' status may indicate an issue. Follow the steps below to investigate:

Check the status of the HelmRelease:

kubectl describe helmrelease <redpanda-resource-name> --namespace <namespace>

Review the Redpanda Operator logs:

kubectl logs -l app.kubernetes.io/name=operator -c manager --namespace <namespace>

HelmRelease retries exhausted

If you are running the operator in Flux-managed mode (chartRef.useFlux: true), the HelmRelease retries exhausted error may occur when the Helm Controller has tried to reconcile the HelmRelease a number of times, but these attempts have failed consistently.

The Helm Controller watches for changes in HelmRelease objects. When changes are detected, it tries to reconcile the state defined in the HelmRelease with the state in the cluster. The process of reconciliation includes installation, upgrade, testing, rollback or uninstallation of Helm releases.

You may see this error due to:

Incorrect configuration in the HelmRelease.
Issues with the chart, such as a non-existent chart version or the chart repository not being accessible.
Missing dependencies or prerequisites required by the chart.
Issues with the underlying Kubernetes cluster, such as insufficient resources or connectivity issues.

To debug this error do the following:

Check the status of the HelmRelease:

kubectl describe helmrelease <cluster-name> --namespace <namespace>

Review the Redpanda Operator logs:

kubectl logs -l app.kubernetes.io/name=operator -c manager --namespace <namespace>

When you find and fix the error, you must use the Flux CLI, fluxctl, to suspend and resume the reconciliation process:

Install Flux CLI.

Suspend the HelmRelease:

flux suspend helmrelease <cluster-name> --namespace <namespace>

Resume the HelmRelease:

flux resume helmrelease <cluster-name> --namespace <namespace>

StatefulSet never rolls out

If the StatefulSet Pods remain in a pending state, they are waiting for resources to become available.

To identify the Pods that are pending, use the following command:

kubectl get pod --namespace <namespace>

The response includes a list of Pods in the StatefulSet and their status.

To view logs for a specific Pod, use the following command.

kubectl logs -f <pod-name> --namespace <namespace>

You can use the output to debug your deployment.

Didn’t match pod anti-affinity rules

If you see this error, your cluster does not have enough nodes to satisfy the anti-affinity rules:

Warning  FailedScheduling  18m  default-scheduler  0/1 nodes are available: 1 node(s) didn't match pod anti-affinity rules. preemption: 0/1 nodes are available: 1 No preemption victims found for incoming pod.

The Helm chart configures default podAntiAffinity rules to make sure that only one Pod running a Redpanda broker is scheduled on each worker node. To learn why, see Number of workers.

To resolve this issue, do one of the following:

Create additional worker nodes.

Modify the anti-affinity rules (for development purposes only).

If adding nodes is not an option, you can modify the podAntiAffinity rules in your StatefulSet to be less strict.

Operator
Helm

redpanda-cluster.yaml

apiVersion: cluster.redpanda.com/v1alpha2
kind: Redpanda
metadata:
  name: redpanda
spec:
  chartRef: {}
  clusterSpec:
    statefulset:
      podAntiAffinity:
        type: soft

kubectl apply -f redpanda-cluster.yaml --namespace <namespace>

--values
--set

docker-repo.yaml

statefulset:
  podAntiAffinity:
    type: soft

helm upgrade --install redpanda redpanda/redpanda --namespace <namespace> --create-namespace \
  --values docker-repo.yaml --reuse-values

helm upgrade --install redpanda redpanda/redpanda --namespace <namespace> --create-namespace \
  --set statefulset.podAntiAffinity.type=soft

Unable to mount volume

If you see volume mounting errors in the Pod events or in the Redpanda logs, ensure that each of your Pods has a volume available in which to store data.

If you’re using StorageClasses with dynamic provisioners (default), ensure they exist:
```
kubectl get storageclass
```
If you’re using PersistentVolumes, ensure that you have one PersistentVolume available for each Redpanda broker, and that each one has the storage capacity that’s set in storage.persistentVolume.size:
```
kubectl get persistentvolume --namespace <namespace>
```

To learn how to configure different storage volumes, see Configure Storage.

Failed to pull image

When deploying the Redpanda Helm chart, you may encounter Docker rate limit issues because the default registry URL is not recognized as a Docker Hub URL. The domain docker.redpanda.com is used for statistical purposes, such as tracking the number of downloads. It mirrors Docker Hub’s content while providing specific analytics for Redpanda.

Failed to pull image "docker.redpanda.com/redpandadata/redpanda:v<version>": rpc error: code = Unknown desc = failed to pull and unpack image "docker.redpanda.com/redpandadata/redpanda:v<version>": failed to copy: httpReadSeeker: failed open: unexpected status code 429 Too Many Requests - Server message: toomanyrequests: You have reached your pull rate limit. You may increase the limit by authenticating and upgrading: https://www.docker.com/increase-rate-limit

To fix this error, do one of the following:

Replace the image.repository value in the Helm chart with docker.io/redpandadata/redpanda. Switching to Docker Hub avoids the rate limit issues associated with docker.redpanda.com.

Operator
Helm

redpanda-cluster.yaml

apiVersion: cluster.redpanda.com/v1alpha2
kind: Redpanda
metadata:
  name: redpanda
spec:
  chartRef: {}
  clusterSpec:
    image:
      repository: docker.io/redpandadata/redpanda

kubectl apply -f redpanda-cluster.yaml --namespace <namespace>

--values
--set

docker-repo.yaml

image:
  repository: docker.io/redpandadata/redpanda

helm upgrade --install redpanda redpanda/redpanda --namespace <namespace> --create-namespace \
  --values docker-repo.yaml --reuse-values

helm upgrade --install redpanda redpanda/redpanda --namespace <namespace> --create-namespace \
  --set image.repository=docker.io/redpandadata/redpanda

Authenticate to Docker Hub by logging in with your Docker Hub credentials. The docker.redpanda.com site acts as a reflector for Docker Hub. As a result, when you log in with your Docker Hub credentials, you will bypass the rate limit issues.

Dig not defined

This error means that you are using an unsupported version of Helm:

Error: parse error at (redpanda/templates/statefulset.yaml:203): function "dig" not defined

To fix this error, ensure that you are using the minimum required version: 3.10.0.

helm version

Repository name already exists

If you see this error, remove the redpanda chart repository, then try installing it again.

helm repo remove redpanda
helm repo add redpanda https://charts.redpanda.com
helm repo update

Fatal error during checker "Data directory is writable" execution

This error appears when Redpanda does not have write access to your configured storage volume under storage in the Helm chart.

Error: fatal error during checker "Data directory is writable" execution: open /var/lib/redpanda/data/test_file: permission denied

To fix this error, set statefulset.initContainers.setDataDirOwnership.enabled to true so that the initContainer can set the correct permissions on the data directories.

Cannot patch "redpanda" with kind StatefulSet

This error appears when you run helm upgrade with the --values flag but do not include all your previous overrides.

Error: UPGRADE FAILED: cannot patch "redpanda" with kind StatefulSet: StatefulSet.apps "redpanda" is invalid: spec: Forbidden: updates to statefulset spec for fields other than 'replicas', 'template', 'updateStrategy', 'persistentVolumeClaimRetentionPolicy' and 'minReadySeconds' are forbidden

To fix this error, do one of the following:

Include all the value overrides from the previous installation or upgrade using either the --set or the --values flags.
Use the --reuse-values flag.

Do not use the --reuse-values flag to upgrade from one version of the Helm chart to another. This flag stops Helm from using any new values in the upgraded chart.

Cannot patch "redpanda-console" with kind Deployment

This error appears if you try to upgrade your deployment and you already have console.enabled set to true.

Error: UPGRADE FAILED: cannot patch "redpanda-console" with kind Deployment: Deployment.apps "redpanda-console" is invalid: spec.selector: Invalid value: v1.LabelSelector{MatchLabels:map[string]string{"app.kubernetes.io/instance":"redpanda", "app.kubernetes.io/name":"console"}, MatchExpressions:[]v1.LabelSelectorRequirement(nil)}: field is immutable

To fix this error, set console.enabled to false so that Helm doesn’t try to deploy Redpanda Console again.

Helm is in a pending-rollback state

An interrupted Helm upgrade process can leave your Helm release in a pending-rollback state. This state prevents further actions like upgrades, rollbacks, or deletions through standard Helm commands. To fix this:

Identify the Helm release that’s in a pending-rollback state:
```
helm list --namespace <namespace> --all
```
Look for releases with a status of pending-rollback. These are the ones that need intervention.
Verify the Secret’s status to avoid affecting the wrong resource:
```
kubectl --namespace <namespace> get secret --show-labels
```
Identify the Secret associated with your Helm release by its pending-rollback status in the labels.

Ensure you have correctly identified the Secret to avoid unintended consequences. Deleting the wrong Secret could impact other deployments or services.

Delete the Secret to clear the pending-rollback state:

kubectl --namespace <namespace> delete secret -l status=pending-rollback

After clearing the pending-rollback state:

Retry the upgrade: Restart the upgrade process. You should investigate the initial failure to avoid getting into the pending-rollback state again.
Perform a rollback: If you need to roll back to a previous release, use helm rollback <release-name> <revision> to revert to a specific, stable release version.

Crash loop backoffs

If a broker crashes after startup, or gets stuck in a crash loop, it can accumulate an increasing amount of stored state. This accumulated state not only consumes additional disk space but also prolongs the time required for each subsequent restart to process it.

To prevent infinite crash loops, the Redpanda Helm chart sets the crash_loop_limit broker configuration property to 5. The crash loop limit is the number of consecutive crashes that can happen within one hour of each other. By default, the broker terminates immediately after hitting the crash_loop_limit. The Pod running Redpanda remains in a CrashLoopBackoff state until its internal consecutive crash counter is reset to zero.

To facilitate debugging in environments where a broker is stuck in a crash loop, you can also set the crash_loop_sleep_sec broker configuration property. This setting determines how long the broker sleeps before terminating the process after reaching the crash loop limit. By providing a window during which the Pod remains available, you can SSH into it and troubleshoot the issue.

Example configuration:

config:
  node:
    crash_loop_limit: 5
    crash_loop_sleep_sec: 60

In this example, when the broker hits the crash_loop_limit of 5, it will sleep for 60 seconds before terminating the process. This delay allows administrators to access the Pod and troubleshoot.

To troubleshoot a crash loop backoff:

Check the Redpanda logs from the most recent crashes:

kubectl logs <pod-name> --namespace <namespace>

Kubernetes retains logs only for the current and the previous instance of a container. This limitation makes it difficult to access logs from earlier crashes, which may contain vital clues about the root cause of the issue. Given these log retention limitations, setting up a centralized logging system is crucial. Systems such as Loki or Datadog can capture and store logs from all containers, ensuring you have access to historical data.

Resolve the issue that led to the crash loop backoff.

Reset the crash counter to zero to allow Redpanda to restart. You can do any of the following to reset the counter:

Make changes to any of the following sections in the Redpanda Helm chart to trigger an update:
- config.node
- config.tunable
For example:
```
config:
  node:
    crash_loop_limit: <new-integer>
```

Delete the startup_log file in the broker’s data directory.

kubectl exec <pod-name> --namespace <namespace> -- rm /var/lib/redpanda/data/startup_log

It might be challenging to execute this command within a Pod that is in a CrashLoopBackoff state due to the limited time during which the Pod is available before it restarts. Wrapping the command in a loop might work.

Wait one hour since the last crash. The crash counter resets after one hour.

To avoid future crash loop backoffs and manage the accumulation of small segments effectively:

Monitor the size and number of segments regularly.
Optimize your Redpanda configuration for segment management.
Consider implementing Tiered Storage to manage data more efficiently.

A Redpanda Enterprise Edition license is required

During a Redpanda upgrade, if enterprise features are enabled and a valid Enterprise Edition license is missing, Redpanda logs a warning and aborts the upgrade process on the first broker. This issue prevents a successful upgrade.

A Redpanda Enterprise Edition license is required to use the currently enabled features. To apply your license, downgrade this broker to the pre-upgrade version and provide a valid license key via rpk using 'rpk cluster license set <key>', or via Redpanda Console. To request an enterprise license, please visit <redpanda.com/upgrade>. To try Redpanda Enterprise for 30 days, visit <redpanda.com/try-enterprise>. For more information, see <https://docs.redpanda.com/current/get-started/licenses>.

If you encounter this message, follow these steps to recover:

Roll back the affected broker to the original version.
Do one of the following:
- Apply a valid Redpanda Enterprise Edition license to the cluster.
- Disable enterprise features.
  
  If you do not have a valid license and want to proceed without using enterprise features, you can disable the enterprise features in your Redpanda configuration.
Retry the upgrade.

For more troubleshooting steps, see Troubleshoot Redpanda in Kubernetes.

Open an issue

If you cannot solve the issue or need assistance during the migration process, open a GitHub issue. Before opening a new issue, search the existing issues on GitHub to see if someone has already reported a similar problem or if any relevant discussions can help you.

Next steps

For information about the latest Redpanda Operator and the new Redpanda custom resource, see Redpanda in Kubernetes.

Was this helpful?

group Ask in the community

mail Share your feedback

group_add Make a contribution

What do you like about this doc?

Let us know what we do well:

Let us contact you about your feedback:

What did you not like about this doc?

Let us know what we can improve:

Let us contact you about your feedback:

Migrate from the Redpanda Helm chart

Supported migration paths

Prerequisites

Migrate to the Redpanda Operator and Helm

Roll back from Redpanda Operator to Helm

Troubleshooting

HelmRelease is not ready

HelmRelease retries exhausted

StatefulSet never rolls out

Didn’t match pod anti-affinity rules

Unable to mount volume

Failed to pull image

Dig not defined

Repository name already exists

Fatal error during checker "Data directory is writable" execution

Cannot patch "redpanda" with kind StatefulSet

Cannot patch "redpanda-console" with kind Deployment

Helm is in a pending-rollback state

Crash loop backoffs

A Redpanda Enterprise Edition license is required

Open an issue

Next steps

Simple online edits

Contribution guide