Skip to content

Conversation

@kacpersaw
Copy link
Contributor

Add a fallback step that installs helm via apt from Buildkite's repository if the primary azure/setup-helm action fails. This makes CI resilient to get.helm.sh outages while preserving the caching benefits of the action when it works.

Changes

  • Primary: azure/setup-helm with continue-on-error: true
  • Fallback: apt install from Buildkite's helm-debian repository (only runs if primary fails)

The fallback uses GPG-signed packages from the official Helm apt repository hosted by Buildkite.

Background

On Oct 29, 2025, get.helm.sh experienced an outage that broke CI. A workaround was applied (#20552) but later reverted. This PR makes the workaround automatic - it only kicks in when needed.

Fixes: coder/internal#1109

Add a fallback step that installs helm via apt from Buildkite's
repository if the primary azure/setup-helm action fails. This makes
CI resilient to get.helm.sh outages while preserving the caching
benefits of the action when it works.

The fallback uses GPG-signed packages from the official Helm apt
repository hosted by Buildkite, as recommended by the Helm docs.

Fixes: coder/internal#1109
Copy link
Contributor

@jdomeracki-coder jdomeracki-coder left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems reasonable

Copy link
Member

@johnstcn johnstcn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we anticipate the fallback to be more reliable than get.helm.sh? If so, should we just switch to that method entirely?

It might also be a good idea to run helm version or somehow specify the version of Helm we use to avoid flakes.

@kacpersaw
Copy link
Contributor Author

@johnstcn Neither source is inherently more reliable - get.helm.sh is official Helm infrastructure, while Buildkite's apt repo is community-maintained but recommended by Helm docs. The fallback approach provides redundancy - if either is down, the other is available. Switching entirely to apt would trade one single point of failure for another.

Added a helm version --short step to verify the correct version was installed.

@kacpersaw kacpersaw merged commit 134924d into main Dec 16, 2025
31 of 33 checks passed
@kacpersaw kacpersaw deleted the ci-workflow-cvp7 branch December 16, 2025 11:20
@github-actions github-actions bot locked and limited conversation to collaborators Dec 16, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

flake: Lint workflow - Helm download failure (get.helm.sh)

4 participants