Move benchmarks to daily cron by ludfjig · Pull Request #1302 · hyperlight-dev/hyperlight

ludfjig · 2026-03-11T19:34:30Z

Running per PR is slow (>30 min), and will just get slower and slower for every added benchmark. Furthermore, I don't think most PR authors look at the results anyway.

It now runs daily, and compares with the previous day's result. Unfortunately retention period is only 90 days (max), so maybe this is something to look into in anther PR (e.g. save results to different branch or soemthing)

Delete Benchmarks.yml and add its features (artifact upload, baseline_tag, baseline_run_id, retention_days inputs) to dep_benchmarks.yml. Update CreateRelease.yml to call dep_benchmarks.yml with a matrix directly. Signed-off-by: Ludvig Liljenberg <4257730+ludfjig@users.noreply.github.com>

Remove the benchmarks job from ValidatePullRequest.yml and add a new DailyBenchmarks.yml workflow that runs benchmarks daily, comparing against the previous day's run artifacts with 90-day retention. Signed-off-by: Ludvig Liljenberg <4257730+ludfjig@users.noreply.github.com>

Replace references to per-PR benchmarks and Benchmarks.yml with the new DailyBenchmarks.yml and dep_benchmarks.yml workflows. Signed-off-by: Ludvig Liljenberg <4257730+ludfjig@users.noreply.github.com>

simongdavies

LGTM, I fired off a Copilot review or backup

Copilot

Pull request overview

Moves benchmark execution out of PR validation and into a scheduled daily workflow, while reworking the benchmark workflow into a reusable component that can compare against either the prior day’s artifacts or a release baseline.

Changes:

Add DailyBenchmarks.yml scheduled workflow that performs day-over-day comparisons using prior run artifacts.
Update dep_benchmarks.yml to support baseline selection (previous run vs. release tag) and upload benchmark results as artifacts with configurable retention.
Remove benchmarks from ValidatePullRequest.yml, switch release benchmarking to the reusable workflow, and update docs/remove legacy workflow.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 9 comments.

Show a summary per file

File	Description
docs/benchmarking-hyperlight.md	Updates documentation to reflect daily scheduled benchmarks and revised release workflow usage.
.github/workflows/dep_benchmarks.yml	Expands reusable benchmark runner to support baseline download and artifact retention/upload.
.github/workflows/ValidatePullRequest.yml	Removes per-PR benchmark job from PR validation pipeline.
.github/workflows/DailyBenchmarks.yml	Adds daily scheduled benchmarking + baseline discovery + failure notification.
.github/workflows/CreateRelease.yml	Switches release benchmarking to use the reusable workflow with a matrix.
.github/workflows/Benchmarks.yml	Deletes the legacy benchmarks workflow.

.github/workflows/DailyBenchmarks.yml

.github/workflows/dep_benchmarks.yml

docs/benchmarking-hyperlight.md

Signed-off-by: Ludvig Liljenberg <4257730+ludfjig@users.noreply.github.com>

.github/workflows/DailyBenchmarks.yml

Signed-off-by: Ludvig Liljenberg <4257730+ludfjig@users.noreply.github.com>

ludfjig added 3 commits March 11, 2026 12:32

docs: update benchmarking docs to reflect daily cron workflow

800b359

Replace references to per-PR benchmarks and Benchmarks.yml with the new DailyBenchmarks.yml and dep_benchmarks.yml workflows. Signed-off-by: Ludvig Liljenberg <4257730+ludfjig@users.noreply.github.com>

ludfjig added the kind/refactor For PRs that restructure or remove code without adding new functionality. label Mar 11, 2026

ludfjig marked this pull request as ready for review March 11, 2026 19:45

ludfjig requested review from danbugs, dblnz, devigned, jprendes, marosset, simongdavies and syntactically as code owners March 11, 2026 19:45

simongdavies requested a review from Copilot March 11, 2026 21:31

simongdavies previously approved these changes Mar 11, 2026

View reviewed changes

Copilot AI reviewed Mar 11, 2026

View reviewed changes

Add permission and fix docs

0b4d72f

Signed-off-by: Ludvig Liljenberg <4257730+ludfjig@users.noreply.github.com>

ludfjig dismissed simongdavies’s stale review via 0b4d72f March 11, 2026 21:59

jsturtevant reviewed Mar 11, 2026

View reviewed changes

.github/workflows/DailyBenchmarks.yml Outdated Show resolved Hide resolved

jsturtevant previously approved these changes Mar 11, 2026

View reviewed changes

Update issue title

172329d

Signed-off-by: Ludvig Liljenberg <4257730+ludfjig@users.noreply.github.com>

ludfjig dismissed jsturtevant’s stale review via 172329d March 11, 2026 23:02

jsturtevant approved these changes Mar 11, 2026

View reviewed changes

jsturtevant enabled auto-merge (squash) March 11, 2026 23:09

jsturtevant merged commit 98a9030 into hyperlight-dev:main Mar 11, 2026
48 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move benchmarks to daily cron#1302

Move benchmarks to daily cron#1302
jsturtevant merged 5 commits intohyperlight-dev:mainfrom
ludfjig:move-benchmarks-to-daily-cron

ludfjig commented Mar 11, 2026 •

edited

Loading

Uh oh!

simongdavies left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ludfjig commented Mar 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simongdavies left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ludfjig commented Mar 11, 2026 •

edited

Loading