[dbsp] Always GC storage at startup. by blp · Pull Request #5803 · feldera/feldera

blp · 2026-03-11T20:17:36Z

Until now, Feldera has only run GC at startup if a checkpoint file existed and was readable. This meant that if a pipeline crashed or force-stopped before the first checkpoint, then upon restart, the pipeline did not clear any files that were in storage from previous runs. This fixes the problem by unconditionally running GC at startup; if we can't read it now then there's no reason to believe that we will be able to read it later.

I tested this manually with a pipeline that writes to storage and force-stop.

Until now, Feldera has only run GC at startup if a checkpoint file existed and was readable. This meant that if a pipeline crashed or force-stopped before the first checkpoint, then upon restart, the pipeline did not clear any files that were in storage from previous runs. This fixes the problem by unconditionally running GC at startup; if we can't read it now then there's no reason to believe that we will be able to read it later. I tested this manually with a pipeline that writes to storage and force-stop. Signed-off-by: Ben Pfaff <blp@feldera.com>

mythical-fred

The fix makes sense -- if we can't read the checkpoint list, preserving orphaned files forever is wrong. But this changes startup behavior with no new test coverage.

mythical-fred · 2026-03-11T21:52:39Z

crates/dbsp/src/circuit/checkpointer.rs

@@ -70,15 +70,7 @@ impl Checkpointer {
    }


This changes the startup behavior: previously, if checkpoint_list was empty (no checkpoint or unreadable), storage files were preserved and only measured. Now gc_startup() runs unconditionally and will delete everything it doesn't recognize as belonging to a known checkpoint.

The fix is correct in intent, but it needs a test for the specific case this addresses: pipeline force-stopped before first checkpoint -> restart -> orphaned files are cleaned up. Without it there's no regression guard if gc_startup()'s handling of an empty checkpoint list ever changes.

blp requested a review from gz March 11, 2026 20:17

blp self-assigned this Mar 11, 2026

blp added bug Something isn't working storage Persistence for internal state in DBSP operators rust Pull requests that update Rust code labels Mar 11, 2026

ryzhyk approved these changes Mar 11, 2026

View reviewed changes

blp added this pull request to the merge queue Mar 11, 2026

mythical-fred suggested changes Mar 11, 2026

View reviewed changes

Merged via the queue into main with commit 30bcbaf Mar 11, 2026
1 check passed

blp deleted the always-gc branch March 11, 2026 22:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[dbsp] Always GC storage at startup.#5803

[dbsp] Always GC storage at startup.#5803
blp merged 1 commit intomainfrom
always-gc

blp commented Mar 11, 2026

Uh oh!

mythical-fred left a comment

Uh oh!

mythical-fred Mar 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

blp commented Mar 11, 2026

Uh oh!

mythical-fred left a comment

Choose a reason for hiding this comment

Uh oh!

mythical-fred Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants