Parallel serde by blp · Pull Request #5774 · feldera/feldera

blp · 2026-03-06T22:56:03Z

For my test cases, this raises performance for a 3-minute test run from 81M records to 85M records.

I don't know why `data` was passed as `&mut` of an iterator. It's easier to just pass an iterator directly. Also, simplify the code inside `try_send_all()` a bit, and actually enforce the constraint mentioned in the `Panics` section of its doc comment.

Signed-off-by: Ben Pfaff <blp@feldera.com>
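As a rough illustration of the point about iterator arguments: because `&mut I` itself implements `Iterator` whenever `I` does, a function that takes an iterator by value still accepts `iter.by_ref()` from callers that want to keep using the iterator afterwards. The sketch below is not the code from this PR. The function name `send_all`, the mailbox representation, and the specific constraint (exactly one item per mailbox) are all assumptions chosen to show a documented `Panics` condition being enforced.

```rust
/// Hypothetical stand-in for a `try_send_all`-style function.
///
/// Takes `data` by value rather than as `&mut impl Iterator`.
///
/// # Panics
///
/// Panics if `data` does not yield exactly one item per mailbox.
/// (This constraint is assumed for illustration only.)
fn send_all<T>(mailboxes: &mut [Option<T>], data: impl Iterator<Item = T>) {
    let mut sent = 0;
    for item in data {
        // Enforce the upper bound as soon as it is violated.
        assert!(
            sent < mailboxes.len(),
            "`data` yielded more items than there are mailboxes"
        );
        mailboxes[sent] = Some(item);
        sent += 1;
    }
    // Enforce the lower bound once the iterator is exhausted.
    assert_eq!(
        sent,
        mailboxes.len(),
        "`data` yielded fewer items than there are mailboxes"
    );
}
```

A caller that owns its iterator can pass it directly, and a caller that needs the remaining items can pass `iter.by_ref().take(n)` instead.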

mythical-fred

Nice. Parallelizing serde across workers is exactly the right fix — the background task bottleneck was the obvious culprit. 5% throughput gain for free.

mythical-fred · 2026-03-06T23:14:35Z

crates/dbsp/src/operator/communication/exchange.rs

                                    .take()
+                                    .unwrap()
+                                    .into_serialized()
                                    .unwrap();


The bare `.unwrap()` on `into_serialized()` relies on the invariant that mailboxes for remote receivers always contain `Mailbox::Serialized`. Worth making that explicit: `.expect("remote mailboxes should always be serialized")` or a brief `// SAFETY:` comment.
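A minimal sketch of that suggestion follows. Only the names `Mailbox::Serialized` and `into_serialized` come from the review. The `Plain` variant, the payload types, the `Option` return type, and the helper `take_remote_payload` are assumptions made so the example compiles on its own, and the real types in `exchange.rs` may look different.

```rust
/// Hypothetical stand-in for the mailbox type discussed in the review.
#[derive(Debug)]
enum Mailbox<T> {
    /// Data for a worker on the same host, kept in memory as-is (assumed variant).
    Plain(T),
    /// Data for a remote host, already encoded as bytes.
    Serialized(Vec<u8>),
}

impl<T> Mailbox<T> {
    /// Returns the encoded bytes, or `None` if the mailbox holds plain data.
    fn into_serialized(self) -> Option<Vec<u8>> {
        match self {
            Mailbox::Serialized(bytes) => Some(bytes),
            Mailbox::Plain(_) => None,
        }
    }
}

/// Empties a remote mailbox slot, stating each invariant in the panic
/// message instead of relying on a bare `unwrap()`.
fn take_remote_payload<T>(slot: &mut Option<Mailbox<T>>) -> Vec<u8> {
    slot.take()
        .expect("mailbox should be filled before it is sent")
        .into_serialized()
        .expect("remote mailboxes should always be serialized")
}
```

If the invariant is ever broken, the panic message then names the violated assumption rather than reporting only that `unwrap()` was called on a `None` value.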

Until now, when an exchange operator needed to serialize data for transmission to another host, all of the serialization for all of the transmitted data happened in one place, serially, in the task that did the transmission. This commit changes it so that each worker instead serializes its own data that needs to be sent to another host. This should reduce the latency of serialization by parallelizing it across the sending workers.

Similarly, until now, when a host received serialized data from an exchange with another host, it deserialized all of it together in the task that received it. This commit changes it so that each worker deserializes the data that it receives, which should reduce the latency of deserialization by parallelizing it across the receiving workers.

Signed-off-by: Ben Pfaff <blp@feldera.com>
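The shape of this change can be sketched with standard-library threads. This is not Feldera's implementation. It assumes each worker's batch is a `Vec<u64>`, uses a toy little-endian encoding in place of the real serializer, and models workers as scoped threads. All function names here are hypothetical.

```rust
use std::thread;

/// Toy encoder: each record becomes 8 little-endian bytes (assumed format).
fn serialize(records: &[u64]) -> Vec<u8> {
    records.iter().flat_map(|r| r.to_le_bytes()).collect()
}

/// Toy decoder matching `serialize`.
fn deserialize(bytes: &[u8]) -> Vec<u64> {
    bytes
        .chunks_exact(8)
        .map(|chunk| {
            let mut buf = [0u8; 8];
            buf.copy_from_slice(chunk);
            u64::from_le_bytes(buf)
        })
        .collect()
}

/// Before: the transmitting task encodes every worker's batch, one after another.
fn serialize_in_sender(batches: &[Vec<u64>]) -> Vec<Vec<u8>> {
    batches.iter().map(|batch| serialize(batch)).collect()
}

/// After: each worker encodes its own batch concurrently, and the
/// transmitting task only collects the finished byte buffers.
fn serialize_in_workers(batches: &[Vec<u64>]) -> Vec<Vec<u8>> {
    thread::scope(|scope| {
        let handles: Vec<_> = batches
            .iter()
            .map(|batch| scope.spawn(move || serialize(batch)))
            .collect();
        handles
            .into_iter()
            .map(|handle| handle.join().expect("worker thread panicked"))
            .collect()
    })
}
```

Both functions produce the same bytes. The difference is only where the encoding work runs, which is what moves the cost off the single transmitting task. The receiving side is symmetric: each worker would call `deserialize` on the buffer addressed to it.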

blp requested a review from ryzhyk March 6, 2026 22:56

blp self-assigned this Mar 6, 2026

blp added the DBSP core (Related to the core DBSP library), ft (Fault tolerant, distributed, and scale-out implementation), performance, rust (Pull requests that update Rust code), and multihost (Related to multihost or distributed pipelines) labels Mar 6, 2026

mythical-fred approved these changes Mar 6, 2026

blp force-pushed the parallel-serde branch from a147dec to 0d3890a March 6, 2026 23:20

ryzhyk approved these changes Mar 11, 2026

blp added this pull request to the merge queue Mar 11, 2026

Merged via the queue into main with commit 6dcefce Mar 11, 2026
1 check passed

blp deleted the parallel-serde branch March 11, 2026 18:01

blp merged 2 commits into main from parallel-serde
