[WIP][CORE][HISTORY] Allow AppStatus to be cached and reused by the history server. by ForVic · Pull Request #54878 · apache/spark

ForVic · 2026-03-18T01:20:37Z

What changes were proposed in this pull request?

When a Spark application completes, we write out the AppStatus (the materialized state generated by the AppStatusListener) in protobuf format. When that application is later loaded in the Spark History Server, the server loads this state directly as an optimization, instead of recomputing it with a ReplayListenerBus and AppStatusListener.
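As a rough illustration of the load path described above, the sketch below prefers a stored snapshot and falls back to replaying the event log. It is not the code in this PR: `AppStatus`, `load_app_status`, `read_snapshot`, and `replay_event_log` are hypothetical names, and falling back to replay when the snapshot is missing or unreadable is an assumption rather than something the description states.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class AppStatus:
    """Hypothetical stand-in for the materialized application state."""
    app_id: str
    completed_jobs: int


def load_app_status(
    app_id: str,
    read_snapshot: Callable[[str], Optional[AppStatus]],
    replay_event_log: Callable[[str], AppStatus],
) -> Tuple[AppStatus, str]:
    """Return the app status and the path used to obtain it.

    read_snapshot is expected to return None when no snapshot exists.
    Any error while reading is treated as "no usable snapshot"
    (an assumption made for this sketch).
    """
    try:
        snapshot = read_snapshot(app_id)
    except Exception:
        snapshot = None

    if snapshot is not None:
        # Fast path: reuse the state written when the application completed.
        return snapshot, "snapshot"

    # Slow path: rebuild the state by replaying the event log.
    return replay_event_log(app_id), "replay"
```

The caller gets the same `AppStatus` shape from either path, so the UI layer does not need to know which one was taken.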

Why are the changes needed?

The Spark History Server can be slow to load application status for jobs with large event logs, due to deserialization overhead. In loading them, the history server repeats exactly the work the driver already did during application runtime, when it served the live UI from the AppStatusListener's state in the KVStore (in-memory or RocksDB-backed).

Does this PR introduce any user-facing change?

Yes. It introduces a couple of user-facing configs to enable this change, and should make the history server UI load faster.

How was this patch tested?

Unit tests. The change has also been running internally, at scale, across multiple Spark History Server instances for multiple days.

Was this patch authored or co-authored using generative AI tooling?

Partially.
Generated-by: GPT-5.4

[HISTORY] Cache app status snapshots for the history server

f61717c

ForVic changed the title ~~[CORE][HISTORY] Allow AppStatus to be cached and reused by the history server.~~ [WIP][CORE][HISTORY] Allow AppStatus to be cached and reused by the history server. Mar 18, 2026

ForVic force-pushed the dev/victors/history-snapshot-oss branch 2 times, most recently from 1d0b1a7 to 5ed5e7e Compare March 18, 2026 02:05

[HISTORY] Harden history snapshot publication and restore

40fe298

ForVic force-pushed the dev/victors/history-snapshot-oss branch from 5ed5e7e to 40fe298 Compare March 18, 2026 06:27

ForVic wants to merge 2 commits into apache:master from ForVic:dev/victors/history-snapshot-oss
