CommandQueueMT: Reduce contention + Fix race conditions #112506

RandomShaper · 2025-11-07T11:41:56Z

See #112452 for an explanation of why this would be beneficial.

TL;DR Keeping the lock held over the command queue of the rendering server in separate thread mode is not needed because there's a single thread dealing with such server, Therefore, the lock is only needed for thread safety of the command queue itself, which means the lock can be released while commands are run, allowing other threads to add commands without waiting.

Not tested at all due to lack of time...

UPDATE: I've marked this PR as cherry-pickable into some releases, but probably the only commit that should be cherry-picked is the first one (CommandQueueMT: Fix race conditions), which fixes a clear bug. The rest of the PR is a performance improvement, and as such it's better scheduled only for the current dev branch.

servers/rendering/rendering_server_default.h

core/templates/command_queue_mt.h

brycehutchings · 2025-11-11T23:45:05Z

I did some basic testing of this fix using the Sponza scene (https://github.com/Calinou/godot-sponza). I confirmed it fixes the performance problem so that the performance loading a very complex glTF in a background thread takes the same time regardless of if using the Separate or Safe thread model. I also compared it to the performance without the fix for loading with Safe thread model and it was comparable, so I didn't see a degradation due to the memcpy.

RandomShaper · 2025-11-12T12:30:02Z

@brycehutchings Thanks a lot for testing and reviewing this.

RandomShaper · 2025-11-21T10:11:44Z

Rebased.

dsnopek · 2025-11-21T16:21:26Z

I noticed the issue with the lock being held while the RenderingServer is flushing the queue (and stalling everything that's queuing stuff for the next frame) while looking at Perfetto traces on Samsung Galaxy XR.

This PR seems to solve that problem entirely!

Here's a trace from master running my fork of the GDQuest TPS demo:

The set_render_display_info is the first thing on this frame that is attempting to queue a Callable for the rendering server, and it ends up stalled there until the render thread finishes rendering the previous frame. You can see all the process stuff waiting until the end.

(NOTE: this won't show up in traces of this project on Meta Quest 3, because Meta's runtime uses xrWaitFrame() to delay running process until close to the end of the frame anyway to improve input latency. However, it's still possible to trigger the issue there too, just not in this project and not as drastically.)

And here is a trace from this PR:

Notice that the process stuff happens super early now and overlaps the rendering, and it's only the RenderingServer::sync() that stalls until the rendering server is done, which is exactly what we would expect.

(NOTE: this trace is actually worse for input latency, but I think that's really a problem with xrWaitFrame() on Samsung Galaxy XR. Inadvertent lock contention on Godot's RenderingServer shouldn't be used as the solution for that :-))

dsnopek

The code looks good to me - after I noticed the issue in Perfetto, but before I found this PR, I was thinking about trying to implement this same change (ie copying the queue in _flush()). However, I'm not all that familiar with this code, so I'm glad that @RandomShaper did it :-)

I can say that the result seems correct from my testing!

akien-mga

Based on Bryce and David's review and testing, and Pedro's familiarity with this code, I think we can safely¹ merge this.

TIWAGOS, this code is touchy and regression prone. Keep it in mind when reviewing potential regression reports during the 4.6 dev/beta phase. ↩

Repiteo · 2025-11-21T20:52:50Z

Thanks!

RandomShaper added this to the 4.6 milestone Nov 7, 2025

RandomShaper requested review from a team as code owners November 7, 2025 11:41

RandomShaper added topic:core topic:rendering needs testing performance cherrypick:4.4 Considered for cherry-picking into a future 4.4.x release cherrypick:4.5 Considered for cherry-picking into a future 4.5.x release labels Nov 7, 2025

RandomShaper mentioned this pull request Nov 7, 2025

Loading gltf on background thread very slow when using "Separate" rendering thread model due to lock contention #112452

Open

akien-mga reviewed Nov 7, 2025

View reviewed changes

servers/rendering/rendering_server_default.h Outdated Show resolved Hide resolved

RandomShaper force-pushed the less_locky_cmd_queue branch 2 times, most recently from 352150c to bf84697 Compare November 7, 2025 13:36

brycehutchings reviewed Nov 7, 2025

View reviewed changes

core/templates/command_queue_mt.h Outdated Show resolved Hide resolved

RandomShaper force-pushed the less_locky_cmd_queue branch from bf84697 to bd615b4 Compare November 11, 2025 12:07

RandomShaper changed the title ~~CommandQueueMT: Release the lock during commands for single-flusher usages~~ CommandQueueMT: Reduce contention + Fix race conditions Nov 11, 2025

AThousandShips added bug enhancement labels Nov 11, 2025

brycehutchings reviewed Nov 11, 2025

View reviewed changes

core/templates/command_queue_mt.h Show resolved Hide resolved

brycehutchings reviewed Nov 11, 2025

View reviewed changes

core/templates/command_queue_mt.h Outdated Show resolved Hide resolved

brycehutchings reviewed Nov 11, 2025

View reviewed changes

core/templates/command_queue_mt.h Outdated Show resolved Hide resolved

RandomShaper force-pushed the less_locky_cmd_queue branch from bd615b4 to fa61594 Compare November 12, 2025 12:09

RandomShaper force-pushed the less_locky_cmd_queue branch from fa61594 to 3e1dcfe Compare November 14, 2025 08:05

RandomShaper added 2 commits November 21, 2025 11:09

CommandQueueMT: Fix race conditions

b16a8b8

CommandQueueMT: Reduce lock contention in cases of single flusher

4ba4558

RandomShaper force-pushed the less_locky_cmd_queue branch from 3e1dcfe to 4ba4558 Compare November 21, 2025 10:11

dsnopek approved these changes Nov 21, 2025

View reviewed changes

dsnopek mentioned this pull request Nov 21, 2025

OpenXR: Add profiling macro for process, xrWaitFrame() and acquiring swapchain #112893

Merged

akien-mga approved these changes Nov 21, 2025

View reviewed changes

Repiteo merged commit 0e182ee into godotengine:master Nov 21, 2025
20 checks passed

AThousandShips removed the needs testing label Nov 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

CommandQueueMT: Reduce contention + Fix race conditions #112506

CommandQueueMT: Reduce contention + Fix race conditions #112506

Uh oh!

RandomShaper commented Nov 7, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

brycehutchings commented Nov 11, 2025

Uh oh!

RandomShaper commented Nov 12, 2025

Uh oh!

RandomShaper commented Nov 21, 2025

Uh oh!

dsnopek commented Nov 21, 2025 •

edited

Loading

Uh oh!

dsnopek left a comment

Uh oh!

akien-mga left a comment •

edited

Loading

Uh oh!

Uh oh!

Repiteo commented Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Uh oh!

CommandQueueMT: Reduce contention + Fix race conditions #112506

CommandQueueMT: Reduce contention + Fix race conditions #112506

Uh oh!

Conversation

RandomShaper commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

brycehutchings commented Nov 11, 2025

Uh oh!

RandomShaper commented Nov 12, 2025

Uh oh!

RandomShaper commented Nov 21, 2025

Uh oh!

dsnopek commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dsnopek left a comment

Choose a reason for hiding this comment

Uh oh!

akien-mga left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Footnotes

Uh oh!

Uh oh!

Repiteo commented Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

RandomShaper commented Nov 7, 2025 •

edited

Loading

dsnopek commented Nov 21, 2025 •

edited

Loading

akien-mga left a comment •

edited

Loading