Implement per-pixel linked list for OIT by beicause · Pull Request #21831 · bevyengine/bevy

beicause · 2025-11-14T02:22:56Z

Objective

The current OIT stores viewport-sized fragments per layer. It uses much more memory than it can be.

Solution

Implements per-pixel linked list for OIT, which saves memory and can handle more layers. The implementation references https://github.com/KhronosGroup/Vulkan-Samples/tree/main/samples/api/oit_linked_lists

Testing

Tested with the order_independent_transparency example. I also added a new scene in it.

Details

<= 256mb

IceSentry

This is awesome. Thank you so much for working on this. Sorry it took so long for me to review, I got sick in the same week you opened the PR and haven't had time to come back to it since.

This is very close to what I had in mind as a follow up to my original OIT impl so I'm really happy to see it in action.

I managed to review the PR because I'm very familiar with OIT but to make the diff a bit simpler to follow I would suggest adding the depth prepass support in a separate PR to the current OIT impl. This way the linked list PR will be a bit easier to follow since it won't be mixed with the depth prepass changes.

crates/bevy_core_pipeline/src/core_3d/main_transparent_pass_3d_node.rs

crates/bevy_core_pipeline/src/oit/oit_draw.wgsl

crates/bevy_core_pipeline/src/oit/mod.rs

It's head

crates/bevy_core_pipeline/src/oit/resolve/oit_resolve.wgsl

Add `reserve_internal` to `BufferVec` Add `capacity` `set_label` `get_label` to `UninitBufferVec` Use `Vec::reserve` to reduce some allocation

IceSentry · 2026-02-02T20:38:04Z

@goodartistscopy btw, we highly encourage community reviews. Since it seems like you already looked over all the changes feel free to leave a review/approve.

IceSentry · 2026-02-02T20:38:28Z

We need 2 reviews for a maintainer to look at it and merge it.

tychedelia

There appears to be a memory leak in the example on macOS, but that's on main so not blocking here. Looks great! Thanks for your work @beicause

crates/bevy_core_pipeline/src/oit/resolve/oit_resolve.wgsl

crates/bevy_core_pipeline/src/oit/oit_draw.wgsl

tychedelia · 2026-02-04T21:07:50Z

crates/bevy_core_pipeline/src/oit/resolve/oit_resolve.wgsl

-@group(0) @binding(2) var<storage, read_write> layer_ids: array<atomic<i32>>;
+@group(0) @binding(1) var<storage, read> nodes: array<OitFragmentNode>;
+@group(0) @binding(2) var<storage, read_write> heads: array<u32>; // No need to be atomic
+@group(0) @binding(3) var<storage, read_write> atomic_counter: u32; // No need to be atomic


Note for other reviewers, was curious if contention on this is ever a source of concern, but seems that drivers very well optimize this case and so it's preferred over more complicated optimizations.

Contention on atomic_counter ("the allocator") is highest at the draw/accumulation phase. In the resolve shader, each thread has it's own head (and list of nodes), so no contention. However all threads also reset atomic_counter non atomically and I wonder if its very pedantically UB.

This references https://github.com/KhronosGroup/Vulkan-Samples/blob/6a4d8b0552df04aad581c746533a03db95ca5012/shaders/oit_linked_lists/combine.frag#L145-L147

Aside from correctness, I wonder if it costs the full bandwidth too ? Like, would

let screen_index = u32(floor(in.position.x) + floor(in.position.y) * view.viewport.z); if screen_index == 0 { atomic_counter = 0u; }

be more efficient ?

beicause · 2026-02-05T07:13:07Z

There appears to be a memory leak in the example on macOS

I think it's due to OitBuffers is never released if OIT is disabled. It should be resolved.

goodartistscopy

atomic_counter is a non informative name (allocator or next_node would be better)
I'll put the clean up of the shader loops along with #22781 but would be better included here

IceSentry · 2026-02-05T20:58:10Z

I'll put the clean up of the shader loops

I'd prefer if this was in a self contained small PR. Large PRs always take a long time to merge just because it's harder to review but keeping things small makes the process faster.

Implement per-pixel linked list for OIT

61a1e0c

beicause force-pushed the oit-opt branch from a9d91e6 to 61a1e0c Compare November 14, 2025 02:26

IceSentry self-assigned this Nov 14, 2025

IceSentry self-requested a review November 14, 2025 04:32

IceSentry removed their assignment Nov 14, 2025

IceSentry added C-Feature A new feature, making something new possible A-Rendering Drawing game state to the screen labels Nov 14, 2025

github-project-automation bot added this to Rendering Nov 14, 2025

IceSentry added S-Needs-Review Needs reviewer attention (from anyone!) to move forward D-Shaders This code uses GPU shader languages labels Nov 14, 2025

beicause added 6 commits November 14, 2025 18:09

update

8a3c399

Fix

7ebd383

make use of depth prepass to filter out fragments

9d0dbc3

Fix corrupted linked list on startup

c414d2e

change OIT default value

395acf0

<= 256mb

Sort in desc order and fix early termination in blending

a2583f6

beicause force-pushed the oit-opt branch from 0d16699 to 0bc65e6 Compare November 26, 2025 15:29

Merge remote-tracking branch 'upstream' into oit-opt

cd963ea

beicause force-pushed the oit-opt branch from 78c03e3 to cd963ea Compare November 26, 2025 15:34

format

6398884

IceSentry approved these changes Dec 6, 2025

View reviewed changes

crates/bevy_core_pipeline/src/core_3d/main_transparent_pass_3d_node.rs Outdated Show resolved Hide resolved

crates/bevy_core_pipeline/src/oit/oit_draw.wgsl Show resolved Hide resolved

beicause added 2 commits December 6, 2025 13:19

fmt

a8028b5

Merge remote-tracking branch 'upstream' into oit-opt

ae57011

beicause mentioned this pull request Dec 17, 2025

Weighted blended OIT and unsorted transparent #21782

Closed

IceSentry added this to the 0.19 milestone Dec 29, 2025

goodartistscopy reviewed Jan 8, 2026

View reviewed changes

crates/bevy_core_pipeline/src/oit/mod.rs Outdated Show resolved Hide resolved

beicause added 2 commits January 9, 2026 11:44

Merge remote-tracking branch 'upstream' into oit-opt

1fbeaa7

Rename header -> head

2babad2

It's head

goodartistscopy reviewed Jan 9, 2026

View reviewed changes

crates/bevy_core_pipeline/src/oit/resolve/oit_resolve.wgsl Show resolved Hide resolved

Add some methods and optimizations in buffer vec

0290b64

Add `reserve_internal` to `BufferVec` Add `capacity` `set_label` `get_label` to `UninitBufferVec` Use `Vec::reserve` to reduce some allocation

alice-i-cecile added C-Performance A change motivated by improving speed, memory usage or compile times C-Refinement Improves output quality, without fixing a clear bug or adding new functionality. and removed C-Feature A new feature, making something new possible labels Feb 2, 2026

github-project-automation bot added this to Rendering (2026 Proposal) Feb 2, 2026

github-project-automation bot moved this to Needs SME Triage in Rendering (2026 Proposal) Feb 2, 2026

alice-i-cecile requested a review from pcwalton February 2, 2026 21:50

tychedelia approved these changes Feb 4, 2026

View reviewed changes

tychedelia added S-Ready-For-Final-Review This PR has been approved by the community. It's ready for a maintainer to consider merging it and removed S-Needs-Review Needs reviewer attention (from anyone!) to move forward labels Feb 4, 2026

beicause added 4 commits February 5, 2026 13:21

Merge remote-tracking branch 'upstream/main' into oit-opt

ab1e342

docs, rm unnecessary clone

e955608

Bind oit nodes buffer capacity

4c7c38e

Release oit buffers if no camera enables OIT

23feaa2

beicause added 2 commits February 5, 2026 16:05

Resize oit buffers instead of reserve when changed

2b3dd5f

CI

d455ee5

goodartistscopy approved these changes Feb 5, 2026

View reviewed changes

IceSentry mentioned this pull request Feb 5, 2026

Implement Premultiplied Alpha for OIT #22821

Open

alice-i-cecile added this pull request to the merge queue Feb 5, 2026

Merged via the queue into bevyengine:main with commit afe0e5d Feb 5, 2026
38 checks passed

github-project-automation bot moved this to Done in Rendering Feb 5, 2026

github-project-automation bot moved this from Needs SME Triage to Done in Rendering (2026 Proposal) Feb 5, 2026

Uh oh!

Conversation

beicause commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Objective

Solution

Testing

Uh oh!

IceSentry left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

IceSentry commented Feb 2, 2026

Uh oh!

IceSentry commented Feb 2, 2026

Uh oh!

tychedelia left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

tychedelia Feb 4, 2026

Choose a reason for hiding this comment

Uh oh!

goodartistscopy Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

beicause Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

goodartistscopy Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

beicause commented Feb 5, 2026

Uh oh!

goodartistscopy left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

IceSentry commented Feb 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

beicause commented Nov 14, 2025 •

edited

Loading

goodartistscopy Feb 5, 2026 •

edited

Loading

goodartistscopy left a comment •

edited

Loading