[cudax] Implement cudax::coop::reduce for warp groups within a block#9258
[cudax] Implement cudax::coop::reduce for warp groups within a block#9258davebayer wants to merge 2 commits into
cudax::coop::reduce for warp groups within a block#9258Conversation
|
No actionable comments were generated in the recent review. 🎉 ℹ️ Recent review info⚙️ Run configurationConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: 📒 Files selected for processing (1)
🚧 Files skipped from review as they are similar to previous changes (1)
SummaryThis PR implements Core Implementation Changes
Supporting Changes
Tests
API / ABI Surface Changes
Review Notes / Areas to Verify
important: WalkthroughAdds a warp-scoped two-stage cooperative reduce overload, adjusts group rank computation and barrier initialization to account for per-unit thread layout, and adds parametrized CUDA tests exercising reductions across multiple warps, types, operators, and item counts. ChangesCooperative reduce for warps within block
Assessment against linked issues
Possibly related PRs
Suggested reviewers
Comment |
There was a problem hiding this comment.
Actionable comments posted: 1
🧹 Nitpick comments (1)
cudax/test/coop/reduce/warps_within_block.cu (1)
152-153: ⚡ Quick winsuggestion: The test names and tags still say
this_warpon Lines 152 and 173, but this file/target iswarps_within_block. Rename theC2H_TESTnames/tags towarps_within_blockto keep filtering and failure triage unambiguous.Also applies to: 173-174
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: b9ab43e4-f31a-4ae2-94bb-fdad331fffa3
📒 Files selected for processing (5)
cudax/include/cuda/experimental/__coop/reduce.cuhcudax/include/cuda/experimental/__group/queries.cuhcudax/include/cuda/experimental/__group/synchronizer/barrier_synchronizer.cuhcudax/test/CMakeLists.txtcudax/test/coop/reduce/warps_within_block.cu
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
3aebfa2 to
05ebea1
Compare
😬 CI Workflow Results🟥 Finished in 32m 40s: Pass: 94%/55 | Total: 8h 19m | Max: 32m 40s | Hits: 67%/47318See results here. |
Fixes #9160.