Make apply more memory-friendly for CUDA by VidithM · Pull Request #407 · DrTimothyAldenDavis/GraphBLAS

VidithM · 2025-03-12T05:33:15Z

If doing an in-place apply and C is iso on input but not on output, and a non-positional operator is used , then we need to realloc C->x and set all numerical entries to the iso value. However, this pins C->x on the host which is bad for CUDA. This change defers the iso expansion to the appropriate point.

(would it be better to instead change the API for GB_apply_op to have a do_iso_expansion flag? The drawback with the current solution is that the expansion may be performed when not needed, if C is not iso on input.)

VidithM marked this pull request as draft March 12, 2025 05:33

Make apply more memory-friendly for CUDA

7b86930

VidithM force-pushed the dev2a branch from 314ed8b to 7b86930 Compare March 12, 2025 05:35

tag another issue

c7a174c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make apply more memory-friendly for CUDA#407

Make apply more memory-friendly for CUDA#407
VidithM wants to merge 2 commits intoDrTimothyAldenDavis:dev2from
VidithM:dev2a

VidithM commented Mar 12, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

VidithM commented Mar 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

VidithM commented Mar 12, 2025 •

edited

Loading