@zhangyue207
Collaborator

__device__ __forceinline__ unsigned int cvtaToShared(const void *ptr) {
#if defined(__CUDA_ARCH__) && defined(__cvta_generic_to_shared)
    return static_cast<unsigned int>(__cvta_generic_to_shared(ptr));
#else
    return static_cast<unsigned int>(reinterpret_cast<uintptr_t>(ptr));
#endif
}

Please check whether this change has any impact on NVIDIA (nv).
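
To make the concern concrete, here is a minimal sketch; it is not the PR's actual fix. `defined(...)` only detects preprocessor macros, and as far as I know `__cvta_generic_to_shared` is declared as an ordinary device function in the CUDA headers rather than as a macro. If that holds, the guard above evaluates to false on NVIDIA and the plain-cast fallback is selected silently. The sketch keys the NVIDIA path on `__CUDA_ARCH__` alone and uses an explicit opt-out macro instead. `cvtaToSharedSketch` and `SKETCH_NO_CVTA_INTRINSIC` are hypothetical names introduced for illustration, and the `#ifndef __CUDACC__` shim exists only so the fallback branch can be compiled with a plain host compiler.

```cpp
#include <cstdint>

// Shim (assumption, illustration only): lets this file build without nvcc.
#ifndef __CUDACC__
#define __host__
#define __device__
#define __forceinline__ inline
#endif

// Hypothetical variant of cvtaToShared. A backend without the intrinsic would
// define SKETCH_NO_CVTA_INTRINSIC (a made-up macro name) in its build flags.
__host__ __device__ __forceinline__ unsigned int cvtaToSharedSketch(const void *ptr) {
#if defined(__CUDA_ARCH__) && !defined(SKETCH_NO_CVTA_INTRINSIC)
    // Device compilation pass: convert the generic address to a
    // shared-window address using the intrinsic.
    return static_cast<unsigned int>(__cvta_generic_to_shared(ptr));
#else
    // Host pass or opted-out backend: keep the low bits of the generic
    // address, matching the fallback branch of the snippet under review.
    return static_cast<unsigned int>(reinterpret_cast<std::uintptr_t>(ptr));
#endif
}
```

With this shape, NVIDIA device code keeps calling the intrinsic unless a backend opts out explicitly, so the NVIDIA path does not depend on whether the intrinsic happens to be visible to the preprocessor.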

@zhangyue207 requested a review from a team on January 28, 2026, 06:05

2 participants