[Doc][Polish] avoid memory leak and explain some points #54

muyuuuu · 2024-11-25T16:12:50Z

避免内存泄漏
一次 IO，多次计算那里是我个人的理解，在之前 opencl 优化算子中也用到了，算一种通用思路吧
矩阵大小不为 32 的倍数时会有段错误。我见过的优化方法是先取出能整除 32 的图像区域进行 cuda 加速，边界部分用 C 处理。我不确定百度的优化方法，就没写解决方案
TM 的解释，我看了好久看懂了 TM 的用法，擅自主张加了个解释

AndSonder

LGTM

AndSonder · 2024-11-26T02:52:41Z

感谢对文档的补充！

“矩阵大小不为 32 的倍数时会有段错误” 这个问题其实在很多sgemm优化算法里面都会遇到，文档里面的文章主要还是学习这些优化方法，考虑特别多边界情况的话会让代码变的非常复杂

[Doc][Polish] avoid memory leak and explain some points

ebb5463

AndSonder approved these changes Nov 26, 2024

View reviewed changes

AndSonder merged commit 2d37c86 into PaddleJitLab:develop Nov 26, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Doc][Polish] avoid memory leak and explain some points #54

[Doc][Polish] avoid memory leak and explain some points #54

Uh oh!

muyuuuu commented Nov 25, 2024

Uh oh!

AndSonder left a comment

Uh oh!

AndSonder commented Nov 26, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[Doc][Polish] avoid memory leak and explain some points #54

[Doc][Polish] avoid memory leak and explain some points #54

Uh oh!

Conversation

muyuuuu commented Nov 25, 2024

Uh oh!

AndSonder left a comment

Choose a reason for hiding this comment

Uh oh!

AndSonder commented Nov 26, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants