feat: update conv1d_update op for Qwen3-Next/Qwen3.5. by maojunx99 · Pull Request #1291 · jd-opensource/xllm

maojunx99 · 2026-04-16T03:07:10Z

Summary

Change the shape of conv_cache in kv_cache so that later transposes are avoided and computation is more efficient.
The new version of the conv1d_update operator has been adapted for NPU.

This update depends on the following operator library change: https://gitcode.com/xLLM-AI/torch_npu_ops/pull/13
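
For readers unfamiliar with the operator, here is a minimal sketch of what a single decode step of a depthwise causal conv1d update computes, and why the cache layout matters. It is not the NPU kernel from this PR. The function name `conv1d_update`, the time-major `[width - 1][dim]` cache layout, and the omission of bias and activation are all assumptions made for illustration; the actual layout chosen in the PR is not stated in the summary.

```python
# Hypothetical reference sketch (pure Python, no NPU or torch dependency).
# Assumed layout: the cache holds the last (width - 1) inputs, oldest first,
# one row per time step and one column per channel.

def conv1d_update(state, x, weight):
    """Run one decode step of a depthwise causal conv1d.

    state:  [width - 1][dim] cached inputs, oldest first (assumed layout)
    x:      [dim] features of the new token
    weight: [width][dim] per-channel filter taps, oldest tap first
    Returns (y, new_state), where y is [dim].
    """
    width = len(weight)
    dim = len(x)
    assert len(state) == width - 1, "cache must hold width - 1 past steps"

    # Appending the new token as one more row needs no transpose when the
    # cache is already time-major.
    window = state + [x]  # [width][dim]

    # Each channel is filtered independently (depthwise convolution).
    y = [
        sum(weight[k][c] * window[k][c] for k in range(width))
        for c in range(dim)
    ]

    # Slide the window: drop the oldest row, keep the newest width - 1 rows.
    new_state = window[1:]
    return y, new_state


state = [[1.0, 2.0], [3.0, 4.0]]                # width - 1 = 2 steps, dim = 2
x = [5.0, 6.0]
weight = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # width = 3
y, new_state = conv1d_update(state, x, weight)
print(y)          # [6.0, 10.0]
print(new_state)  # [[3.0, 4.0], [5.0, 6.0]]
```

If the cache were instead stored channel-major, each step would need a transpose (or strided access) before the new token could be appended, which is the kind of overhead the summary says the new shape is meant to reduce.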

gemini-code-assist

Code Review

This pull request introduces the causal_conv1d_update_v2 kernel for NPU and updates the Qwen3GatedDeltaNetBase implementation. Key feedback includes correcting an erroneous transpose in the prefill path that impacts tensor narrowing, fixing a naming convention violation for local variables, and removing a redundant batch variable. Additionally, a reshape operation in the decode path needs to be corrected to ensure the kernel receives the expected tensor dimensions.

maojunx99 requested review from DongheJin, JimHsiung, RobbieLeung, XuZhang99, liutongxuan, walsonyang and yq33victor as code owners April 16, 2026 03:07

gemini-code-assist bot reviewed Apr 16, 2026

maojunx99 force-pushed the main branch 2 times, most recently from 53ed5d5 to a6ee25d Compare April 16, 2026 06:31

yingxudeng reviewed Apr 16, 2026

Comment thread xllm/core/kernels/ops_api.cpp Outdated

Comment thread xllm/core/kernels/param.h Outdated

Comment thread xllm/core/distributed_runtime/llm_engine.cpp

yingxudeng changed the title ~~update conv1d_updae op for Qwen3 Next/Qwen 3.5~~ feat: update conv1d_update op for Qwen3-Next/Qwen3.5. Apr 16, 2026

maojunx99 force-pushed the main branch from a6ee25d to 8844ef8 Compare April 16, 2026 11:15

zhang-minchao reviewed Apr 16, 2026

Comment thread xllm/core/layers/npu_torch/qwen3_gated_delta_net_base.cpp

yingxudeng reviewed Apr 16, 2026

Comment thread xllm/core/layers/npu_torch/qwen3_gated_delta_net_base.cpp Outdated

yingxudeng reviewed Apr 16, 2026

Comment thread xllm/core/kernels/ops_api.cpp Outdated

maojunx99 force-pushed the main branch 2 times, most recently from 3e67b12 to 5afd0de Compare April 17, 2026 07:00

zhang-minchao previously approved these changes Apr 17, 2026

yingxudeng previously approved these changes Apr 17, 2026

DongheJin previously approved these changes Apr 17, 2026

update conv1d_update op

ab4ed2e

maojunx99 dismissed stale reviews from DongheJin, yingxudeng, and zhang-minchao via ab4ed2e April 18, 2026 12:59

maojunx99 force-pushed the main branch from 5afd0de to ab4ed2e Compare April 18, 2026 12:59

yingxudeng previously approved these changes Apr 18, 2026

zhang-minchao previously approved these changes Apr 18, 2026

maojunx99 dismissed stale reviews from zhang-minchao and yingxudeng via 623a1e9 April 19, 2026 13:48

yingxudeng previously approved these changes Apr 19, 2026

update torch_npu_ops commit_hash

61ca7b4

maojunx99 dismissed yingxudeng’s stale review via 61ca7b4 April 20, 2026 06:22

maojunx99 force-pushed the main branch from 623a1e9 to 61ca7b4 Compare April 20, 2026 06:22

yingxudeng approved these changes Apr 20, 2026

DongheJin approved these changes Apr 20, 2026

maojunx99 wants to merge 2 commits into jd-opensource:main from maojunx99:main

4 participants