Skip to content

Update Case 1 to recognize both quantized dict and WithAux as prequantized template.#266

Open
copybara-service[bot] wants to merge 1 commit intomainfrom
test_908764879
Open

Update Case 1 to recognize both quantized dict and WithAux as prequantized template.#266
copybara-service[bot] wants to merge 1 commit intomainfrom
test_908764879

Conversation

@copybara-service
Copy link
Copy Markdown

Update Case 1 to recognize both quantized dict and WithAux as prequantized template.

Qwen 3.5 MoE overrides the original fp template (non-intercepted operations are not wrapped in WithAux) for Pallas_call to be a dict like {'array': {'qvalue': ..., 'scale': ...}}. Previously, this falls back to Case 2 "checkpoint is prequantized, template is fp" because it is a non-WithAux quantized format. With this change, this accurately categorizes to Case 1 "checkpoint is prequantized, template is prequantized".

…tized template.

Qwen 3.5 MoE overrides the original fp template (non-intercepted operations are not wrapped in WithAux) for Pallas_call to be a dict like {'array': {'qvalue': ..., 'scale': ...}}. Previously, this falls back to Case 2 "checkpoint is prequantized, template is fp" because it is a non-WithAux quantized format. With this change, this accurately categorizes to Case 1 "checkpoint is prequantized, template is prequantized".

PiperOrigin-RevId: 908764879
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant