Skip to content

feat: add 4-bit quantization support to turboquant_attention

08d7fe6
Select commit
Loading
Failed to load commit list.
Closed

feat: add mx.fast.turboquant_attention for compressed KV cache #3340

feat: add 4-bit quantization support to turboquant_attention
08d7fe6
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs