⚡ Zero-stall MoE inference via lookahead prediction and async DMA prefetching. Optimized for SSD offloading with hybrid MLA + sliding-window attention.
Topics: open-source, artificial-intelligence, lora, high-throughput, open-models, mixture-of-experts, llm, generative-ai, large-language-model, streaming-llm, predictive-inference, sliding-window-attention, io-latency-hiding, async-dma, ssd-offloading, lookahead-routing, mla-attention, dual-layer-moe
Updated Apr 23, 2026 · Python
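The core idea named in the description, hiding expert-load latency by prefetching the predicted next experts while the current layer computes, can be sketched roughly as follows. This is an illustrative toy, not the repo's actual API: `load_expert`, `predict_next`, and the in-memory `EXPERTS` dict are all hypothetical stand-ins for SSD reads, lookahead routing, and the expert store.

```python
# Hedged sketch: overlap (fake) expert I/O with (fake) compute using a
# background thread as a stand-in for async DMA / SSD prefetching.
from concurrent.futures import ThreadPoolExecutor

EXPERTS = {i: f"weights[{i}]" for i in range(8)}  # hypothetical expert store ("SSD")

def load_expert(idx):
    # Stand-in for an SSD read / DMA transfer of one expert's weights.
    return EXPERTS[idx]

def predict_next(layer):
    # Stand-in for lookahead routing: guess which expert the next layer needs.
    return (layer + 1) % 8

def run_layers(n_layers):
    pool = ThreadPoolExecutor(max_workers=1)
    current = load_expert(0)  # first load has no prior compute to hide behind
    outputs = []
    for layer in range(n_layers):
        # Kick off the prefetch for the predicted next expert...
        nxt = pool.submit(load_expert, predict_next(layer))
        # ...then "compute" with the already-resident expert; the I/O above
        # runs concurrently, which is where the stall-hiding comes from.
        outputs.append(f"layer{layer}:{current}")
        current = nxt.result()  # ideally already finished by this point
    pool.shutdown()
    return outputs
```

If the lookahead prediction misses, a real implementation would fall back to a synchronous load for the mispredicted expert; this sketch assumes the prediction is always correct.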