[Neuron] Add AWS Neuron (Trainium/Inferentia) as an officially supported device #13289

Draft
JingyaHuang wants to merge 10 commits into huggingface:main from JingyaHuang:add-neuron-backend

Conversation

Contributor

@JingyaHuang JingyaHuang commented Mar 19, 2026

What does this PR do?

This PR adds AWS Neuron (Trainium/Inferentia) as an officially supported compute backend in Diffusers, on par with existing backends like CUDA, MPS, XPU, and MLU.

Changes

  • import_utils.py — adds is_torch_neuronx_available() detection, following the existing pattern for optional backends.
  • torch_utils.py — registers "neuron" in all backend dispatch tables (BACKEND_SUPPORTS_TRAINING, BACKEND_EMPTY_CACHE, BACKEND_DEVICE_COUNT, BACKEND_MANUAL_SEED, etc.) and adds a randn_tensor workaround since Neuron/XLA does not support creating random tensors directly on device (falls back to CPU).
  • utils/__init__.py — exports is_torch_neuronx_available.
  • pipeline_utils.py — adds two new DiffusionPipeline methods:
    • enable_neuron_compile(model_names, cache_dir, fullgraph) — wraps pipeline nn.Module components with torch.compile(backend="neuron") for whole-graph NEFF compilation. Supports optional NEFF caching via TORCH_NEURONX_NEFF_CACHE_DIR.
    • neuron_warmup(*args, **kwargs) — runs a single dummy forward pass to trigger upfront neuronx-cc compilation before timed inference.
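
The availability check added to import_utils.py follows the pattern Diffusers already uses for other optional backends. A minimal sketch of that pattern (illustrative only, not the PR's exact code):

```python
import importlib.util


def is_torch_neuronx_available() -> bool:
    """Detect the torch_neuronx package without importing it, using the
    importlib-based check common to optional-backend detection."""
    return importlib.util.find_spec("torch_neuronx") is not None
```

Callers can then gate Neuron-specific code paths (device dispatch, the randn_tensor CPU fallback) behind this check, the same way the existing backend checks are used.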

Usage

  • Eager mode
```python
import torch
import torch_neuronx  # noqa: F401 — registers torch.neuron

from diffusers import AutoPipelineForText2Image

# Load and move to the Neuron device
pipe = AutoPipelineForText2Image.from_pretrained(
    "stabilityai/sdxl-turbo",
    torch_dtype=torch.bfloat16,
    variant="fp16",
)
pipe = pipe.to(torch.neuron.current_device())

# Warmup
pipe(prompt="warmup", height=512, width=512, num_inference_steps=1, guidance_scale=0.0)

# Inference
image = pipe(
    prompt="a golden retriever surfing a wave, photorealistic",
    height=512,
    width=512,
    num_inference_steps=1,
    guidance_scale=0.0,
).images[0]

image.save("output.png")
```
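
  • Compile mode

Based on the method descriptions above, compile mode could look roughly like the sketch below. It requires a Trainium/Inferentia instance with torch_neuronx installed; the component names passed to model_names and the cache path are assumptions for illustration.

```python
import torch
import torch_neuronx  # noqa: F401 — registers torch.neuron

from diffusers import AutoPipelineForText2Image

pipe = AutoPipelineForText2Image.from_pretrained(
    "stabilityai/sdxl-turbo",
    torch_dtype=torch.bfloat16,
    variant="fp16",
)
pipe = pipe.to(torch.neuron.current_device())

# Wrap pipeline components with torch.compile(backend="neuron") and
# persist NEFF artifacts so later runs skip recompilation.
pipe.enable_neuron_compile(
    model_names=["unet", "vae"],  # assumption: which components to compile
    cache_dir="/tmp/neff-cache",  # assumption: any writable path
    fullgraph=True,
)

# Run one dummy forward pass to trigger upfront neuronx-cc compilation
pipe.neuron_warmup(prompt="warmup", height=512, width=512, num_inference_steps=1, guidance_scale=0.0)

image = pipe(
    prompt="a golden retriever surfing a wave, photorealistic",
    height=512,
    width=512,
    num_inference_steps=1,
    guidance_scale=0.0,
).images[0]
```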

Next Steps

  • Enable torch.compile on the Neuron device
  • Add tensor-parallel support for memory-bound devices like Neuron
  • Make Diffusers compatible with the NKI kernels library to further boost performance on Neuron in compile mode

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@github-actions github-actions bot added lora examples size/M PR with diff < 200 LOC and removed size/M PR with diff < 200 LOC labels Apr 9, 2026
@github-actions github-actions bot added size/M PR with diff < 200 LOC and removed size/M PR with diff < 200 LOC labels Apr 10, 2026
