
Feat: integrate NNX LoRA support via Qwix with unified configuration #3320

Open
RexBearIU wants to merge 1 commit into main from jackyf/feat/lora-nnx

Conversation

@RexBearIU (Collaborator) commented Mar 5, 2026

Description

Overview
This pull request introduces native LoRA support in MaxText by leveraging the NNX model definition and the Qwix library. It enables a seamless workflow for applying LoRA adapters during
training and provides utilities for bidirectional checkpoint conversion with the HuggingFace ecosystem.

Key Changes

  • Core NNX Integration:
    • Refactored NNXDecoder layer application logic to support nnx.scan with dynamic graph initialization, ensuring compatibility with Qwix's parameter materialization.
  • SFT Pipeline Enhancements:
    • Integrated apply_lora_to_model and restore_lora_from_path into the SFT trainer (sketched after this list).
    • Added dummy-input preparation to materialize LoRA parameters before trainer initialization.
  • Bidirectional Conversion Scripts:
    • hf_lora_to_maxtext.py: Converts HuggingFace PEFT adapters to MaxText checkpoint format. Updated to 2026 copyright and cleaned up comments.
    • maxtext_to_hf_lora.py: Converts MaxText LoRA checkpoints back to HuggingFace format. Updated to use max_logging and 2026 copyright.
  • Configuration & Type System:
    • Added lora_module_path auto-detection logic for popular models (Llama, etc.) via lora_module_path.yml.
    • Updated types.py with specific LoRA/QLoRA fields.
  • Current Limitations:
    • QLoRA flags (lora_weight_qtype, lora_tile_size) are included in the configuration but explicitly marked as TODO / Not Working for this initial release.
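
For orientation, the core flow looks roughly like the following. This is a minimal sketch assuming Qwix's LoraProvider / apply_lora_to_model interface as shown in its public examples; the module-path regex and the rank/alpha values are placeholders (in this PR they come from the unified configuration), and the actual MaxText wrapper signatures may differ.

  import qwix
  from flax import nnx

  def apply_lora(base_model: nnx.Module, dummy_inputs: dict) -> nnx.Module:
    # Placeholder regex and hyperparameters; the real values are read
    # from the unified LoRA configuration in this PR.
    provider = qwix.LoraProvider(
        module_path=".*self_attention.*|.*mlp.*",  # regex over the nnx module path tree
        rank=16,
        alpha=16.0,
    )
    # Qwix needs a forward trace to materialize the LoRA parameters,
    # which is why the SFT pipeline prepares dummy inputs first.
    return qwix.apply_lora_to_model(base_model, provider, **dummy_inputs)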

Tests

The Qwix-based LoRA implementation was validated through a new unit test suite and verified via a comprehensive tutorial.

  1. Unit Tests
    Implemented tests/unit/lora_utils_test.py to ensure structural correctness and trainer compatibility. Key areas covered:
  • Model Transformation: Verified that apply_lora_to_model correctly injects nnx.LoRAParam into the model state (a condensed version is sketched after this list).
  • Layer Scanning: Confirmed the implementation works with both scan_layers=True and scan_layers=False by handling the resulting differences in the nnx module path tree.
  • Trainer Compatibility: Validated that tunix.sft.peft_trainer.PeftTrainer correctly identifies the LoRA parameters for optimization, ensuring only adapter weights are trained.
  • Path Matching: Tested the regex logic for auto-detecting LoRA target modules across different model architectures (e.g., Llama).
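
A condensed, hypothetical version of the model-transformation check (the actual assertions in tests/unit/lora_utils_test.py are more thorough):

  import jax
  from flax import nnx

  def test_lora_params_injected(lora_model):
    # Filter the model state down to LoRA adapter parameters only.
    lora_state = nnx.state(lora_model, nnx.LoRAParam)
    # At least one adapter weight should have been materialized.
    assert len(jax.tree_util.tree_leaves(lora_state)) > 0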

Command to run unit tests:

  # From the maxtext root directory
  export PYTHONPATH=$PYTHONPATH:$(pwd)/src:$(pwd)
  python3 tests/unit/lora_utils_test.py

  2. Documentation
  • Added docs/tutorials/posttraining/lora.md, which provides a step-by-step guide for running LoRA fine-tuning, including environment setup and checkpoint conversion. This tutorial serves as the reference for end-to-end functional verification. A simplified view of the key remapping performed by the conversion scripts is sketched below.
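
As a rough illustration of what the conversion scripts do: HF PEFT stores adapters as lora_A/lora_B tensor pairs in adapter_model.safetensors, and the scripts remap those keys between naming schemes. The MaxText-side key names below are hypothetical placeholders, not the actual checkpoint layout.

  import re
  from safetensors import safe_open

  def load_peft_adapter(path: str) -> dict:
    """Reads an HF PEFT adapter and remaps keys to a MaxText-like layout."""
    tensors = {}
    with safe_open(path, framework="np") as f:
      for hf_key in f.keys():
        # e.g. base_model.model.model.layers.0.self_attn.q_proj.lora_A.weight
        m = re.search(r"layers\.(\d+)\.(\w+)\.(\w+)\.lora_([AB])\.weight", hf_key)
        if m:
          layer, block, proj, ab = m.groups()
          # Hypothetical MaxText-side key; the real mapping lives in
          # hf_lora_to_maxtext.py.
          key = f"decoder/layers_{layer}/{block}/{proj}/lora_{ab.lower()}"
          tensors[key] = f.get_tensor(hf_key)
    return tensors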

Logit test result

https://paste.googleplex.com/6233928391327744

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

RexBearIU force-pushed the jackyf/feat/lora-nnx branch from 11939f9 to 6540bc8 on March 5, 2026 10:53

RexBearIU force-pushed the jackyf/feat/lora-nnx branch 7 times, most recently from 69e481b to 80b5592 on March 11, 2026 08:19
RexBearIU force-pushed the jackyf/feat/lora-nnx branch 11 times, most recently from f5a0f6d to 23e79c7 on March 25, 2026 08:33
RexBearIU force-pushed the jackyf/feat/lora-nnx branch 5 times, most recently from 5a05148 to 7570b3d on April 14, 2026 02:45
RexBearIU marked this pull request as ready for review on April 14, 2026 04:08
Review thread on src/maxtext/checkpoint_conversion/hf_lora_to_maxtext.py (outdated)
RexBearIU force-pushed the jackyf/feat/lora-nnx branch 2 times, most recently from 0dfeb76 to 2f91ad8 on April 16, 2026 10:32
RexBearIU changed the title from "Jackyf/feat/lora nnx" to "Feat: integrate NNX LoRA support via Qwix with unified configuration" on Apr 16, 2026
RexBearIU force-pushed the jackyf/feat/lora-nnx branch from 2f91ad8 to f5736a1 on April 16, 2026 10:54
Review threads on src/maxtext/layers/nnx_decoders.py and docs/tutorials/posttraining/lora.md (outdated)
RexBearIU force-pushed the jackyf/feat/lora-nnx branch 5 times, most recently from b747695 to a952892 on April 24, 2026 14:39
Review threads on docs/_static/js/editable_commands.js (3) and src/maxtext/utils/lora_utils.py (outdated)
@bvandermoon (Collaborator) left a comment

Thank you @RexBearIU. Left some comments but this is generally looking good to me

Review threads on docs/tutorials/posttraining/lora.md (7), pytest.ini, src/maxtext/examples/sft_llama3_demo_tpu.ipynb, src/maxtext/configs/types.py, src/maxtext/checkpoint_conversion/utils/utils.py, src/maxtext/utils/lora_utils.py, src/dependencies/requirements/base_requirements/requirements.txt, and src/maxtext/utils/maxtext_utils.py (several outdated)