
perf: Update moe_token_dispatcher_type default to alltoall#2004

Open

parthmannan wants to merge 2 commits into NVIDIA-NeMo:main from parthmannan:pmannan/update_moe_default

Conversation

@parthmannan (Contributor) commented Feb 22, 2026

What does this PR do ?

Updates the default moe_token_dispatcher_type for Megatron MoE policy configurations from "allgather" to "alltoall" across the example configs, tests, and documentation.

Issues

List issues that this PR closes (syntax):

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this
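The fragment below is an illustrative sketch of the setting this PR changes, not a verbatim excerpt: the key nesting under a Megatron policy section is assumed from the affected example configs, and only moe_token_dispatcher_type itself comes from this PR.

```yaml
# Illustrative config fragment (key nesting assumed, modeled on the
# examples/configs/*_megatron.yaml files touched by this PR).
policy:
  megatron_cfg:
    # New default after this PR; was "allgather" previously.
    moe_token_dispatcher_type: "alltoall"
```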

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you run the unit tests and functional tests locally? Visit our Testing Guide for how to run tests
  • Did you add or update any necessary documentation? Visit our Document Development Guide for how to write, build and test the docs.

Additional Information

  • ...

Summary by CodeRabbit

  • Chores
    • Updated MoE token dispatcher configuration from "allgather" to "alltoall" across multiple example configurations.
    • Updated corresponding test configurations and documentation to reflect the new dispatcher default.

@parthmannan parthmannan requested review from a team as code owners February 22, 2026 21:19
@parthmannan parthmannan requested review from guyueh1 and removed request for a team February 22, 2026 21:19
coderabbitai bot (Contributor) commented Feb 22, 2026

📝 Walkthrough

The PR updates Megatron MoE token dispatcher configuration from "allgather" to "alltoall" across example configuration files, test files, and documentation, changing the distributed token routing strategy.

Changes

Cohort / File(s) — Summary

Example Configurations
examples/configs/distillation_math.yaml, examples/configs/distillation_math_megatron.yaml, examples/configs/dpo.yaml, examples/configs/grpo_math_1B.yaml, examples/configs/grpo_math_1B_megatron.yaml, examples/configs/sft.yaml, examples/configs/sft_openmathinstruct2_megatron.yaml, examples/configs/vlm_grpo_3B.yaml, examples/configs/vlm_grpo_3B_megatron.yaml, examples/nemo_gym/grpo_workplace_assistant_nemotron_nano_v2_9b.yaml
Updated moe_token_dispatcher_type from "allgather" to "alltoall" in the Megatron policy configurations of all example YAML files.

Documentation
nemo_rl/models/policy/__init__.py
Updated the documentation comment for MegatronConfig.moe_token_dispatcher_type to reflect the new default value of "alltoall".

Test Configurations
tests/unit/models/generation/test_vllm_generation.py, tests/unit/models/megatron/test_megatron_setup.py, tests/unit/models/policy/test_megatron_worker.py
Updated Megatron test configurations to expect a moe_token_dispatcher_type value of "alltoall" instead of "allgather".
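As a back-of-envelope illustration of why this default matters (our own sketch, not Megatron code: the function names and the uniform-routing assumption are hypothetical), an allgather dispatcher replicates every rank's tokens to every rank, while an alltoall dispatcher sends each token only to the rank hosting its routed expert:

```python
# Toy model of communication volume for the two MoE token-dispatch strategies.
# Not Megatron code; uniform token routing across ranks is assumed.

def allgather_volume(tokens_per_rank: int, num_ranks: int) -> int:
    """Allgather: every rank receives every rank's tokens, so total traffic
    scales with the square of the number of ranks."""
    return tokens_per_rank * num_ranks * num_ranks

def alltoall_volume(tokens_per_rank: int, num_ranks: int) -> int:
    """Alltoall: each token travels only to the single rank hosting its
    routed expert, so total traffic scales linearly with the token count."""
    return tokens_per_rank * num_ranks

if __name__ == "__main__":
    print(allgather_volume(1024, 8))  # 65536 tokens moved in total
    print(alltoall_volume(1024, 8))   # 8192 tokens moved in total
```

Under these assumptions the alltoall dispatcher moves num_ranks times less data, which is the performance motivation the "perf:" prefix points at.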

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~5 minutes

Suggested labels

Performance

Suggested reviewers

  • terrykong
🚥 Pre-merge checks | ✅ 3 | ❌ 1

❌ Failed checks (1 warning)

Test Results For Major Changes — ⚠️ Warning
Explanation: The PR changes the moe_token_dispatcher_type default from "allgather" to "alltoall" across multiple configs, a significant distributed-training change affecting performance and convergence, but the PR description lacks test results or regression validation.
Resolution: Add test results and performance metrics demonstrating no regressions from the dispatcher type change, including convergence validation and before/after performance numbers.

✅ Passed checks (3 passed)

Description Check — ✅ Passed: Check skipped; CodeRabbit's high-level summary is enabled.
Title Check — ✅ Passed: The title clearly and specifically describes the main change: updating the default value of moe_token_dispatcher_type from "allgather" to "alltoall" across multiple configuration files.
Docstring Coverage — ✅ Passed: Docstring coverage is 100.00%, above the required 80.00% threshold.


@parthmannan parthmannan changed the title from "Update moe_token_dispatcher_type default to alltoall" to "perf: Update moe_token_dispatcher_type default to alltoall" on Feb 22, 2026
Signed-off-by: Parth Mannan <pmannan@nvidia.com>
coderabbitai bot (Contributor) left a comment


Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
nemo_rl/models/policy/__init__.py (1)

1-1: ⚠️ Potential issue | 🟡 Minor

Update the copyright year to 2026.

The file was modified but the header still shows 2025; update it to the current year.

🔧 Suggested fix
-# Copyright (c) 2025, NVIDIA CORPORATION.  All rights reserved.
+# Copyright (c) 2026, NVIDIA CORPORATION.  All rights reserved.

As per coding guidelines: Add the NVIDIA copyright header (with current year) to all Python files and shell scripts, excluding tests (files under tests/ or test-only scripts).

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@nemo_rl/models/policy/__init__.py` at line 1, Replace the outdated copyright
year in the top-of-file header comment from 2025 to 2026; locate the header
comment at the beginning of the module (the copyright comment line in
nemo_rl/models/policy/__init__.py) and update the year to "2026" so the NVIDIA
copyright header is current.

@parthmannan parthmannan force-pushed the pmannan/update_moe_default branch from 90a737d to 493ddef Compare February 22, 2026 21:49
@parthmannan parthmannan added the CI:L2 Run doctests, unit tests, functional tests, and convergence tests label Feb 23, 2026
