perf: optimize LTX2 inference latency and implement granular TPU profiling by mbohlool · Pull Request #389 · AI-Hypercomputer/maxdiffusion

mbohlool · 2026-04-23T00:20:42Z

Description:
This PR introduces better timing and profiling capabilities to the LTX2 generation pipeline to help identify performance bottlenecks.

Key Changes:

Detailed Timing: Added time.perf_counter() blocks and jax.block_until_ready() calls across the pipeline to accurately measure text encoding, connector passes, denoising steps, VAE decoding, and post-processing.

Multi-Pass Execution: Updated generate_ltx2.py to support a three-stage execution flow:

Warmup Pass: For JIT compilation.

Generation Pass: For actual output and standard timing.

Profiling Pass: (Optional) Captured via max_utils.Profiler for a subset of steps.

Enhanced Logging: Added a summary table for Load, Compile, and Inference times.

Config Updates: Added skip_first_n_steps_for_profiler and profiler_steps to the LTX2 configuration.

Memory Management: Explicitly deletes large tensors (out, videos, audios) before the profiling run to prevent OOM.

github-actions · 2026-04-23T00:20:53Z

e2e testgrid: https://8bcf50593faf4ea38060e236169827e5-dot-us-central1.composer.googleusercontent.com/dags/maxdiffusion_tpu_e2e/grid

Perseus14 · 2026-04-23T04:35:53Z

@mbohlool Could you add a table with the latency gain (single video and amortized throughput) of this change with the baseline (main)?

Thanks!

mbohlool · 2026-05-01T22:22:23Z

@Perseus14 change the PR to focus only on the timing and profiling part. I explored the performance tweaking later. PTAL.

Perseus14 · 2026-05-02T03:44:14Z

        spec = NamedSharding(self.mesh, P(*activation_axes))
        video_embeds_sharded = jax.device_put(video_embeds, spec)
-        audio_embeds_sharded = jax.device_put(audio_embeds, spec)
+        audio_embeds_sharded = audio_embeds


@prishajain1 Could you check whether this will cause issues?

Perseus14 · 2026-05-02T03:45:24Z

+      f"  Load (checkpoint):   {load_time:>7.1f}s\n"
+      f"  Compile:             {compile_time:>7.1f}s\n"
+      f"  {'─' * 40}\n"
+      f"  Inference:           {generation_time:>7.1f}s\n"


Is it possible to print a component wise split here for quick analysis now that we are timing all the components?

Elisa has done something like this for the WAN pipelines here

github-actions · 2026-05-02T04:59:45Z

🤖 Hi @Perseus14, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.

github-actions · 2026-05-02T05:00:08Z

🤖 I'm sorry @Perseus14, but I was unable to process your request. Please see the logs for more details.

mbohlool requested a review from entrpn as a code owner April 23, 2026 00:20

mbohlool force-pushed the mehdy_perf branch 2 times, most recently from c2eae2f to 6bd35bf Compare April 23, 2026 00:51

mbohlool force-pushed the mehdy_perf branch from 6bd35bf to caaef98 Compare May 1, 2026 22:17

feat(ltx2): add performance profiling and timing instrumentation

6942969

mbohlool force-pushed the mehdy_perf branch from caaef98 to 6942969 Compare May 1, 2026 22:20

Perseus14 reviewed May 2, 2026

View reviewed changes

Perseus14 added the gemini-review label May 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: optimize LTX2 inference latency and implement granular TPU profiling#389

perf: optimize LTX2 inference latency and implement granular TPU profiling#389
mbohlool wants to merge 1 commit intomainfrom
mehdy_perf

mbohlool commented Apr 23, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Apr 23, 2026

Uh oh!

Perseus14 commented Apr 23, 2026

Uh oh!

mbohlool commented May 1, 2026

Uh oh!

Perseus14 May 2, 2026

Uh oh!

Perseus14 May 2, 2026

Uh oh!

github-actions Bot commented May 2, 2026

Uh oh!

github-actions Bot commented May 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mbohlool commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Apr 23, 2026

Uh oh!

Perseus14 commented Apr 23, 2026

Uh oh!

mbohlool commented May 1, 2026

Uh oh!

Perseus14 May 2, 2026

Choose a reason for hiding this comment

Uh oh!

Perseus14 May 2, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented May 2, 2026

Uh oh!

github-actions Bot commented May 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mbohlool commented Apr 23, 2026 •

edited

Loading