[flux.2 LoRA] make lora training compatible with flux.2 klein kv #13325
linoytsaban wants to merge 21 commits into huggingface:main from
Conversation
…transformer is None (e.g. when initializing the pipeline as a text encoding pipeline)
Style bot fixed some files and pushed the changes.
sayakpaul
left a comment
Thanks for getting started on this. Could we see some results across different setups?
examples/dreambooth/train_dreambooth_lora_flux2_klein_kv_img2img.py
    return images

...

def module_filter_fn(mod: torch.nn.Module, fqn: str):
Maybe for a different PR: we could move it to training_utils.py and name it module_filter_fn_torchao?
maybe do that in a separate refactor PR, since this also persists in other lora scripts?
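For reference, a minimal sketch of what a shared helper in training_utils.py could look like. The name module_filter_fn_torchao follows the suggestion above; the skip heuristics and threshold are illustrative assumptions, not the script's actual logic:

```python
import torch

def module_filter_fn_torchao(mod: torch.nn.Module, fqn: str) -> bool:
    # torchao's quantize_() accepts a filter of this shape and only
    # quantizes modules for which it returns True.
    if not isinstance(mod, torch.nn.Linear):
        return False
    # Skip very small projections; low-precision kernels rarely help there.
    # (The threshold of 16 is illustrative, not taken from the PR.)
    if mod.in_features < 16 or mod.out_features < 16:
        return False
    return True
```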
    return batch

...

class BucketBatchSampler(BatchSampler):
Can probably be moved to training_utils.py?
maybe do that in a separate refactor PR, since this also persists in other lora scripts?
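For readers following along, the idea behind a bucket batch sampler is that every batch draws only from one bucket (e.g. one aspect ratio), so samples in a batch share a shape. A framework-free sketch of the pattern (an illustration, not the script's actual class):

```python
import random
from collections import defaultdict

class BucketBatchSampler:
    """Group sample indices by bucket key (e.g. aspect ratio) so each
    batch contains only samples from a single bucket."""

    def __init__(self, bucket_keys, batch_size, drop_last=False, seed=0):
        self.buckets = defaultdict(list)
        for idx, key in enumerate(bucket_keys):
            self.buckets[key].append(idx)
        self.batch_size = batch_size
        self.drop_last = drop_last
        self.rng = random.Random(seed)

    def __iter__(self):
        batches = []
        for indices in self.buckets.values():
            self.rng.shuffle(indices)
            for i in range(0, len(indices), self.batch_size):
                batch = indices[i : i + self.batch_size]
                if len(batch) == self.batch_size or not self.drop_last:
                    batches.append(batch)
        self.rng.shuffle(batches)  # mix buckets across the epoch
        yield from batches

    def __len__(self):
        total = 0
        for indices in self.buckets.values():
            full, rem = divmod(len(indices), self.batch_size)
            total += full + (1 if rem and not self.drop_last else 0)
        return total
```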
# train transformer_blocks and single_transformer_blocks
target_modules = ["to_k", "to_q", "to_v", "to_out.0"] + [
    "to_qkv_mlp_proj",
    *[f"single_transformer_blocks.{i}.attn.to_out" for i in range(24)],
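For context, lists like the one above expand to per-block module names that a LoRA config (e.g. peft's LoraConfig(target_modules=...)) matches against. A small sketch of the expansion, using the block count of 24 from the snippet (the helper name is hypothetical):

```python
def build_target_modules(num_single_blocks: int = 24) -> list[str]:
    # Attention projections matched across all transformer blocks,
    # plus the fused qkv/mlp projection and the per-block attention
    # output of each single-stream block.
    target_modules = ["to_k", "to_q", "to_v", "to_out.0", "to_qkv_mlp_proj"]
    target_modules += [
        f"single_transformer_blocks.{i}.attn.to_out"
        for i in range(num_single_blocks)
    ]
    return target_modules
```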
# if cache_latents is set to True, we encode images to latents and store them.
# Similar to pre-encoding in the case of a single instance prompt, if custom prompts are provided
# we encode them in advance as well.
precompute_latents = args.cache_latents or train_dataset.custom_instance_prompts
What fix was needed for the cache keying of the precomputed latents?
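For context, precomputation like this is usually a build-the-cache-once pattern; the keying question matters because keying by a non-unique value (e.g. a repeated prompt) silently collides. A torch-free sketch of the pattern, with illustrative names:

```python
def build_latent_cache(images, encode_fn, key_fn):
    # key_fn must uniquely identify each image (e.g. its file path);
    # keying by prompt alone breaks when prompts repeat across images.
    cache = {}
    for image in images:
        cache[key_fn(image)] = encode_fn(image)
    return cache
```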
    sigma = sigma.unsqueeze(-1)
    return sigma

...

def calculate_shift(
Could use it from the pipeline itself?
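For reference, the helper as it appears in diffusers' Flux pipelines computes a sequence-length-dependent shift mu by linear interpolation; the sketch below is from memory and the default values are assumptions that may differ per pipeline:

```python
def calculate_shift(
    image_seq_len: int,
    base_seq_len: int = 256,
    max_seq_len: int = 4096,
    base_shift: float = 0.5,
    max_shift: float = 1.15,
) -> float:
    # Linearly interpolate the shift between base_shift (short sequences)
    # and max_shift (long sequences, i.e. higher resolutions).
    m = (max_shift - base_shift) / (max_seq_len - base_seq_len)
    b = base_shift - m * base_seq_len
    return image_seq_len * m + b
```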
# When caption dropout triggers, we replace the real prompt embedding with this.
# Note: empty_prompt_embeds and empty_text_ids are computed above before the text encoder is freed.
if args.caption_dropout_rate > 0.0:
    if empty_prompt_embeds is None:
This will probably not be the case since we're already doing:

    if args.caption_dropout_rate > 0.0:
        logger.info("Pre-computing empty prompt embeddings for caption dropout...")
        with offload_models(text_encoding_pipeline, device=accelerator.device, offload=args.offload):
            empty_prompt_embeds, empty_text_ids = compute_text_embeddings("", text_encoding_pipeline)

# Clone when caption dropout is active to avoid mutating the cache.
if args.caption_dropout_rate > 0.0:
    prompt_embeds = prompt_embeds_cache[cache_key].clone()
    text_ids = text_ids_cache[cache_key].clone()
else:
    prompt_embeds = prompt_embeds_cache[cache_key]
    text_ids = text_ids_cache[cache_key]
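The clone guard matters because the cache stores references: without it, an in-place caption-dropout replacement would corrupt the cached embedding for every later step. A torch-free illustration of the failure mode it prevents (a list stands in for a tensor; names are hypothetical):

```python
cache = {"a photo of sks dog": [0.1, 0.2, 0.3]}  # toy "embedding" cache

def fetch(key, caption_dropout_active):
    emb = cache[key]
    if caption_dropout_active:
        emb = emb.copy()  # analogue of tensor.clone(); protects the cache
    return emb

emb = fetch("a photo of sks dog", caption_dropout_active=True)
emb[:] = [0.0, 0.0, 0.0]  # dropout overwrites the prompt embedding in place
# The cached entry is untouched because we cloned first.
```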
…mg.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Adds a flux.2 klein kv training script, with additional general changes that can be propagated to other LoRA scripts as well:
Default aspect ratio buckets - uses preset aspect ratio buckets to avoid needing to pass them manually.
--caption_dropout_rate - Randomly replaces prompts with empty strings at the given rate (default 0.05). Forces the model to learn from the visual signal alone on some steps, improving robustness.
--shift_timesteps - Resolution-adaptive timestep sampling. Samples t from a sigmoid distribution, then warps it with t' = (t·μ)/(1 + (μ−1)·t), where μ scales with the latent sequence length, so higher resolutions get more high-noise training. This is the default behaviour in popular trainers like ai-toolkit and kohya.
Cache keying fix - fixes a bug in the latent/prompt cache keying.
Multiple conditions support - allows for multiple image conditions per example.
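The --shift_timesteps warp described above can be sketched as follows; this is a minimal illustration built from the formula in the description, not the script's exact code, and the sampling helper's names are assumptions:

```python
import math

def shift_timestep(t: float, mu: float) -> float:
    # Warp t' = (t * mu) / (1 + (mu - 1) * t). With mu > 1 mass shifts
    # toward t = 1 (high noise); mu = 1 is the identity. The endpoints
    # t = 0 and t = 1 are fixed points for any mu.
    return (t * mu) / (1.0 + (mu - 1.0) * t)

def sample_sigmoid_timestep(rng) -> float:
    # Sample from a logit-normal distribution: sigmoid of a standard
    # Gaussian draw. rng only needs a .gauss(mu, sigma) method.
    return 1.0 / (1.0 + math.exp(-rng.gauss(0.0, 1.0)))
```

Combined, a training step would draw t with sample_sigmoid_timestep and warp it with shift_timestep, where mu is derived from the latent sequence length of the current resolution.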