Hi authors and community,
Huge thanks for open-sourcing the training scripts! It's truly great contribution to the field.
I have two questions regarding the training process and would appreciate any insights from those who have experimented with it:
-
I am currently attempting a full train of the Wan 2.2 TI2V 5B model. I was wondering if anyone could share their training speed?
(like, what is the approximate seconds per step (s/step) I should expect? )
-
I noticed a specific issue during generation/training: there seems to be a slight but noticeable sudden color shift immediately after the first frame (the conditioning image). The subsequent frames have a slightly different color tone compared to the initial frame.
Has anyone else encountered this issue? Is this related to VAE encoding/decoding or a specific training configuration?
Any advice or reference data would be greatly appreciated. Thanks again!