I am really looking forward to using a Muon-like optimizer with FSDP.