Hi:
Thank you for your awesome work!
I have a question regarding PI05 and the fixed token length in your implementation. In PI05, the robot state should also be treated as part of the prompt, combined with language instruction and image encoding before tokenization. So the token length after the tokenizer can vary due to changes in the number in the robot state. But in your implementation, the token length must be fixed after set_prompt() if I understand correctly. Otherwise, it has to recapture the graph. And in your PI05 implementation, the robot state does not seem to be given as an input during inference. Can you explain to me why the state is missing, and do you have any plan or idea to deal with variant prompt length?
Thanks a lot!
Hi:
Thank you for your awesome work!
I have a question regarding PI05 and the fixed token length in your implementation. In PI05, the robot state should also be treated as part of the prompt, combined with language instruction and image encoding before tokenization. So the token length after the tokenizer can vary due to changes in the number in the robot state. But in your implementation, the token length must be fixed after set_prompt() if I understand correctly. Otherwise, it has to recapture the graph. And in your PI05 implementation, the robot state does not seem to be given as an input during inference. Can you explain to me why the state is missing, and do you have any plan or idea to deal with variant prompt length?
Thanks a lot!