I'm trying to reproduce the leaderboard scores using OpenClaw (3.13), but it's been difficult due to the lack of specific settings (e.g., timeout, max_steps, context window, etc.). Could you please share the detailed evaluation configuration? Thank you very much!
I'm trying to reproduce the leaderboard scores using OpenClaw (3.13), but it's been difficult due to the lack of specific settings (e.g., timeout, max_steps, context window, etc.). Could you please share the detailed evaluation configuration? Thank you very much!