Reinforcement Learning on a rust simulation that only gives delayed rewards.
-
Updated
Apr 29, 2026 - Python
Reinforcement Learning on a rust simulation that only gives delayed rewards.
Gridworld problem with dynamic obstacles implemented in PyGame GUI and solved using traditional RL agent (Q-learning) and deep RL agent (PPO algorithm).
Add a description, image, and links to the skrl topic page so that developers can more easily learn about it.
To associate your repository with the skrl topic, visit your repo's landing page and select "manage topics."