News

Deep Reinforcement Learning for mobile robot navigation in the ROS Gazebo simulator. Using a Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal ...
R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning.