Ewerton, M., Villamizar, M., Jankowski, J., Calinon, S. and Odobez, J.-M. (2023)
A Multitask and Kernel Approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network
In Proc. IEEE/RSJ Intl Conf. on Intelligent Robots and Systems (IROS).
Abstract
Pushing is an essential motor skill involved in several manipulation tasks and has been an important research topic in robotics. Recent works have shown that Deep Q-Networks (DQNs) can learn pushing policies (when and where to push, and how) to solve manipulation tasks, potentially in synergy with other skills (e.g., grasping). Nevertheless, DQNs often assume a fixed setting and task, which may limit their deployment in practice. Furthermore, they suffer from sparse-gradient backpropagation when the action space is very large, a problem exacerbated by the fact that they are trained to predict state-action values from a single reward function aggregating several facets of the task, which makes model training challenging. To address these issues, we propose a multi-head task-parameterized DQN to learn robotic manipulation tasks, in particular pushing policies, and make the following contributions: i) we show that learning to predict different reward and task aspects can be beneficial compared to predicting a single value function in which reward factors are not disentangled; ii) we study several alternatives for generalizing a policy by encoding the target parameters either into the network layers or visually in the input; iii) we propose a kernelized version of the loss function, which yields better, faster, and more stable training. Extensive experiments in simulation validate our design choices, and we show that our architecture, trained on simulated data, achieves high performance in a real-robot setup involving a Franka Emika robot arm and unseen objects.
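
The following is a minimal illustrative sketch (in PyTorch) of the two ideas summarized above: a fully-convolutional Q-network with one output head per disentangled reward factor, and a kernelized TD loss that spreads the learning signal over neighbouring pixel-actions. This is not the authors' implementation; the class and function names, the number of heads, and the Gaussian-kernel choice are assumptions made for the example.

# Illustrative sketch only -- not the authors' released code. It assumes an
# image-like state, one output Q-map per reward component (instead of a single
# aggregated Q-map), and a Gaussian kernel that spreads the TD target around
# the executed pixel-action to mitigate sparse-gradient backpropagation.
import torch
import torch.nn as nn


class MultiHeadPushDQN(nn.Module):
    """Predicts one Q-map per reward component over a grid of push locations."""

    def __init__(self, in_channels: int = 4, n_heads: int = 3):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(in_channels, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
        )
        # One 1x1-conv head per disentangled reward factor.
        self.heads = nn.ModuleList([nn.Conv2d(64, 1, 1) for _ in range(n_heads)])

    def forward(self, state: torch.Tensor) -> torch.Tensor:
        feat = self.backbone(state)
        # (batch, n_heads, H, W): per-factor Q-maps; summing over heads gives
        # an aggregate Q-value usable for action selection.
        return torch.cat([head(feat) for head in self.heads], dim=1)


def kernelized_td_loss(q_maps, target_values, action_yx, sigma=2.0):
    """Spread per-head TD targets around the executed pixel-action with a
    Gaussian kernel, so nearby actions also receive a learning signal."""
    b, n_heads, h, w = q_maps.shape
    ys = torch.arange(h, dtype=torch.float32).view(h, 1)
    xs = torch.arange(w, dtype=torch.float32).view(1, w)
    kernels = []
    for (ay, ax) in action_yx:
        k = torch.exp(-((ys - ay) ** 2 + (xs - ax) ** 2) / (2 * sigma ** 2))
        kernels.append(k / k.max())
    kernel = torch.stack(kernels).unsqueeze(1)             # (b, 1, H, W)
    target = target_values.view(b, n_heads, 1, 1) * kernel  # (b, n_heads, H, W)
    # Weight the squared error by the kernel so distant actions are ignored.
    return (kernel * (q_maps - target) ** 2).mean()

As a usage sketch, net = MultiHeadPushDQN(); loss = kernelized_td_loss(net(state), targets, [(12, 20)]) would train a batch of one, with targets holding one TD target per reward head; at inference time the head outputs can be summed to pick the push location with the highest aggregate Q-value. The specific backbone, kernel normalization, and loss weighting here are placeholders rather than the paper's design.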
Bibtex reference
@inproceedings{Ewerton23IROS,
  author    = "Ewerton, M. and Villamizar, M. and Jankowski, J. and Calinon, S. and Odobez, J.-M.",
  title     = "A Multitask and Kernel Approach for Learning to Push Objects with a Task-Parameterized Deep Q-Network",
  booktitle = "Proc.\ {IEEE/RSJ} Intl Conf.\ on Intelligent Robots and Systems ({IROS})",
  year      = "2023",
  pages     = ""
}