Training Open-ended Policies to follow Video-prompt Instructions with Reinforcement Learning

Published in OpenReview preprint, 2024

Recommended citation: K. He, B. Zhang, Z. Wang, S. Cai, Q. Fu, H. Fu, A. Liu, and Y. Liang. (2024). "Training Open-ended Policies to follow Video-prompt Instructions with Reinforcement Learning." OpenReview preprint.