Training Open-ended Policies to follow Video-prompt Instructions with Reinforcement Learning
Published in OpenReview preprint, 2024
Recommended citation: K. He, B. Zhang, Z. Wang, S. Cai, Q. Fu, H. Fu, A. Liu, and Y. Liang. (2024). "Training Open-ended Policies to follow Video-prompt Instructions with Reinforcement Learning." OpenReview preprint.
