با برنامه Player FM !
Jacob Beck and Risto Vuorio
Manage episode 357253007 series 2536330
Jacob Beck and Risto Vuorio on their recent Survey of Meta-Reinforcement Learning. Jacob and Risto are Ph.D. students at Whiteson Research Lab at University of Oxford.
Featured Reference
A Survey of Meta-Reinforcement Learning
Jacob Beck, Risto Vuorio, Evan Zheran Liu, Zheng Xiong, Luisa Zintgraf, Chelsea Finn, Shimon Whiteson
Additional References
- VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning, Luisa Zintgraf et al
- Mastering Diverse Domains through World Models (Dreamerv3), Hafner et al
- Unsupervised Meta-Learning for Reinforcement Learning (MAML), Gupta et al
- Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices (DREAM), Liu et al
- RL2: Fast Reinforcement Learning via Slow Reinforcement Learning, Duan et al
- Learning to reinforcement learn, Wang et al
72 قسمت
Manage episode 357253007 series 2536330
Jacob Beck and Risto Vuorio on their recent Survey of Meta-Reinforcement Learning. Jacob and Risto are Ph.D. students at Whiteson Research Lab at University of Oxford.
Featured Reference
A Survey of Meta-Reinforcement Learning
Jacob Beck, Risto Vuorio, Evan Zheran Liu, Zheng Xiong, Luisa Zintgraf, Chelsea Finn, Shimon Whiteson
Additional References
- VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning, Luisa Zintgraf et al
- Mastering Diverse Domains through World Models (Dreamerv3), Hafner et al
- Unsupervised Meta-Learning for Reinforcement Learning (MAML), Gupta et al
- Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices (DREAM), Liu et al
- RL2: Fast Reinforcement Learning via Slow Reinforcement Learning, Duan et al
- Learning to reinforcement learn, Wang et al
72 قسمت
همه قسمت ها
×به Player FM خوش آمدید!
Player FM در سراسر وب را برای یافتن پادکست های با کیفیت اسکن می کند تا همین الان لذت ببرید. این بهترین برنامه ی پادکست است که در اندروید، آیفون و وب کار می کند. ثبت نام کنید تا اشتراک های شما در بین دستگاه های مختلف همگام سازی شود.