Eurekaverse: Environment Curriculum Generation via Large Language Models
Title: | Eurekaverse: Environment Curriculum Generation via Large Language Models |
---|---|
Authors: | Liang, William; Wang, Sam; Wang, Hung-Ju; Bastani, Osbert; Jayaraman, Dinesh; Ma, Yecheng Jason |
Publication Year: | 2024 |
Collection: | Computer Science |
Subject Terms: | Computer Science - Robotics, Computer Science - Artificial Intelligence, Computer Science - Machine Learning |
Description: | Recent work has demonstrated that a promising strategy for teaching robots a wide range of complex skills is to train them on a curriculum of progressively more challenging environments. However, developing an effective curriculum of environment distributions currently requires significant expert effort, which must be repeated for every new domain. Our key insight is that environments are often naturally represented as code. Thus, we probe whether effective environment curriculum design can be achieved and automated via code generation by large language models (LLMs). In this paper, we introduce Eurekaverse, an unsupervised environment design algorithm that uses LLMs to sample progressively more challenging, diverse, and learnable environments for skill training. We validate Eurekaverse's effectiveness in the domain of quadrupedal parkour learning, in which a quadruped robot must traverse a variety of obstacle courses. The automatic curriculum designed by Eurekaverse enables gradual learning of complex parkour skills in simulation and transfers successfully to the real world, outperforming manual training courses designed by humans. Comment: Conference on Robot Learning (CoRL), 2024. Project website and code: https://eureka-research.github.io/eurekaverse |
Document Type: | Working Paper |
Access URL: | http://arxiv.org/abs/2411.01775 |
Accession Number: | edsarx.2411.01775 |
Database: | arXiv |