Eurekaverse: Environment Curriculum Generation via Large Language Models

التفاصيل البيبلوغرافية
العنوان: Eurekaverse: Environment Curriculum Generation via Large Language Models
المؤلفون: Liang, William, Wang, Sam, Wang, Hung-Ju, Bastani, Osbert, Jayaraman, Dinesh, Ma, Yecheng Jason
سنة النشر: 2024
المجموعة: Computer Science
مصطلحات موضوعية: Computer Science - Robotics, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
الوصف: Recent work has demonstrated that a promising strategy for teaching robots a wide range of complex skills is by training them on a curriculum of progressively more challenging environments. However, developing an effective curriculum of environment distributions currently requires significant expertise, which must be repeated for every new domain. Our key insight is that environments are often naturally represented as code. Thus, we probe whether effective environment curriculum design can be achieved and automated via code generation by large language models (LLM). In this paper, we introduce Eurekaverse, an unsupervised environment design algorithm that uses LLMs to sample progressively more challenging, diverse, and learnable environments for skill training. We validate Eurekaverse's effectiveness in the domain of quadrupedal parkour learning, in which a quadruped robot must traverse through a variety of obstacle courses. The automatic curriculum designed by Eurekaverse enables gradual learning of complex parkour skills in simulation and can successfully transfer to the real-world, outperforming manual training courses designed by humans.
Comment: Conference on Robot Learning (CoRL), 2024. Project website and code: https://eureka-research.github.io/eurekaverse
نوع الوثيقة: Working Paper
URL الوصول: http://arxiv.org/abs/2411.01775
رقم الانضمام: edsarx.2411.01775
قاعدة البيانات: arXiv