A curated list of papers for generalist AI agents in both virtual and physical worlds.
Date | keywords | Paper | Publication | Others |
---|---|---|---|---|
May 2022 | Gato | A Generalist Agent | TMLR'22 | Report |
Feb 2024 | Interactive Agent Foundation Model | An Interactive Agent Foundation Model | ArXiv'24 | Report |
Date | keywords | Paper | Publication | Others |
---|---|---|---|---|
Mar 2018 | World Models | World Models | ArXiv'18 | Project |
Jan 2023 | DreamerV3 | Mastering Diverse Domains through World Models | ArXiv'23 | Project |
Aug 2023 | Human World Model | Structured World Models from Human Videos | RSS'23 | Project |
Feb 2024 | World Models | The Essential Role of Causality in Foundation World Models for Embodied AI | ArXiv'24 | Project |
Nov 2024 | WHALE | WHALE: Towards Generalizable and Scalable World Models for Embodied Decision-making | ArXiv'24 | Project |
Date | keywords | Paper | Publication | Others |
---|---|---|---|---|
Feb 2024 | Agent-Pro | Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization | ACL'24 | Project |
Dec 2023 | LARP | LARP: Language-Agent Role Play for Open-World Games | ArXiv'23 | Project |
Mar 2024 | SIMA | Scaling Instructable Agents Across Many Simulated Worlds | ArXiv'24 | Report |
Aug 2024 | Optimus-1 | Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | ArXiv'24 | Project |
Date | keywords | Paper | Publication | Others |
---|---|---|---|---|
Aug 2024 | VisualAgentBench | VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents | ArXiv'24 | Project |
We are currently under ongoing updates and always welcome contributions. If you find any interesting papers that are not included in this collection, feel free to open a pull request.
For any questions or suggestions, please contact Yongyuan Liang or Ruihan Yang.