
University of California, Los Angeles
Los Angeles
https://threesr.github.io/
Stars
This repository provides a valuable reference for researchers in multimodality. Start exploring RL-based reasoning MLLMs here!
A fork to add multimodal model training to open-r1
verl: Volcano Engine Reinforcement Learning for LLMs
Latest Advances on System-2 Reasoning
Building a comprehensive and handy list of papers for GUI agents
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
A library for advanced large language model reasoning
A Survey on Vision-Language Geo-Foundation Models (VLGFMs)
A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"
Code for 3D-LLM: Injecting the 3D World into Large Language Models
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
A PyTorch native library for large-scale model training
A curated list of recent diffusion models for video generation, editing, and various other applications.
TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.
A Chinese-language introductory tutorial for LangChain
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
A curated list of Monte Carlo tree search papers with implementations.
Paper collection on building and evaluating language model agents via executable language grounding
Codebase for the AAAI 2024 conference paper "Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning"
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Reading list on hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
🩹Editing large language models within 10 seconds⚡
PVRNet: Point-View Relation Neural Network for 3D Shape Recognition (AAAI 2019)