Lists (22)
Sort Name ascending (A-Z)
agent paper list
🐵 AIGC
APP-Agent
CFI
🔏differential-privacy
edge comput
File compression
📁FPGA
🔆IC
⚡ Inspiration
JAP
JSP
lecture
🎇LLM
LLM4OP
✅MCM
MLLM
🚀 My resp
offload
💬 Others
📙 research experience
RL
Starred repositories
Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"
A research prototype of a human-centered web agent
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Memory for AI Agents; SOTA in AI Agent Memory; Announcing OpenMemory MCP - local and secure memory management.
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”
⭐ Linux / Windows / macOS 跨平台 V2Ray 客户端 | 支持 VMess / VLESS / SSR / Trojan / Trojan-Go / NaiveProxy / HTTP / HTTPS / SOCKS5 | 使用 C++ / Qt 开发 | 可拓展插件式设计 ⭐
Official code repo for the paper "LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark"
MPO: Boosting LLM Agents with Meta Plan Optimization
[NeurIPS 2024] Agent Planning with World Knowledge Model
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Solve Visual Understanding with Reinforced VLMs
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
A simple screen parsing tool towards pure vision based GUI agent
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
Agent S: an open agentic framework that uses computers like a human
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.
An environment for mobile angets to interact with realistic android device or android emulator
Official implementation of AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
AndroidWorld is an environment and benchmark for autonomous agents
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’