-
The University of Hong Kong
- Hong Kong, China
-
21:30
(UTC +08:00) - https://jiacheng-ye.github.io/
Stars
[ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"
[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
EvaByte: Efficient Byte-level Language Models at Scale
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Minimalistic large language model 3D-parallelism training
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalizat…
OpenReivew Submission Visualization (ICLR 2024/2025)
[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"
This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
Can Language Models Solve Olympiad Programming?
Repository for the paper Stream of Search: Learning to Search in Language
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Diffusion model papers, survey, and taxonomy
code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"