Skip to content
View jiacheng-ye's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report jiacheng-ye

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"

Python 9 Updated Feb 28, 2025

[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Python 22 Updated Feb 21, 2025

EvaByte: Efficient Byte-level Language Models at Scale

Python 83 3 Updated Feb 28, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,258 170 Updated Feb 12, 2025

Extending context length of visual language models

Python 7 Updated Dec 18, 2024

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Python 241 16 Updated Jan 14, 2025

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym

Jupyter Notebook 368 23 Updated Feb 26, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,163 517 Updated Mar 1, 2025

Minimalistic large language model 3D-parallelism training

Python 1,619 163 Updated Feb 28, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,449 243 Updated Feb 20, 2025

This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalizat…

Python 75 1 Updated Dec 2, 2024

OpenReivew Submission Visualization (ICLR 2024/2025)

Python 151 8 Updated Oct 17, 2024

[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"

Python 34 2 Updated Feb 14, 2025

This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.

241 7 Updated Feb 24, 2025

[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

Python 70 3 Updated Nov 25, 2024

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

Python 54 6 Updated Jul 8, 2024

[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Python 101 6 Updated Feb 19, 2025

O1 Replication Journey

1,961 65 Updated Jan 14, 2025

Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"

Python 220 19 Updated Feb 16, 2025

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Jupyter Notebook 117 8 Updated Aug 26, 2024

[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

HTML 341 43 Updated Feb 27, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 15,451 1,454 Updated Jan 19, 2025
HTML 2,866 2,318 Updated Feb 18, 2025

The official Meta Llama 3 GitHub site

Python 28,411 3,293 Updated Jan 26, 2025

Can Language Models Solve Olympiad Programming?

Python 110 11 Updated Jan 14, 2025

Repository for the paper Stream of Search: Learning to Search in Language

Python 138 19 Updated Feb 3, 2025

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,646 186 Updated Feb 28, 2025

Diffusion model papers, survey, and taxonomy

3,108 259 Updated Feb 27, 2025

Grok open release

Python 50,189 8,365 Updated Aug 30, 2024

code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"

Python 54 1 Updated Aug 23, 2024
Next