This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalizat…

Python 75 1 Updated Dec 2, 2024

ranpox / openreview-visualization

OpenReivew Submission Visualization (ICLR 2024/2025)

Python 151 8 Updated Oct 17, 2024

HKUNLP / diffusion-vs-ar

[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"

Python 34 2 Updated Feb 14, 2025

ranpox / awesome-computer-use

This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.

241 7 Updated Feb 24, 2025

HKUNLP / STRING

[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

Python 70 3 Updated Nov 25, 2024

qtli / GSM-Plus

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

Python 54 6 Updated Jul 8, 2024

HKUNLP / DiffuLLaMA

[ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Python 101 6 Updated Feb 19, 2025

GAIR-NLP / O1-Journey

O1 Replication Journey

1,961 65 Updated Jan 14, 2025

GAIR-NLP / ProX

Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"

Python 220 19 Updated Feb 16, 2025

xlang-ai / Spider2-V

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Jupyter Notebook 117 8 Updated Aug 26, 2024

xlang-ai / Spider2

[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

HTML 341 43 Updated Feb 27, 2025

KindXiaoming / pykan

Kolmogorov Arnold Networks

Jupyter Notebook 15,451 1,454 Updated Jan 19, 2025

jonbarron / website

HTML 2,866 2,318 Updated Feb 18, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 28,411 3,293 Updated Jan 26, 2025

princeton-nlp / USACO

Can Language Models Solve Olympiad Programming?

Python 110 11 Updated Jan 14, 2025

kanishkg / stream-of-search

Repository for the paper Stream of Search: Learning to Search in Language

Python 138 19 Updated Feb 3, 2025

xlang-ai / OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 1,646 186 Updated Feb 28, 2025

YangLing0818 / Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

3,108 259 Updated Feb 27, 2025

xai-org / grok-1

Grok open release

Python 50,189 8,365 Updated Aug 30, 2024

pipilurj / bootstrapped-preference-optimization-BPO

code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"

Python 54 1 Updated Aug 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiacheng Ye jiacheng-ye

Achievements

Achievements

Block or report jiacheng-ye

Stars

HKUNLP / DiffuSearch

xlang-ai / AgentTrek

OpenEvaByte / evabyte

huggingface / datatrove

kiaia / GIRAFFE

xlang-ai / aguvis

SWE-Gym / SWE-Gym

OpenRLHF / OpenRLHF

huggingface / nanotron

facebookresearch / lingua

Furyton / awesome-language-model-analysis