Skip to content
View YangLinyi's full-sized avatar

Organizations

@openreasoner

Block or report YangLinyi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Self-adaptation Framework๐Ÿ™ that adapts LLMs for unseen tasks in real-time!

Python 876 101 Updated Jan 30, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,008 66 Updated Jan 25, 2025
Python 454 56 Updated Jan 2, 2025

Named Entity Recognition as Dependency Parsing

Python 348 39 Updated Aug 16, 2023

A brief and partial summary of RLHF algorithms.

89 2 Updated Nov 24, 2024

Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.

Python 157 12 Updated Oct 22, 2024

PyTorch library for Active Fine-Tuning

Python 53 3 Updated Jan 31, 2025

A recipe for online RLHF and online iterative DPO.

Python 467 51 Updated Dec 28, 2024

FeatureAlignment = Alignment + Mechanistic Interpretability

Python 27 1 Updated Jan 17, 2025

[NeurIPS 2024] Can Language Models Learn to Skip Steps?

Python 12 Updated Jan 25, 2025

A survey on harmful fine-tuning attack for large language model

129 2 Updated Feb 1, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 ๐Ÿ“ and reasoning techniques.

6,355 353 Updated Feb 3, 2025

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1 Updated Oct 12, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,502 116 Updated Jan 17, 2025

O1 Replication Journey

1,914 59 Updated Jan 14, 2025

Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)

173 11 Updated Sep 22, 2024

Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!

Python 9 Updated Oct 16, 2024

๐Ÿš€ Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,846 105 Updated Feb 3, 2025

Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24

Python 152 21 Updated Dec 3, 2024

This is the official repository for paper: "Human Simulacra: Benchmarking the Personification of Large Language Models"

Python 19 2 Updated Jan 20, 2025

Personality Alignment of Language Models

Python 20 2 Updated Sep 2, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,048 1,294 Updated Jan 27, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 39,004 5,138 Updated Feb 1, 2025

Training Sparse Autoencoders on Language Models

Jupyter Notebook 605 137 Updated Feb 3, 2025

LLM training in simple, raw C/CUDA

Cuda 1 Updated Jul 11, 2024

[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models

Python 36 2 Updated Jul 19, 2024

LLM training in simple, raw C/CUDA

Cuda 25,207 2,894 Updated Oct 2, 2024
Next