chrisliu298

Scalable RL solution for advanced reasoning of language models

Python 920 60 Updated Jan 17, 2025

Friends of OLMo and their links.

247 15 Updated Dec 15, 2024

🟣 LLM interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.

324 34 Updated Jul 2, 2024

veRL: Volcano Engine Reinforcement Learning for LLM

Python 705 55 Updated Jan 21, 2025

Best practices and guides on how to write distributed PyTorch training code

Python 338 22 Updated Jan 15, 2025

Skywork Reward Model Series

9 1 Updated Sep 6, 2024

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Jupyter Notebook 53 3 Updated Aug 30, 2024

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

Python 569 59 Updated Jan 9, 2025

Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!

22,034 384 Updated Jan 20, 2025

A comprehensive repository of reasoning tasks for LLMs (and beyond)

JavaScript 298 40 Updated Sep 27, 2024

Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"

Python 42 6 Updated Oct 14, 2024

Improving Alignment and Robustness with Circuit Breakers

Jupyter Notebook 175 24 Updated Sep 24, 2024

A fast + lightweight implementation of the GCG algorithm in PyTorch

Python 1 Updated Jul 25, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 34,148 5,249 Updated Jan 21, 2025

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

150 4 Updated Sep 26, 2024

Must-read Papers on Knowledge Editing for Large Language Models.

993 66 Updated Dec 22, 2024

A recipe for online RLHF and online iterative DPO.

Python 458 51 Updated Dec 28, 2024

LLM101n: Let's build a Storyteller

31,073 1,699 Updated Aug 1, 2024

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024

Python 65 5 Updated Sep 30, 2024

Powerful menu bar manager for macOS

Swift 15,897 288 Updated Jan 16, 2025

Universal and Transferable Attacks on Aligned Language Models

Python 3,579 487 Updated Aug 2, 2024

A simple, online, minimal, keyboard-centered Firefox CSS theme.

CSS 109 8 Updated Aug 26, 2024

A resource repository for machine unlearning in large language models

289 13 Updated Jan 16, 2025

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 137,820 27,624 Updated Jan 21, 2025

A library for making RepE control vectors

Jupyter Notebook 531 42 Updated Jan 8, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,330 193 Updated Aug 11, 2024

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 778 89 Updated Aug 14, 2024

Landing Page for TOFU

Python 107 28 Updated Dec 20, 2024