-
Peking University
- Beijing
-
16:44
(UTC +08:00) - htlou.github.io
- @HantaoLou
- https://scholar.google.com/citations?user=h1s9iX4AAAAJ
Highlights
- Pro
Stars
[AAAI Alignment Track 25 Poster] Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction
Open source replication of Anthropic's Crosscoders for Model Diffing
A list of AI opportunities in academia for undergraduate and graduate students. Internships, scholarships, and fellowships.
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
electron-ssr原作者删除了这个伟大的项目,故备份了下来,不继续开发,且用且珍惜
A simple utility to execute your deep learning scripts when there are enough idle gpus | 一个在有足够的空闲gpu时执行深度学习训练的小工具
Training Sparse Autoencoders on Language Models
Sparsify transformers with SAEs and transcoders
Align Anything: Training All-modality Model with Feedback
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-contex…
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
XuehaiPan / torchrec
Forked from pytorch/torchrecPytorch domain library for recommendation systems
A markdown version emoji cheat sheet
Thorn in a HaizeStack test for evaluating long-context adversarial robustness.
A guidance language for controlling large language models.
The nnsight package enables interpreting and manipulating the internals of deep learned models.
⚡ Dynamically generated stats for your github readmes
Self-Supervised Alignment with Mutual Information
xmcp / Bilibili-Evolved
Forked from the1812/Bilibili-Evolved强大的哔哩哔哩增强脚本
R package for flexible correlation matrices based on ggplot2
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Automation scripts for setting up a basic development environment.