- King Abdullah University of Science and Technology, University of Macau, UfishAI
- Jeddah
- https://www.notion.so/shuyhere/Shu-Yang-1210f14e46e080f18511e448279487e6?pvs=4
- @shuyhere
Highlights
- Pro
research
A series of large language models trained from scratch by developers @01-ai
Locating and editing factual associations in GPT (NeurIPS 2022)
Influence Analysis and Estimation - Survey, Papers, and Taxonomy
Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open for discussion. The Chinese and English editions are used for teaching at over 500 universities in more than 70 countries.
Codebase for Merging Language Models (ICML 2024)
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
[SIGIR'24] The official implementation code of MOELoRA.
Firefly: a training toolkit for large language models, supporting training of Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other models.
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" (a minimal sketch of the low-rank update follows this list).
Customizable implementation of the self-instruct paper.
Mixture of Experts (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
Firefly Chinese LLaMA-2 large language models, supporting incremental pretraining of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other models.
Feeling confused about super alignment? Here is a reading list
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
An Efficient "Factory" to Build Multiple LoRA Adapters
Instruct-tune LLaMA on consumer hardware
Development repository for the Triton language and compiler
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Accessible large language models via k-bit quantization for PyTorch.
Benchmark baseline for retrieval QA applications
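The loralib entry above names LoRA (low-rank adaptation). As a reminder of the idea, here is a minimal sketch in plain NumPy rather than loralib's actual API; the dimensions, rank r, and scaling alpha are illustrative assumptions, not values from any repo listed here.

```python
import numpy as np

# Minimal LoRA sketch: instead of updating a frozen weight W (d_out x d_in),
# learn a low-rank update B @ A with A (r x d_in) and B (d_out x r), r << d_in.
d_in, d_out, r, alpha = 1024, 1024, 8, 16  # illustrative shapes and rank

rng = np.random.default_rng(0)
W = rng.standard_normal((d_out, d_in))       # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01    # trainable factor, small random init
B = np.zeros((d_out, r))                     # trainable factor, zero init: no change at start

def lora_forward(x):
    # Base projection plus the scaled low-rank update (alpha / r) * B @ A @ x.
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.standard_normal(d_in)
y = lora_forward(x)   # equals W @ x until B is trained away from zero
```

Only A and B are trained while W stays frozen, which is what keeps the adapter small to store and easy to merge back into the base weight.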