Skip to content
View shuyhere's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report shuyhere

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

research

39 repositories

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,792 492 Updated Nov 27, 2024

Locating and editing factual associations in GPT (NeurIPS 2022)

Python 599 134 Updated Apr 20, 2024
Python 317 16 Updated Jul 16, 2024

State-of-the-art LLM-based translation models.

Ruby 477 38 Updated Jan 24, 2025

Influence Analysis and Estimation - Survey, Papers, and Taxonomy

68 3 Updated Feb 27, 2024

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 65,360 11,203 Updated Jul 30, 2024

Codebase for Merging Language Models (ICML 2024)

Python 794 46 Updated May 5, 2024

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Jupyter Notebook 2,050 253 Updated Jan 23, 2025

[SIGIR'24] The official implementation code of MOELoRA.

Python 143 19 Updated Jul 22, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,064 546 Updated Oct 24, 2024

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

Rust 6,087 370 Updated Jun 24, 2024

Minimalist ML framework for Rust

Rust 16,402 1,010 Updated Jan 28, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,184 706 Updated Dec 17, 2024

Scale LLM Engine public repository

Python 789 61 Updated Jan 29, 2025

Customizable implementation of the self-instruct paper.

Python 1,034 71 Updated Mar 7, 2024

Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.

Python 12 1 Updated Feb 11, 2024

Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

Python 403 32 Updated Oct 21, 2023
Jupyter Notebook 67 8 Updated Aug 16, 2024

Feeling confused about super alignment? Here is a reading list

42 1 Updated Jan 9, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15,407 2,646 Updated Dec 18, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 15,617 1,266 Updated Dec 12, 2024

An Efficient "Factory" to Build Multiple LoRA Adapters

Python 291 55 Updated Jan 26, 2025

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,786 2,225 Updated Jul 29, 2024

Development repository for the Triton language and compiler

C++ 14,185 1,744 Updated Jan 29, 2025

2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.

Jupyter Notebook 1,933 131 Updated Nov 19, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,132 494 Updated May 3, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 6,568 652 Updated Jan 28, 2025

Benchmark baseline for retrieval qa applications

Jupyter Notebook 96 12 Updated Apr 14, 2024
Python 412 15 Updated Nov 2, 2023