Skip to content
View suoyuexh's full-sized avatar

Block or report suoyuexh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

FlashMLA: Efficient MLA decoding kernels

C++ 11,016 748 Updated Mar 1, 2025

A framework for few-shot evaluation of language models.

Python 8,072 2,160 Updated Mar 3, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,308 2,256 Updated Mar 3, 2025

Let your Claude able to think

TypeScript 14,565 1,700 Updated Jan 23, 2025

DeepSeek LLM: Let there be answers

Makefile 6,098 944 Updated Feb 4, 2024
Python 1,456 109 Updated May 12, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,855 4,060 Updated Jul 17, 2024

RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.

Python 118 8 Updated Jun 30, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

864 51 Updated Feb 27, 2025

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 763 98 Updated Feb 22, 2025

Python bindings for llama.cpp

Python 8,754 1,068 Updated Jan 29, 2025

LLM inference in C/C++

C++ 75,744 10,949 Updated Mar 3, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,588 1,765 Updated Feb 26, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,493 2,787 Updated Aug 15, 2024

The Hugging Face course on Transformers

MDX 2,582 819 Updated Mar 3, 2025

A playbook for effectively prompting post-trained LLMs

837 35 Updated Jan 21, 2025

Writing AI Conference Papers: A Handbook for Beginners

1,974 68 Updated Feb 13, 2025

A Latex style and template for paper preprints (based on NIPS style)

TeX 1,216 326 Updated Jan 2, 2024

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,278 762 Updated Feb 27, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 140,530 28,179 Updated Mar 3, 2025

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,891 3,554 Updated Jun 2, 2023

整理分类深度学习各方向公开数据集

209 24 Updated Dec 27, 2023

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 25,109 4,531 Updated Aug 18, 2024

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 7,165 1,511 Updated Feb 16, 2025

Awesome LLM Benchmarks to evaluate the LLMs across text, code, image, audio, video and more.

131 11 Updated Jan 3, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 41,031 5,236 Updated Jun 27, 2024
Next