suoyuexh

0 followers · 3 following

Stars

deepseek-ai / FlashMLA

FlashMLA: Efficient MLA decoding kernels

C++ 11,016 748 Updated Mar 1, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 8,072 2,160 Updated Mar 3, 2025

BerriAI / litellm

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,308 2,256 Updated Mar 3, 2025

deepseek-ai / DeepSeek-R1

84,506 10,914 Updated Feb 24, 2025

ZJU-LLMs / Foundations-of-LLMs

8,213 699 Updated Jan 14, 2025

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 14,565 1,700 Updated Jan 23, 2025

deepseek-ai / DeepSeek-LLM

DeepSeek LLM: Let there be answers

Makefile 6,098 944 Updated Feb 4, 2024

deepseek-ai / DeepSeek-V3

Python 90,622 14,622 Updated Feb 24, 2025

sahil280114 / codealpaca

Python 1,456 109 Updated May 12, 2023

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,855 4,060 Updated Jul 17, 2024

carriex / recomp

RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.

Python 118 8 Updated Jun 30, 2024

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

864 51 Updated Feb 27, 2025

RUCAIBox / LLMBox

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 763 98 Updated Feb 22, 2025

abetlen / llama-cpp-python

Python bindings for llama.cpp

Python 8,754 1,068 Updated Jan 29, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 75,744 10,949 Updated Mar 3, 2025

ImagineAILab / ai-by-hand-excel

3,777 485 Updated Jan 28, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,588 1,765 Updated Feb 26, 2025

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 21,493 2,787 Updated Aug 15, 2024

huggingface / course

The Hugging Face course on Transformers

MDX 2,582 819 Updated Mar 3, 2025

varungodbole / prompt-tuning-playbook

A playbook for effectively prompting post-trained LLMs

837 35 Updated Jan 21, 2025

hzwer / WritingAIPaper

Writing AI Conference Papers: A Handbook for Beginners

1,974 68 Updated Feb 13, 2025

kourgeorge / arxiv-style

A LaTeX style and template for paper preprints (based on NIPS style)

TeX 1,216 326 Updated Jan 2, 2024

google-research / text-to-text-transfer-transformer

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,278 762 Updated Feb 27, 2025

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 140,530 28,179 Updated Mar 3, 2025

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,891 3,554 Updated Jun 2, 2023

extreme-assistant / Deep-learning-datasets

A categorized collection of public datasets for the various areas of deep learning

209 24 Updated Dec 27, 2023

d2l-ai / d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 25,109 4,531 Updated Aug 18, 2024

udlbook / udlbook

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 7,165 1,511 Updated Feb 16, 2025

wgwang / awesome-LLM-benchmarks

Awesome LLM Benchmarks to evaluate the LLMs across text, code, image, audio, video and more.

131 11 Updated Jan 3, 2024

THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,031 5,236 Updated Jun 27, 2024
