Stars
SGLang is a fast serving framework for large language models and vision language models.
An extension library for TensorFlow to accelerate industrial recommendation-system model training
An industrial extension library for PyTorch to accelerate large-scale model training
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
DLRover: An Automatic Distributed Deep Learning System
Ring attention implementation with flash attention
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
This repository contains the results and code for the MLPerf™ Training v3.1 benchmark.
Robust recipes to align language models with human and AI preferences
High-speed download of LLaMA, Facebook's 65B-parameter large language model
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
ChatGLM2-6B: An Open Bilingual Chat LLM | Open-source bilingual dialogue language model
A high-accuracy, high-efficiency multi-task fine-tuning framework for Code LLMs. Accepted at KDD 2024.
Must-read papers on prompt-based tuning for pre-trained language models.
Ongoing research training transformer models at scale
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples
Kubernetes-native Deep Learning Framework
Pretrain and fine-tune AI models of any size on multiple GPUs or TPUs with zero code changes.
PyTorch extensions for high-performance and large-scale training.
Run your deep learning workloads on Kubernetes more easily and efficiently.
Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.