Skip to content
View skydoorkai's full-sized avatar

Block or report skydoorkai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SGLang is a fast serving framework for large language models and vision language models.

Python 7,350 706 Updated Jan 16, 2025

An extension library of tensorflow to accelerate industrial recommendation system model training

C++ 8 Updated Nov 27, 2024

An industrial extension library of pytorch to accelerate large scale model training

Python 12 1 Updated Nov 27, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,238 344 Updated Jan 14, 2025

DLRover: An Automatic Distributed Deep Learning System

Python 1,311 169 Updated Jan 16, 2025

CUDA Templates for Linear Algebra Subroutines

C++ 6,005 1,040 Updated Jan 10, 2025
Python 1,242 177 Updated Nov 20, 2024

Ring attention implementation with flash attention

Python 645 56 Updated Dec 19, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,325 882 Updated Jul 1, 2024

Large Context Attention

Python 671 53 Updated Aug 12, 2024

Mamba SSM architecture

Python 13,793 1,184 Updated Jan 6, 2025

This repository contains the results and code for the MLPerf™ Training v3.1 benchmark.

Python 17 11 Updated Jan 14, 2025

Robust recipes to align language models with human and AI preferences

Python 4,900 425 Updated Nov 21, 2024

LLM inference in C/C++

C++ 70,790 10,233 Updated Jan 16, 2025

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Shell 4,167 417 Updated Jun 28, 2023
Python 7,720 505 Updated Apr 14, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,751 375 Updated Jul 11, 2024

ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型

Python 15,755 1,853 Updated Jun 27, 2024

High Accuracy and efficiency multi-task fine-tuning framework for Code LLMs. This work has been accepted by KDD 2024.

Python 657 66 Updated Dec 30, 2024

Enabling PyTorch on XLA Devices (e.g. Google TPU)

C++ 2,511 491 Updated Jan 16, 2025

Must-read papers on prompt-based tuning for pre-trained language models.

4,128 382 Updated Jul 17, 2023

Ongoing research training transformer models at scale

Python 11,106 2,481 Updated Jan 16, 2025

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,566 473 Updated Jan 8, 2024

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 9,544 1,733 Updated Jan 7, 2025

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

Jupyter Notebook 359 68 Updated Nov 19, 2024

Kubernetes-native Deep Learning Framework

Python 733 114 Updated Jan 26, 2024

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,802 3,416 Updated Jan 9, 2025

PyTorch extensions for high performance and large scale training.

Python 3,232 281 Updated Jan 12, 2025

Run your deep learning workloads on Kubernetes more easily and efficiently.

Go 514 79 Updated Mar 4, 2024

Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.

Python 921 86 Updated Oct 8, 2024
Next