
A flexible and efficient training framework for large-scale alignment tasks

Python 263 20 Updated Jan 3, 2025

Run your deep learning workloads on Kubernetes more easily and efficiently.

Go 513 79 Updated Mar 4, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,099 4,180 Updated Jan 4, 2025

Retrieval and Retrieval-augmented LLMs

Python 8,125 595 Updated Jan 3, 2025

Ongoing research training transformer models at scale

Python 11,003 2,457 Updated Jan 4, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 85,515 23,021 Updated Jan 4, 2025

An Open Source Machine Learning Framework for Everyone

C++ 187,082 74,389 Updated Jan 4, 2025

Best practice for training LLaMA models in Megatron-LM

Python 638 55 Updated Jan 2, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

538 26 Updated Dec 30, 2024

📖 A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉

3,096 210 Updated Jan 3, 2025

Survey: A collection of AWESOME papers and resources on the large language model (LLM) related recommender system topics.

1,130 64 Updated Dec 30, 2024

An easy-to-use framework for large scale recommendation algorithms.

Python 67 15 Updated Jan 2, 2025

Fast and memory-efficient exact attention

Python 14,897 1,410 Updated Jan 2, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 1,687 169 Updated Jan 4, 2025