Lists (12)
Sort Name ascending (A-Z)
Starred repositories
An annotated implementation of the Transformer paper.
Build Container Images In Kubernetes
A course on aligning smol models.
校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
Efficient Triton Kernels for LLM Training
Cloud-native high-performance edge/middle/service proxy
The modern editor for API Design and Technical Writing.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Ongoing research training transformer models at scale
This repository contains resources for technical coding interviews.
Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes
SDK for building Kubernetes applications. Provides high level APIs, useful abstractions, and project scaffolding.
Transformers 3rd Edition
Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Standardized Serverless ML Inference Platform on Kubernetes
DSPy: The framework for programming—not prompting—language models
Puppet PadLocal is a Pad Protocol for WeChat
NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference i…
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
This repository contains tutorials and examples for Triton Inference Server
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.
Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas