Skip to content
View Oneal65's full-sized avatar

Block or report Oneal65

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Deep Learning GPU Training System

HTML 4,142 1,377 Updated Jan 7, 2025

LeetCode 101:力扣刷题指南

8,950 1,182 Updated Dec 8, 2024

An annotated implementation of the Transformer paper.

Jupyter Notebook 5,881 1,254 Updated Apr 7, 2024

Build Container Images In Kubernetes

Go 15,080 1,445 Updated Jan 6, 2025

A course on aligning smol models.

Jupyter Notebook 4,025 1,315 Updated Jan 9, 2025

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++ 2,657 301 Updated Oct 26, 2024

Efficient Triton Kernels for LLM Training

Python 4,139 239 Updated Jan 9, 2025

Cloud-native high-performance edge/middle/service proxy

C++ 25,297 4,846 Updated Jan 10, 2025

The modern editor for API Design and Technical Writing.

820 50 Updated Feb 29, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,116 1,053 Updated Jan 8, 2025

Ongoing research training transformer models at scale

Python 11,055 2,471 Updated Jan 9, 2025

This repository contains resources for technical coding interviews.

3,418 358 Updated Mar 11, 2024

Prometheus Operator creates/configures/manages Prometheus clusters atop Kubernetes

Go 9,218 3,735 Updated Jan 9, 2025

SDK for building Kubernetes applications. Provides high level APIs, useful abstractions, and project scaffolding.

Go 7,295 1,749 Updated Jan 9, 2025

Transformers 3rd Edition

Jupyter Notebook 369 140 Updated Oct 30, 2024

Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inference Server.

Python 53 2 Updated Dec 19, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 37,608 4,840 Updated Jan 8, 2025

Standardized Serverless ML Inference Platform on Kubernetes

Python 3,772 1,086 Updated Jan 8, 2025

DSPy: The framework for programming—not prompting—language models

Python 20,879 1,579 Updated Jan 10, 2025

Puppet PadLocal is a Pad Protocol for WeChat

TypeScript 662 89 Updated Jul 4, 2023

NVIDIA Merlin is an open source library providing end-to-end GPU-accelerated recommender systems, from feature engineering and preprocessing to training deep learning models and running inference i…

Python 794 120 Updated Dec 5, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,567 1,507 Updated Jan 9, 2025

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,781 3,263 Updated Aug 12, 2024

This repository contains tutorials and examples for Triton Inference Server

Python 613 100 Updated Jan 9, 2025

Animation engine for explanatory math videos

Python 73,810 6,448 Updated Jan 8, 2025

LLM training in simple, raw C/CUDA

Cuda 24,989 2,845 Updated Oct 2, 2024

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

C++ 1,062 357 Updated Jul 5, 2024

汇总各大互联网公司容易考察的高频leetcode题🔥

18,844 2,712 Updated Mar 13, 2024

【干货】史上最全的PyTorch学习资源汇总

Python 4,295 823 Updated Aug 14, 2019

Alibaba Java Diagnostic Tool Arthas/Alibaba Java诊断利器Arthas

Java 35,870 7,522 Updated Dec 2, 2024
Next