ChrisGao001

ChrisGao001

Stars

DefTruth / Awesome-LLM-Inference

📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉

3,635 254 Updated Mar 4, 2025

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 5,837 507 Updated Mar 13, 2025

ztxz16 / fastllm

纯c++的全平台llm加速库，支持python调用，chatglm-6B级模型单卡可达10000+token / s，支持glm, llama, moss基座，手机端流畅运行

C++ 3,430 351 Updated Mar 12, 2025

openvinotoolkit / openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 7,966 2,466 Updated Mar 13, 2025

tlc-pack / relax

Python 194 57 Updated Mar 28, 2023

NVIDIA-Merlin / HierarchicalKV

HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on h…

Cuda 140 27 Updated Mar 2, 2025

onnx / onnxmltools

ONNXMLTools enables conversion of models to ONNX

Python 1,055 192 Updated Jan 8, 2025

BBuf / tvm_mlir_learn

compiler learning resources collect.

Python 2,308 342 Updated May 27, 2024

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,315 2,166 Updated Mar 11, 2025

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 87,852 23,578 Updated Mar 13, 2025

gaoljhy / pilipili

Forked from fengfan0409/pilipili

blibili

Go 3 Updated Nov 29, 2019

NVIDIA-Merlin / HugeCTR

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

C++ 977 200 Updated Mar 13, 2025

chenxingxing6 / AdminUi

后台ui大全（有这些你就够了）https://blog.csdn.net/m0_37499059/article/details/80519211

95 78 Updated May 31, 2018

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,628 2,862 Updated Mar 12, 2025

chromium / chromium

The official GitHub mirror of the Chromium source

C++ 20,114 7,410 Updated Mar 13, 2025

zakirullin / tiny-compiler

A tiny compiler for a language featuring LL(2) with Lexer, Parser, ASM-like codegen and VM. Complex enough to give you a flavour of how the "real" thing works whilst not being a mere toy example

C 565 46 Updated Mar 20, 2023

GoogleCloudPlatform / tf-estimator-tutorials

This repository includes tutorials on how to use the TensorFlow estimator APIs to perform various ML tasks, in a systematic and standardised way

Jupyter Notebook 671 234 Updated Aug 20, 2024

yao62995 / tensorflow

图解tensorflow 源码

2,180 597 Updated Nov 11, 2016

jemalloc / jemalloc

C 9,830 1,478 Updated Mar 13, 2025

vearch / vearch

Distributed vector search for AI-native applications

Go 2,140 342 Updated Mar 13, 2025

Qihoo360 / floyd

Forked from PikaLabs/floyd

A raft consensus implementation that is simply and understandable

C++ 152 51 Updated Jun 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChrisGao001

Block or report ChrisGao001