Projects
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
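The "composable transformations" idea above can be illustrated without any library dependency: a minimal sketch in plain Python, where `grad` is a forward finite-difference (not real automatic differentiation) and `vmap` is an ordinary map — both are illustrative stand-ins, not the project's actual API.

```python
# Hedged sketch: function transformations as composable higher-order functions.
# `grad` approximates a derivative by finite differences; `vmap` maps over a batch.
def grad(f, eps=1e-6):
    return lambda x: (f(x + eps) - f(x)) / eps

def vmap(f):
    return lambda xs: [f(x) for x in xs]

square = lambda x: x * x

# Compose the transformations: a vectorized derivative of `square`.
dsquare_batch = vmap(grad(square))
```

The point of the composition is that each transformation takes a function and returns a function, so they stack in any order.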
Open deep learning compiler stack for CPUs, GPUs and specialized accelerators
An Open Source Machine Learning Framework for Everyone
Tensors and Dynamic neural networks in Python with strong GPU acceleration
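"Dynamic" here means define-by-run: the computation graph is built as operations execute, rather than declared up front. A toy sketch of that idea (inspired by the concept, not the framework's real API or internals):

```python
# Minimal define-by-run reverse-mode autograd: each op records how to
# propagate gradients to its inputs as it runs.
class Value:
    def __init__(self, data, parents=()):
        self.data = data
        self.grad = 0.0
        self._parents = parents
        self._backward = lambda: None

    def __mul__(self, other):
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def __add__(self, other):
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def backward(self):
        # Topologically order the recorded graph, then run gradients in reverse.
        order, seen = [], set()
        def visit(v):
            if id(v) not in seen:
                seen.add(id(v))
                for p in v._parents:
                    visit(p)
                order.append(v)
        visit(self)
        self.grad = 1.0
        for v in reversed(order):
            v._backward()
```

For example, `y = x * x + x` built from `x = Value(3.0)` yields `x.grad == 7.0` after `y.backward()`, matching d(x² + x)/dx = 2x + 1.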
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
A flexible framework of neural networks for deep learning
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
Development repository for the Triton language and compiler
ncnn is a high-performance neural network inference framework optimized for the mobile platform
A machine learning compiler for GPUs, CPUs, and ML accelerators
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
SGLang is a fast serving framework for large language models and vision language models.
Hackable and optimized Transformers building blocks, supporting a composable construction.
[EMNLP'23, ACL'24] To speed up LLM inference and sharpen LLMs' perception of key information, compress the prompt and KV cache, achieving up to 20x compression with minimal performance loss.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
A high-throughput and memory-efficient inference and serving engine for LLMs
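A key trick behind memory-efficient LLM serving is paging the KV cache: each sequence maps logical token positions to fixed-size physical blocks, so memory grows on demand instead of being reserved at maximum length per sequence. A hedged, illustrative sketch of that bookkeeping (names and block size are made up for illustration, not the engine's actual implementation):

```python
# Illustrative paged KV-cache bookkeeping: a free list of physical blocks
# plus a per-sequence block table from logical positions to blocks.
BLOCK = 4  # tokens per block (illustrative choice)

class PagedKVCache:
    def __init__(self, num_blocks):
        self.free = list(range(num_blocks))      # unallocated physical blocks
        self.tables = {}                         # seq_id -> list of block ids

    def append(self, seq_id, n_tokens_so_far):
        """Return the physical block holding the next token of `seq_id`,
        allocating a fresh block only when the current one is full."""
        table = self.tables.setdefault(seq_id, [])
        if n_tokens_so_far % BLOCK == 0:         # current block is full
            table.append(self.free.pop())
        return table[-1]
```

Because blocks are allocated one at a time, sequences of very different lengths can share the same pool without fragmentation from worst-case preallocation.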
Fast and memory-efficient exact attention
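"Exact" attention stays memory-efficient by never materializing the full softmax: a running max and normalizer are updated online as scores stream in. A minimal single-query, scalar-value sketch of that online-softmax trick (the core idea only, assuming a non-empty score list — not the tiled GPU kernel itself):

```python
import math

def online_softmax_weighted_sum(scores, values):
    # One pass, numerically stable: rescale the running sums whenever a
    # new maximum score appears, so no intermediate softmax is stored.
    m = float("-inf")  # running max of scores seen so far
    s = 0.0            # running softmax normalizer
    acc = 0.0          # running softmax-weighted sum of values
    for x, v in zip(scores, values):
        m_new = max(m, x)
        scale = math.exp(m - m_new)          # rescale old terms to new max
        s = s * scale + math.exp(x - m_new)
        acc = acc * scale + math.exp(x - m_new) * v
        m = m_new
    return acc / s
```

The result equals the usual two-pass softmax-weighted sum, but only constant extra state is kept per query, which is what lets attention be computed block by block.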