Stars
One second to read GitHub code with VS Code.
An optimizing compiler for decision tree ensemble inference.
Backward compatible ML compute opset inspired by HLO/MHLO
A retargetable MLIR-based machine learning compiler and runtime toolkit.
A machine learning compiler for GPUs, CPUs, and ML accelerators
Open standard for machine learning interoperability
Using Low-rank adaptation to quickly fine-tune diffusion models.
Development repository for the Triton language and compiler
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
Privacy Preserving Collaborative Encrypted Network Traffic Classification (Differential Privacy, Federated Learning, Membership Inference Attack, Encrypted Traffic Classification)
Bolt is a deep learning library with high performance and heterogeneous flexibility.
Training neural networks in TensorFlow 2.0 with 5x less memory
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
A curated reading list of research in Mixture-of-Experts(MoE).
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.
OpenHarmony documentation | OpenHarmony开发者文档
A GPS bicycle speedometer that supports offline maps and track recording
A rewrite of the old legacy software "depends.exe" in C# for Windows devs to troubleshoot dll load dependencies issues.
Open deep learning compiler stack for cpu, gpu and specialized accelerators
A domain specific language to express machine learning workloads.
a language for fast, portable data-parallel computation
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M