Skip to content
View 7350399925's full-sized avatar

Block or report 7350399925

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.

547 36 Updated Nov 11, 2024

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…

Cuda 926 148 Updated Jul 29, 2023

Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.

Cuda 353 72 Updated Sep 8, 2024

Development repository for the Triton language and compiler

MLIR 14,626 1,820 Updated Feb 27, 2025

Top free VPN (ClashX & V2Ray proxy) with subscription links. [免费VPN、免费梯子、免费科学上网、免费订阅链接、免费节点、精选、ClashX & V2Ray 教程]

Python 4,093 341 Updated Jul 25, 2024

Stable Diffusion AI client app for Android

Kotlin 826 79 Updated Feb 14, 2025

Compose Multiplatform app generates images using Stability AI

Kotlin 57 6 Updated Mar 26, 2024

Stable Diffusion in NCNN with c++, supported txt2img and img2img

C++ 1,026 99 Updated Jul 3, 2023

llm deploy project based mnn. This project has merged into MNN.

C++ 1,554 173 Updated Jan 20, 2025

CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)

C 1,049 343 Updated Feb 19, 2025

ppl.cv is a high-performance image processing library of openPPL supporting various platforms.

C++ 498 113 Updated Oct 30, 2024

High-efficiency floating-point neural network inference operators for mobile, server, and Web

C 1,966 394 Updated Feb 27, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 6,093 419 Updated Feb 20, 2025
C++ 243 39 Updated Sep 15, 2023

mperf是一个面向移动/嵌入式平台的算子性能调优工具箱

C++ 178 29 Updated Aug 17, 2023

Use GraphicBuffer class from Android native code

C++ 200 54 Updated Mar 30, 2021

Stable Diffusion with Core ML on Apple Silicon

Python 17,168 970 Updated Jan 23, 2025

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 846 165 Updated Dec 30, 2024

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 9,818 1,757 Updated Feb 25, 2025

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 7,111 780 Updated Feb 27, 2025

字节跳动笔试、面试、工作总结

C 65 14 Updated Mar 28, 2019

Protocol Buffers - Google's data interchange format

C++ 66,691 15,624 Updated Feb 27, 2025

A curated list of awesome things related to HarmonyOS. 华为鸿蒙操作系统。

C 19,497 3,310 Updated Jul 19, 2024