Starred repositories

Take screenshots online and compress images in the browser with WebAssembly

JavaScript 929 106 Updated Feb 12, 2025

A Python module to repair invalid JSON, commonly used to parse the output of LLMs

Python 1,520 79 Updated Feb 23, 2025

LLM Inference on consumer devices

Python 94 13 Updated Feb 26, 2025

The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.

Python 2,928 192 Updated Feb 26, 2025
Python 2,246 155 Updated Feb 24, 2025

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.

C 235 24 Updated Feb 13, 2025

Advanced Quantization Algorithm for LLMs/VLMs.

Python 378 29 Updated Feb 26, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 24,079 2,077 Updated Feb 26, 2025

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

Python 90 9 Updated Feb 26, 2025

A throughput-oriented high-performance serving framework for LLMs

Cuda 742 29 Updated Sep 21, 2024

🛠 A lite C++ toolkit of 100+ Awesome AI models, supporting ORT, MNN, NCNN, TNN and TensorRT. 🎉🎉

C++ 3,922 729 Updated Feb 23, 2025

Efficient Triton Kernels for LLM Training

Python 4,503 273 Updated Feb 26, 2025

SOTA Open Source TTS

Python 19,551 1,512 Updated Feb 18, 2025

BELLE: Be Everyone's Large Language model Engine (open-source Chinese conversational large language model)

HTML 8,060 768 Updated Oct 16, 2024

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++ 866 102 Updated Feb 19, 2025

🔗 Some useful websites for programmers.

66,107 8,065 Updated Feb 19, 2025

Materials for learning SGLang

293 19 Updated Feb 26, 2025

LeetCode problem-solving summary

8 Updated Dec 22, 2021

OpenOCR: A general OCR system with accuracy and efficiency. It supports 24 scene text recognition methods trained from scratch on large-scale real datasets, and the latest methods will continue to be added.

Python 495 39 Updated Feb 23, 2025

A collection of (mostly) technical things every software developer should know about

86,293 7,935 Updated Aug 6, 2024

Porting of Pillow resize method in C++ and OpenCV.

C++ 130 16 Updated Mar 22, 2023

GLM Series Edge Models

Python 128 7 Updated Feb 19, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 11,699 758 Updated Feb 26, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 2,670 163 Updated Feb 23, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper

Python 31,693 2,628 Updated Feb 26, 2025

LLM notes, including model inference, transformer model structure, and LLM framework code analysis.

Python 518 53 Updated Feb 26, 2025

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

Python 739 60 Updated Sep 4, 2024