Skip to content
View huangyungaodq's full-sized avatar

Block or report huangyungaodq

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 3,303 251 Updated Feb 21, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 81,237 61,022 Updated Feb 24, 2025

This is InfiniRetri, a tool enhance Transformer-based LLMs(Large Language Model) ablity to hangle Long-Context.

11 Updated Feb 19, 2025

一个基于预训练的句向量生成工具

Python 135 11 Updated Mar 30, 2023
Python 5 Updated Feb 24, 2025

Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.

C 69 2 Updated Feb 2, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 10,709 1,052 Updated Feb 24, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 31,923 2,122 Updated Feb 22, 2025

Fully open reproduction of DeepSeek-R1

Python 21,322 1,874 Updated Feb 24, 2025

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,222 467 Updated Nov 6, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,616 1,084 Updated Feb 20, 2025

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,449 2,853 Updated Feb 24, 2025

🔒 Enterprise-grade API gateway that helps you monitor and impose cost or rate limits per API key. Get fine-grained access control and monitoring per user, application, or environment. Supports Open…

Go 996 68 Updated Jan 5, 2025

🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.

Rust 403 36 Updated Feb 23, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 1,966 119 Updated Feb 23, 2025

保存微信历史版本

Shell 447 34 Updated Dec 19, 2024

continous batching and parallel acceleration for RWKV6

Python 24 Updated Jun 28, 2024
Python 2,224 153 Updated Feb 24, 2025

Rudimentary support for using multiple GPUs in a ComfyUI workflow

Python 192 22 Updated Aug 4, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,573 479 Updated Feb 12, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 68,364 7,350 Updated Feb 24, 2025

[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention

Python 28 1 Updated Feb 15, 2025

🔊 Cross browser Speech Synthesis also known as Text to speech or TTS; no dependencies; uses Web Speech API

JavaScript 224 23 Updated Feb 1, 2025

Streaming ASR and TTS based on FastAPI+ sherpa-onnx

Python 76 10 Updated Sep 30, 2024

ChatGPT web application. ChatGPT 网页应用,支持多对话、海量提示词、PWA、ASR、TTS

TypeScript 129 29 Updated Sep 8, 2024

实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …

Python 707 99 Updated Nov 15, 2024

Local SRT/LLM/TTS Voicechat

Python 624 66 Updated Oct 12, 2024

百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断

Python 675 117 Updated Feb 21, 2025
Next