Skip to content
View sportzhang's full-sized avatar

Block or report sportzhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 787 55 Updated Mar 5, 2025

Multilingual Voice Understanding Model

Python 5,026 458 Updated Jan 8, 2025

🎓电子科技大学 📔课程资料

Python 3,018 438 Updated Mar 17, 2025

LLM inference in C/C++

C++ 76,989 11,157 Updated Mar 21, 2025

End-to-End Object Detection with Transformers

Python 14,140 2,537 Updated Mar 12, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 24,478 2,139 Updated Mar 22, 2025

App-Controller: Allow users to manipulate your App with natural language

Python 123 10 Updated Nov 23, 2024

[CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 3,178 323 Updated Feb 27, 2025

Let your Claude able to think

TypeScript 14,772 1,716 Updated Mar 10, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 9,047 926 Updated Mar 20, 2025

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 18,304 2,527 Updated Mar 14, 2025

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 25,629 3,736 Updated Feb 10, 2025

TensorRT Extension for Stable Diffusion Web UI

Python 1,963 156 Updated Jun 14, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,942 629 Updated Mar 7, 2025

Next-Token Prediction is All You Need

Python 2,037 78 Updated Mar 17, 2025

TransNet V2: Shot Boundary Detection Neural Network

Python 584 100 Updated Dec 4, 2023

A tool used to obfuscate python scripts, bind obfuscated scripts to fixed machine or expire obfuscated scripts.

Python 4,087 310 Updated Mar 20, 2025

python代码加密以及python代码的License控制

Python 147 63 Updated Dec 26, 2020

Bringing Old Photo Back to Life (CVPR 2020 oral)

Python 15,391 2,042 Updated Oct 26, 2023

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 11,276 1,296 Updated Mar 7, 2025

The AI Code Editor

28,746 1,790 Updated Oct 13, 2024

🌍 `/usr/bin/qemu-*-static`

Shell 2,541 236 Updated Jun 25, 2024

很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。

Shell 9,173 1,110 Updated Mar 20, 2025

Kolors Team

Python 4,285 323 Updated Nov 13, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,026 1,374 Updated Mar 3, 2025

This repository contains the code for the FastApi Authentication api and test cases.

Python 29 14 Updated Oct 24, 2023

[NeurIPS D&B Track 2024] Official implementation of HumanVid

Python 284 4 Updated Feb 20, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 71,845 7,789 Updated Mar 22, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 23,762 2,379 Updated Mar 22, 2025
Next
Showing results