ZQun
💪
Very nice

Organizations

@midwayjs

Project Page repo of OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

JavaScript 205 14 Updated Apr 15, 2025

Fay is an agent framework that helps digital humans (2.5D, 3D, mobile, PC, web) or large language models (OpenAI-compatible, DeepSeek) connect to business systems.

Python 10,819 2,001 Updated Apr 2, 2025

Build beautiful websites using Vue & Nuxt.

Vue 2,813 121 Updated Apr 11, 2025

JavaScript animation engine

JavaScript 57,676 3,948 Updated Apr 16, 2025

A Manus-like desktop client implemented with TypeScript + Electron.

HTML 87 16 Updated Apr 8, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 2,542 188 Updated Apr 15, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 43,420 7,449 Updated Apr 16, 2025

Let AI be your browser operator.

TypeScript 7,978 453 Updated Apr 16, 2025

Unitree robot SDK version 2. https://support.unitree.com/home/zh/developer

C++ 381 115 Updated Apr 11, 2025

Use GitHub Actions to automatically get Microsoft Edge offline installation package

Python 111 16 Updated Apr 16, 2025

Make websites accessible for AI agents

Python 56,249 6,024 Updated Apr 16, 2025

Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,189 160 Updated Mar 13, 2025

This is a speech interaction system built on open-source models, chaining ASR, LLM, and TTS in sequence. The ASR model is SenseVoice, the LLM models are Qwen2.5-0.5B/1.5B, and there are three …

Python 659 118 Updated Mar 1, 2025

Real-time voice-interactive digital human, supporting an end-to-end voice solution (GLM-4-Voice - THG) and a cascaded solution (ASR-LLM-TTS-THG). Appearance and voice timbre are customizable without training, voice cloning is supported, and first-packet latency is as low as 3 s. …

Python 876 115 Updated Mar 21, 2025

The customization marketplace for Windows programs: https://windhawk.net/

C++ 3,381 92 Updated Oct 2, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,660 826 Updated Mar 12, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,554 415 Updated Feb 27, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 2,464 205 Updated Mar 14, 2025

A framework that helps you quickly build AI-native IDE products. Includes an MCP client that supports Model Context Protocol (MCP) tools via an MCP server.

TypeScript 3,308 413 Updated Apr 16, 2025

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,810 107 Updated Apr 9, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,118 2,227 Updated Feb 1, 2025

Admin Web Interface for juanfont/headscale

Svelte 707 53 Updated Apr 1, 2025

An open source, self-hosted implementation of the Tailscale control server

Go 27,203 1,457 Updated Apr 16, 2025

The easiest, most secure way to use WireGuard and 2FA.

Go 22,088 1,764 Updated Apr 16, 2025

An efficient VPN. A simple, efficient tool for cross-site networking and NAT traversal.

Rust 1,978 232 Updated Feb 24, 2025

🌐 The Internet OS! Free, Open-Source, and Self-Hostable.

JavaScript 30,196 2,239 Updated Apr 16, 2025

Windows inside a Docker container.

Shell 34,181 2,437 Updated Apr 16, 2025

Borgo is a statically typed language that compiles to Go.

Rust 4,390 60 Updated Oct 27, 2024