ZQun ZQun

💪

Very nice

17 followers · 3 following

Achievements

Organizations

Stars

HumanAIGC / omnitalker

Project Page repo of OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

JavaScript 205 14 Updated Apr 15, 2025

xszyou / Fay

fay是一个帮助数字人（2.5d、3d、移动、pc、网页）或大语言模型（openai兼容、deepseek）连通业务系统的agent框架。

Python 10,819 2,001 Updated Apr 2, 2025

unovue / inspira-ui

Build beautiful website using Vue & Nuxt.

Vue 2,813 121 Updated Apr 11, 2025

juliangarnier / anime

JavaScript animation engine

JavaScript 57,676 3,948 Updated Apr 16, 2025

electron-manus / lugumanus

一个使用 Typescript + Electron 实现的类 Manus 桌面端

HTML 87 16 Updated Apr 8, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 2,542 188 Updated Apr 15, 2025

mannaandpoem / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 43,420 7,449 Updated Apr 16, 2025

web-infra-dev / midscene

Let AI be your browser operator.

TypeScript 7,978 453 Updated Apr 16, 2025

unitreerobotics / unitree_sdk2

Unitree robot sdk version 2. https://support.unitree.com/home/zh/developer

C++ 381 115 Updated Apr 11, 2025

Bush2021 / edge_installer

Use GitHub Actions to automatically get Microsoft Edge offline installation package

Python 111 16 Updated Apr 16, 2025

browser-use / browser-use

Make websites accessible for AI agents

Python 56,249 6,024 Updated Apr 16, 2025

Unstructured-IO / unstructured-api

Python 710 159 Updated Feb 12, 2025

fudan-generative-vision / hallo3

Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,189 160 Updated Mar 13, 2025

ABexit / ASR-LLM-TTS

This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice, the LLM models are QWen2.5-0.5B/1.5B, and there are three …

Python 659 118 Updated Mar 1, 2025

Henry-23 / VideoChat

实时语音交互数字人，支持端到端语音方案（GLM-4-Voice - THG）和级联方案（ASR-LLM-TTS-THG）。可自定义形象与音色，无须训练，支持音色克隆，首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …

Python 876 115 Updated Mar 21, 2025