Skip to content
View taosiyuan163's full-sized avatar

Block or report taosiyuan163

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A minimal and universal controller for FLUX.1.

Python 4 Updated Dec 3, 2024

A minimal and universal controller for FLUX.1.

Python 1,095 68 Updated Jan 9, 2025

高颜值AI数字人克隆、声音克隆、短视频生成、直播(待发布)、AI配音、AI字幕,包括Windows安装版,Web版,H5版,小程序版,副业必备,开源数字人克隆平台后端API

Python 111 22 Updated Jan 8, 2025

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 1,901 132 Updated Jan 12, 2025

美团爬虫(爬取直播间的弹幕、商品等)

Python 2 1 Updated Jan 7, 2025

基于Vue3 + WebRTC + Node + SRS搭建的直播间

Vue 1,351 275 Updated Jan 14, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 12,521 1,345 Updated Jan 14, 2025

Enjoy the magic of Diffusion models!

Python 6,740 629 Updated Jan 15, 2025

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 12,148 1,041 Updated Aug 9, 2024

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 958 102 Updated Jan 14, 2025

ComfyUI Node

Python 285 12 Updated Oct 22, 2024

Official repository of In-Context LoRA for Diffusion Transformers

1,475 75 Updated Dec 20, 2024

Official repository for LTX-Video

Python 2,553 207 Updated Jan 3, 2025

Prompt, run, edit, and deploy full-stack web applications

TypeScript 11,806 7,538 Updated Dec 17, 2024

Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System

Python 2,610 248 Updated Jan 4, 2025

LLM-powered multiagent persona simulation for imagination enhancement and business insights.

Python 5,229 418 Updated Jan 3, 2025

The fastest digital human algorithm, now on your desktop.

Python 397 36 Updated Dec 29, 2024

“alibabacloud-nls-python-sdk提供使用阿里云智能语音服务的能力,包括语音识别、语音合成、文件转写等。”

Python 40 12 Updated Dec 3, 2024

实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and …

Python 622 84 Updated Nov 15, 2024

[真诚套壳]东半球最强的套壳数字人系统,前后端分离,可对接硅基、飞影、闪剪、壹定开放平台等所有市面上的数字人API接口,开箱即用,star交个朋友。

Vue 40 11 Updated Nov 26, 2024

A powerful tool that translates ComfyUI workflows into executable Python code.

Python 1,407 142 Updated Jan 14, 2025

Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation

Python 4,499 647 Updated Dec 13, 2024

The official HelloMeme GitHub site

Python 555 38 Updated Jan 15, 2025

The best OSS video generation models

Python 2,707 274 Updated Jan 8, 2025

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,701 266 Updated Dec 21, 2024

📹 A more flexible CogVideoX that can generate videos at any resolution and creates videos from images.

Python 609 41 Updated Dec 16, 2024

Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"

Python 988 114 Updated Oct 29, 2024

Automate browser-based workflows with LLMs and Computer Vision

Python 11,430 808 Updated Jan 15, 2025
Next