Stars
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
deiteris / voice-changer
Forked from w-okada/voice-changer
Realtime Voice Changer
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
Build real-time multimodal AI applications 🤖🎙️📹
ain-soph / ChatTTS
Forked from 2noise/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
Add voice to your ollama model. Supports real-time speech generation and streaming output from your LLM.
An Open Source text-to-speech system built by inverting Whisper.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
A framework to enable multimodal models to operate a computer.
[ECCV'24] Kalman-Inspired Feature Propagation for Video Face Super-Resolution
An OpenAI API-compatible text-to-speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
A bot that likes comments on TikTok videos.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Real-time face swap and one-click video deepfake with only a single image
A generative speech model for daily dialogue.
Bring portraits to life via webcam!
Bring portraits to life via Monitor!
ShiJiaying / LivePortrait
Forked from KwaiVGI/LivePortrait
Bring portraits to life!
ymuhong / LivePortrait-Advanced
Forked from KwaiVGI/LivePortrait
Bring portraits to life!
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…