Stars
Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
OpenHealth, AI Health Assistant | Powered by Your Data
Powerful & Easy-to-Use Video Face Swapping and Editing Software
Janus-Series: Unified Multimodal Understanding and Generation Models
Script-IDE is a plugin for Godot. It transforms the Script UI into an IDE like UI. Tabs are used for navigating between scripts. The default Outline got an overhaul and now shows all members of the…
ComfyUI wrapper for Kokoro-onnx
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
🐚 WebView2 and Qt6-based shell, desktop app for Stremio with latest web ui support
Lightweight streaming web app for freshwater sailors 🏴☠️
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
The Summator Example from Custom Modules made with the GDExtension system in Godot 4
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
0lento / TRELLIS
Forked from microsoft/TRELLISTRELLIS fork with additional memory handling.
Generate Diffuse Textures on Meshes directly in Blender 3D with Stable Diffusion.
Dagor Engine and Tools source code from Gaijin Games KFT
Tensor math and scientific computation for the Godot game engine.
basic port of Radiance Cascades shadertoy implementation
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Foundational model for human-like, expressive TTS
Godot 4 plugin that lowers CPU consumption when losing window focus