Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.
A generative world for general-purpose robotics & embodied AI learning.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
😺 A tool designed to shorten steps needed to import and optimize models into VRChat. Compatible models are: MMD, XNALara, Mixamo, DAZ/Poser, Blender Rigify, Sims 2, Motion Builder, 3DS Max and pote…
raryelcostasouza / pyTranscriber
Forked from agermanidis/autosubpyTranscriber can be used to generate automatic transcription / automatic subtitles for audio/video files through a friendly graphical user interface.
效果更好的补帧软件,显存占用更小,是DAIN速度的10-25倍,包含抽帧处理,去除动漫卡顿感
Plenoxels: Radiance Fields without Neural Networks
mmd_tools is a blender addon for importing Models and Motions of MikuMikuDance.
We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.
🎞️ Subtitles generation tool (Web-UI + CLI + Python package) powered by OpenAI's Whisper and its variants 🎞️
This project aims to facilitate the conversion of Visual Studio to CMake projects.
Automatic fingering generator for piano scores
(CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”
[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
Extension for AUTOMATIC111's WebUI
Modding platform for GI, HSR, WW and ZZZ
AnimationGPT:An AIGC tool for generating game combat motion assets
repository for 360 panorama image generation based on Stable Diffusion
Midi event transformer for symbolic music generation
Text to speech alignment using CTC forced alignment
Official implementation of "TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts (ECCV2022)"
Mod / Plugin for KK / KKS to export Characters into PMX Format and additional Utility to cleanup the model into a useable state for MMD
Enhanced img2img extension for AUTOMATIC111's WebUI