Stars
Generates a backlink to the video currently playing in PotPlayer for use in Markdown-based note-taking apps such as Obsidian, Typora, Logseq, and Notion.
The New Stable Diffusion Audio Sampler 1.0 In a ComfyUI Node. Make some beats!
Generative models for conditional audio generation
Instant voice cloning by MIT and MyShell. Audio foundation model.
Official inference repo for FLUX.1 models
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
Fine-Grained Open Domain Image Animation with Motion Guidance
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Official Code for MotionCtrl [SIGGRAPH 2024]
[CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
GLIDE: a diffusion-based text-conditional image synthesis model
Super WeChat desktop client with support for multiple instances, message-recall prevention, voice message backup... Open WeChatSDK
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
A curated list of recent diffusion models for video generation, editing, and various other applications.
Diffusion model papers, survey, and taxonomy
Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Uses large AI models to automatically provide commentary and edit videos with a single click.
Automate Creation of YouTube Shorts using MoviePy.
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models