Starred repositories
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
😘 让你“爱”上 GitHub,解决访问时图裂、加载慢的问题。(无需安装)
📸 Quickly generate image from DOM node using HTML5 canvas and SVG.
Converts raster images into SVG in ComfyUI.
ComfyUI node to create SVG vector using Potrace
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
ComfyUI nodes for the Ultimate Stable Diffusion Upscale script by Coyote-A.
Nodes related to video workflows
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
Adobe Illustrator file parser, targeting both Node.js and browser (via WebAssembly).
Scripting in Illustrator is used to automate a wide variety of repetitive task or as complex as an entire new feature
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
A generative speech model for daily dialogue.
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…
🦜🔗 Build context-aware reasoning applications
ChatOllama is an open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management.
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…