ftyh2005

ftyh2005

0 followers · 4 following

Stars

166 stars written in Python

Clear filter

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 148,103 27,681 Updated Feb 18, 2025

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 76,656 9,169 Updated Jan 4, 2025

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 67,901 7,284 Updated Feb 20, 2025

binary-husky / gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 67,604 8,295 Updated Feb 12, 2025

deepfakes / faceswap

Deepfakes Software For All

Python 53,300 13,328 Updated Nov 19, 2024

lllyasviel / Fooocus

Focus on prompting and generating

Python 43,326 6,458 Updated Jan 24, 2025

oobabooga / text-generation-webui

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 42,578 5,504 Updated Feb 18, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 40,878 4,560 Updated Feb 20, 2025

LC044 / WeChatMsg

提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手

Python 37,393 3,854 Updated Jan 2, 2025

ultralytics / ultralytics

Ultralytics YOLO11 🚀

Python 36,787 7,112 Updated Feb 20, 2025

TencentARC / GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 36,341 6,029 Updated Jul 26, 2024

zhayujie / chatgpt-on-wechat

基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。

Python 34,802 8,859 Updated Feb 5, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 34,551 3,726 Updated Feb 18, 2025

lllyasviel / ControlNet

Let us control diffusion models!

Python 31,500 2,820 Updated Feb 25, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 30,995 3,115 Updated Jan 7, 2025

hiroi-sora / Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。

Python 29,600 2,952 Updated Feb 9, 2025

s0md3v / roop

one-click face swap

Python 29,324 6,626 Updated Aug 19, 2024

chatanywhere / GPT_API_free

Free ChatGPT API Key，免费ChatGPT API，支持GPT4 API（免费），ChatGPT国内可用免费转发API，直连无需代理。可以搭配ChatBox等软件/插件使用，极大降低接口使用成本。国内即可无限制畅快聊天。

Python 27,685 2,036 Updated Feb 14, 2025

iperov / DeepFaceLive

Real-time face swap for PC streaming or video calls

Python 27,646 286 Updated Nov 8, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,599 5,670 Updated Feb 20, 2025

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 27,162 3,866 Updated Nov 24, 2024

Vision-CAIR / MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,584 2,925 Updated Sep 2, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 23,377 2,311 Updated Feb 20, 2025

harry0703 / MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 23,288 3,418 Updated Feb 10, 2025

facefusion / facefusion

Industry leading face manipulation platform

Python 21,556 3,270 Updated Feb 16, 2025

Cinnamon / kotaemon

An open-source RAG-based tool for chatting with your documents.

Python 21,260 1,669 Updated Feb 14, 2025

Sanster / IOPaint

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 20,444 2,080 Updated Nov 23, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python 19,353 1,490 Updated Feb 18, 2025

danielgatis / rembg

Rembg is a tool to remove images background

Python 18,048 1,945 Updated Feb 20, 2025

Byaidu / PDFMathTranslate

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译，支持 Google/DeepL/Ollama/OpenAI 等服务，提供 CLI/GUI/Docker/Zotero

Python 17,458 1,403 Updated Feb 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly