Stars
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer (RVC), zero-shot Voice Cloning (E2, F5-TTS), YouTub…
Lightpanda: the headless browser designed for AI and automation
A programming framework for agentic AI 🤖 PyPI: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Janus-Series: Unified Multimodal Understanding and Generation Models
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
📄 A curated list of awesome .cursorrules files
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Medical o1, Towards medical complex reasoning with LLMs
Deep Face Recognition UI With ReactJS
Anthropic's Interactive Prompt Engineering Tutorial
[Support 0.45] (Multi Language) Automatically registers Cursor AI, automatically resets the machine ID, and upgrades to Pro features for free: You've reached your trial request limit. / Too many free trial accounts used on this machine. Please upgrade to pro. We ha…
Pretrained models for TensorFlow.js
Silent liveness detection: Silent Face Anti-Spoofing Attack Detection
Translation plugin for IntelliJ-based IDEs/Android Studio.
Robust Speech Recognition via Large-Scale Weak Supervision
A feature-rich command-line audio/video downloader
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
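Identity authentication system based on InsightFace and SpringBoot: a web project with separated front end and back end that mainly implements browser-based face login, taking a photo with the front-end camera and sending it to the back end for similarity comparison against the face database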
JavaScript library for precise tracking of facial features via Constrained Local Models
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization