Lists (22)
Sort Oldest
AI app
Front-end-comp
AI dev
开源应用
fe-base
开源插件
python_base
swift-base
研发基建
tech_list
data_list
end_server_base
AI 绘画
esp32
tts
agent
rag
quant
ai_learning
ocr
mcp
swift-app
Stars
A set of beautifully-designed, accessible components and a code distribution platform. Works with your favorite frameworks. Open Source. Open Code.
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books. The project has just started.
AI Agent Framework For Software Engineers
A cheat sheet that helps React developers to quickly start with SwiftUI.
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"
Cursor Talk To Figma MCP
动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/
HiPixel is a native macOS application for AI-powered image super-resolution, built with SwiftUI and leveraging Upscayl's powerful AI models.
A open, local Manus AI alternative. Powered with Deepseek R1. No APIs, no $456 monthly bills. Enjoy an AI agent that reason, code, and browse with no worries.
[ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
Official Implementation of "KBLaM: Knowledge Base augmented Language Model"
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
Pocket Flow: 100-line LLM framework. Let Agents build Agents!
Pioneering Multimodal Reasoning with CoT
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
YT Navigator: AI-powered YouTube content explorer that lets you search and chat with channel videos using AI agents. Extract insights from hours of content in seconds with semantic search and preci…
The official Soundwave repository
Train your AI self, amplify you, bridge the world
A community-driven AI automation framework that builds upon the incredible work of the open source community. Our goal is to combine language models with specialized tools for tasks like web search…
MCP server for fetch web page content using Playwright headless browser.
Official implementation of the paper "MusicInfuser: Making Video Diffusion Listen and Dance"