Stars
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
Awesome Digital Human
TEN, a voice agent framework to create conversational AI.
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
☁️ Build multimodal AI applications with cloud-native stack
A blazing fast inference solution for text embeddings models
Browser automation system that uses AI-driven planning to navigate web pages and perform goals.
A simple screen parsing tool towards pure vision based GUI agent
JavaScript/WebGL glasses virtual try-on widget. Real-time camera experience, robust to all lighting conditions, high-end 3D PBR rendering, easy integration, fully customizable.
a state-of-the-art-level open visual language model | 多模态预训练模型
Prompt, run, edit, and deploy full-stack web applications
Auto_Jobs_Applier_AI_Agent aims to easy job hunt process by automating the job application process. Utilizing artificial intelligence, it enables users to apply for multiple jobs in an automated an…
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
o1-engineer is a command-line tool designed to assist developers in managing and interacting with their projects efficiently. Leveraging the power of OpenAI's API, this tool provides functionalitie…
Fuji is an AI agent that lives in your browser's sidepanel. You can now get tasks done online with a single command!
Your Next Store: Modern Commerce with Next.js and Stripe as the backend.
Dead simple FLUX LoRA training UI with LOW VRAM support
From comfyui workflow to web app, in seconds
本项目为 chatgpt-on-wechat下游分支, 额外对接了LLMOps平台 Dify,支持Dify智能助手模式,调用工具和知识库,支持Dify工作流。
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code…
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0