
Lists (4)
Sort Name ascending (A-Z)
Starred repositories
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/mEkkMXFG
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural language to make computers work by themselves
Make websites accessible for AI agents
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
Toolkit for linearizing PDFs for LLM datasets/training
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Frontier Multimodal Foundation Models for Image and Video Understanding
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
A simple screen parsing tool towards pure vision based GUI agent
Retrieval and Retrieval-augmented LLMs
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Ongoing research training transformer models at scale
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
A BPMN 2.0 rendering toolkit and web modeler.
🔥 flowable workflow designer based on vue and [email protected]
A foolproof Elasticsearch ORM framework that is easy to use, requires minimal coding, and is highly expandable...