Stars
Open-source no-code web data extraction platform. Turn websites to APIs & spreadsheets with no-code robots in minutes.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Make websites accessible for AI agents
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
😎丰富生态、🧩支持扩展、🦄多模态 - 大模型原生即时通信机器人平台 | 适配 QQ / 微信(企业微信、个人微信)/ 飞书 / 钉钉 / Discord / Telegram 等消息平台 | 支持 ChatGPT、DeepSeek、Dify、Claude、Gemini、xAI Grok、Ollama、LM Studio、阿里云百炼、火山方舟、SiliconFlow、Qwen、Moonshot…
An open-source PAM tool alternative to CyberArk. 广受欢迎的开源堡垒机。
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Convert PDF to markdown + JSON quickly with high accuracy
A machine learning software for extracting information from scholarly documents
#1 Locally hosted web application that allows you to perform various operations on PDF files
Hazelcast is a unified real-time data platform combining stream processing with a fast data store, allowing customers to act instantly on data-in-motion for real-time insights.
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
😎 Awesome lists about all kinds of interesting topics
Focalboard is an open source, self-hosted alternative to Trello, Notion, and Asana.
ReaLTaiizor is a .NET WinForms control library that offers a wide range of components and is user-friendly and design-focused.
Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licen…
The open source Solver AI for Java, Python and Kotlin to optimize scheduling and routing. Solve the vehicle routing problem, employee rostering, task assignment, maintenance scheduling and other pl…
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Docs2KG: A Human-LLM Collaborative Approach to Unified Knowledge Graph Construction from Heterogeneous Documents
Open Source Continuous File Synchronization
OpenRefine is a free, open source power tool for working with messy data and improving it
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Build Multimodal AI Agents with memory, knowledge and tools. Simple, fast and model-agnostic.
Elegant Scraper and Crawler Framework for Golang
An easy-to-use, distributed, extensible task/job queue framework for #golang