Starred repositories
coonooo / awesome-programming-books.github.io
Forked from awesome-programming-books/awesome-programming-books.github.io📚 经典技术书籍 PDF 文件,持续更新...
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Recipes to scale inference-time compute of open models
Xiaomi Home Integration for Home Assistant
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
一个网络安全法律法规、安全政策、国家标准、行业标准知识库。A knowledge base of cybersecurity laws and regulations, security policies, national standards, and industry standards.
爬取中国所有省份办公厅公文数据。Crawler for all Policy text of all provinces in China
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library
动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/
The official Python library for the OpenAI API
SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.
Official inference repo for FLUX.1 models
An Efficient ProxyPool with Getter, Tester and Server
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
Collection of China illegal cases about web crawler 本项目用来整理所有中国大陆爬虫开发者涉诉与违规相关的新闻、资料与法律法规。致力于帮助在中国大陆工作的爬虫行业从业者了解我国相关法律,避免触碰数据合规红线。 [AD]中文知识图谱门户
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
a semi-structure representation of database schema
A MULTI-GENERATOR ENSEMBLE FRAMEWORK FOR NATURAL LANGUAGE TO SQL
Notepad++ official repository
免费开源的中文搜索引擎,采用 C/C++ 编写 (基于 xapian 和 scws),提供 PHP 的开发接口和丰富文档