Stars
Scrapy, a fast high-level web crawling & scraping framework for Python.
LlamaIndex is a data framework for your LLM applications
An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers.
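Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing
A complete and graceful API for WeChat: a personal-account interface, a WeChat bot, and a command-line WeChat client. Thirty lines of code are enough to build a custom personal-account bot.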
Generative Models by Stability AI
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Code for the paper "Language Models are Unsupervised Multitask Learners"
Universal LLM Deployment Engine with ML Compilation
End-to-End Object Detection with Transformers
Transparent proxy server that works as a poor man's VPN. Forwards over SSH. Doesn't require admin. Works with Linux and macOS. Supports DNS tunneling.
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
A little word cloud generator in Python
Chinese version of GPT2 training code, using BERT tokenizer.
🔰 Home Assistant Operating System
Windows GUI Automation with Python (based on text properties)
Keras implementation of RetinaNet object detection.
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[CAPTCHA recognition: training] This project uses CNN/ResNet/DenseNet + GRU/LSTM + CTC/CrossEntropy to recognize CAPTCHAs. It covers model training only.
Deep Learning NLP Pipeline implemented on TensorFlow
Open and Save Evernote notes from Sublime Text 3 using Markdown