Starred repositories
Efficient vision foundation models for high-resolution generation and perception.
This is a collection of our NAS and Vision Transformer work.
Generate synthetic license plates for OCR or object deteciton project
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Open source Python library for converting PDF to DOCX.
使用Nanodet+YoloV8-Pose实现指针仪表的实时检测、高精度读数识别(借助ncnn框架)
stock股票.获取股票数据,计算股票指标,筹码分布,识别股票形态,综合选股,选股策略,股票验证回测,股票自动交易,支持PC及移动设备。
Make RepVGG Greater Again: A Quantization-aware Approach
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
BargainNet: Background-Guided Domain Translation for Image Harmonization. Useful for Image harmonization, image composition, etc.
基于序列表格识别算法推理库,集成PP-Structure和modelscope等表格识别算法。
整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX Organize the currently open-source optimal table recognition models, improve pre-processing and post-processing, and convert the models to ONNX.
检测和提取各种场景图片中的表格区域,并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Practical, Easy-to-copy CMake examples
一些大语言模型和多模态模型的应用,主要包括Rag,小模型,Agent,跨模态搜索,OCR等等
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Generate text images for training deep learning ocr model
A synthetic data generator for text recognition
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTor…
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.
computer vision projects | 计算机视觉相关好玩的AI项目(Python、C++、embedded system)