xinmaoge

Starred repositories

Efficient vision foundation models for high-resolution generation and perception.

Python 2,628 213 Updated Jan 24, 2025

This is a collection of our NAS and Vision Transformer work.

Python 1,715 233 Updated Jul 25, 2024

Generate synthetic license plates for OCR or object detection projects.

Python 1 Updated Oct 16, 2024

AI-native data app development framework with AWEL (Agentic Workflow Expression Language) and agents.

Python 14,614 1,981 Updated Feb 14, 2025

A high-quality, one-stop, open-source data extraction tool that converts PDF to Markdown and JSON.

Python 25,735 1,961 Updated Feb 14, 2025

Open source Python library for converting PDF to DOCX.

Python 2,751 396 Updated Sep 23, 2024

Real-time detection and high-precision reading recognition for pointer gauges using Nanodet+YoloV8-Pose (built on the ncnn framework).

C++ 65 4 Updated Oct 31, 2024

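stock: fetches stock data, computes stock indicators and chip distribution, recognizes chart patterns, and supports comprehensive stock screening, stock-picking strategies, backtesting and validation, and automated trading. Works on PC and mobile devices.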

Python 7,508 1,425 Updated Jan 17, 2025

Make RepVGG Greater Again: A Quantization-aware Approach

Python 21 2 Updated Mar 8, 2024

LLMs interview notes and answers: this repository mainly records interview questions and reference answers for large language model (LLM) algorithm engineers.

444 114 Updated Oct 16, 2023

BargainNet: Background-Guided Domain Translation for Image Harmonization. Useful for image harmonization, image composition, etc.

Python 71 5 Updated Jan 19, 2025
Python 780 79 Updated Sep 14, 2023

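An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope.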

Python 196 15 Updated Jan 10, 2025

Collects the best currently open-source table recognition models, improves their pre-processing and post-processing, and converts the models to ONNX.

Python 513 46 Updated Jan 17, 2025

Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.

Python 56 Updated Dec 10, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 40,513 4,973 Updated Feb 14, 2025

A vLLM-accelerated implementation of GOT, combined with MinerU for PDF parsing in RAG.

Python 45 5 Updated Nov 7, 2024

Using GPT to parse PDF

Python 3,227 232 Updated Aug 7, 2024

Vehicle recognition from 0 to 1.

Python 8 1 Updated Apr 24, 2022

Practical, Easy-to-copy CMake examples

C++ 300 46 Updated Nov 15, 2024

Research on accelerating GOT-OCR for production deployment, not limited to any language.

Python 58 4 Updated Oct 24, 2024

Applications of large language models and multimodal models, mainly including RAG, small models, agents, cross-modal search, OCR, and more.

Python 152 8 Updated Nov 6, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,865 601 Updated Feb 10, 2025

Generate text images for training deep learning OCR models.

Python 1,415 386 Updated Jan 17, 2022

A synthetic data generator for text recognition

Python 3,399 996 Updated Jul 18, 2024

CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [Based on PyTor…

Python 3,407 515 Updated Nov 30, 2024

UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models

Python 217 17 Updated Feb 14, 2025

ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.

Python 255 26 Updated Aug 22, 2024

Computer vision projects | Fun computer-vision AI projects (Python, C++, embedded systems)

Jupyter Notebook 2,308 684 Updated Jan 18, 2025