Stars
9
stars
written in Python
Clear filter
OCR, layout analysis, reading order, table recognition in 90+ languages
A Unified Toolkit for Deep Learning Based Document Image Analysis
Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.
基于在线民宿 UGC 数据的意见挖掘项目,包含数据挖掘和NLP 相关的处理,负责数据采集、主题抽取、情感分析等任务。目的是克服用户打分和评论不一致,实时对在线民宿的满意度评测,包含在线评论采集和情感可视化分析。搭建了百度地图POI查询入口,可以进行自动化的批量查询 POI 信息的功能;构建了基于在线民宿语料的 LDA 自动主题聚类模型,利用主题中心词能找出对应的主题属性字典;以用户打分作为标…
This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".
[TPAMI'24] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation