- Suzhou, Jiangsu
-
08:56
(UTC +08:00)
Starred repositories
[CVPR 2025] VGGT: Visual Geometry Grounded Transformer
A simple screen parsing tool towards pure vision based GUI agent
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Get your documents ready for gen AI
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.
An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI
The AI Podcast Studio: generate podcasts scripts and their audio version with a team of AI workers in a Podcast Studio 🎙️📜
nerdymomocat-templates / webtrotion-astro-notion-cms-website-blog
Forked from otoyo/astro-notion-blogYour own notion website with astro
🚀 Begin building your very own Notion Blog with Astro.
每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Writing AI Conference Papers: A Handbook for Beginners
Open Vocabulary Learning for Neural Chinese Pinyin IME (ACL 2020)
YOLOv11 trained on DocLayNet dataset.
Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.
检测和提取各种场景图片中的表格区域,并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR
Detect and extract tables to markdown and csv
OCR, layout analysis, reading order, table recognition in 90+ languages