NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,284 192 Updated Jan 21, 2025

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 29,185 2,761 Updated Jan 22, 2025

phamquiluan / jdeskew

ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Jupyter Notebook 131 11 Updated Jan 11, 2025

deepdoctection / deepdoctection

A Repo For Document AI

Python 2,668 146 Updated Jan 14, 2025

ppaanngggg / layoutreader

A faster LayoutReader model based on LayoutLMv3 that sorts OCR bboxes into reading order.

Python 160 11 Updated May 23, 2024

tanguymagne / UVDoc

Code for the paper "UVDoc: Neural Grid-based Document Unwarping"

C++ 110 21 Updated Jul 28, 2024

intsig-textin / parsex-frontend

To try TextIn document parsing, visit https://cc.co/16YSIy

JavaScript 115 19 Updated Dec 3, 2024

chatchat-space / Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 33,051 5,687 Updated Nov 29, 2024

YidaHu / chatbot

An LLM-based chatbot and autonomous AI agent that uses Function, Tools, and Agent to let the LLM work autonomously.

Python 6 2 Updated May 22, 2024

1461521844lijin / trt_yolo_video_pipeline

A multi-stream, multi-GPU, multi-instance parallel video analysis example built on TensorRT and the YOLO series.

C++ 264 50 Updated Aug 1, 2024

wy876 / POC

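A curated collection of vulnerability EXP/POC code, mostly sourced from the web; currently more than 1,400 poc/exp entries, updated on an ongoing basis.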

4,748 1,038 Updated Jan 7, 2025

bubbliiiing / clip-pytorch

A clip-pytorch model that can be trained on your own dataset.

Python 206 26 Updated Apr 5, 2023

BADBADBADBOY / CardDetectRotate

Detection and rectification of ID cards and documents.

Python 40 11 Updated Sep 18, 2024

KangLiao929 / MOWA

MOWA: Multiple-in-One Image Warping Model

Python 106 17 Updated Oct 14, 2024

frotms / Curve-Text-Rectification-Using-Pairs-Of-Points

A way to rectify curve text images using spatial transformer by pairs of points.

Python 34 7 Updated Dec 9, 2020

yuesong-feng / 30dayMakeCppServer

Build your own C++ server in 30 days, with tutorials and source code.

C++ 6,009 775 Updated Jan 21, 2025

0voice / cpp_new_features

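C++ learning materials, newly compiled in 2021, covering C++ 11 / 14 / 17 / 20 / 23 new features, beginner tutorials, recommended books, quality articles, study notes, teaching videos, and more.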

C++ 5,307 1,119 Updated Jun 8, 2022

FerretAngel / labelImage

An online web tool for annotating image data.

TypeScript 32 7 Updated Jan 3, 2025

lenML / Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 986 131 Updated Jan 20, 2025

0voice / audio_video_streaming

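A curated collection of authoritative audio/video streaming materials: 500+ articles, papers, videos, hands-on projects, protocols, and a list of industry experts.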

5,541 1,241 Updated May 20, 2024

0voice / ffmpeg_develop_doc

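The latest audio/video learning materials, compiled in 2023: projects (debugged and working), an ffmpeg command manual, articles, codec papers, video walkthroughs, and a full set of interview questions.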

C 1,997 579 Updated May 20, 2024

ossrs / srs

SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.

C++ 26,219 5,429 Updated Jan 14, 2025

Kevinnan-teen / Intelligent-Traffic-Based-On-CV

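This project is an end-to-end intelligent monitoring system for traffic intersections based on computer vision. Its architecture consists of three parts: an SRS streaming media server, a GPU server, and a local client. Remote video streams are sent to the streaming media server over the rtmp protocol, analyzed by a series of algorithms such as object detection, and the results are then viewed in the local client. The project is implemented mainly in Python; the streaming media server is built on the open-source SRS real-time video server, and the GPU server uses a YOLO model for road objects such as people, vehicles, and traffic lights…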

Python 338 75 Updated Jul 16, 2022

wcyong

Stars

RVC-Boss / GPT-SoVITS

QuivrHQ / MegaParse

coqui-ai / TTS

harry0703 / MoneyPrinterTurbo

blakeblackshear / frigate

linyqh / NarratoAI

DS4SD / docling

NVIDIA / nv-ingest