390 results for source starred repositories

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,111 4,416 Updated Jan 18, 2025

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 5,162 252 Updated Jan 17, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 37,053 4,594 Updated Aug 16, 2024

Generate high-definition short videos with one click using large AI models.

Python 20,845 3,149 Updated Dec 12, 2024

NVR with realtime local object detection for IP cameras

TypeScript 20,383 1,863 Updated Jan 22, 2025

Use large AI models to add commentary to and edit videos with a single click.

Python 3,395 382 Updated Jan 11, 2025

Get your documents ready for gen AI

Python 18,869 999 Updated Jan 21, 2025

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,284 192 Updated Jan 21, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 29,185 2,761 Updated Jan 22, 2025

ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Jupyter Notebook 131 11 Updated Jan 11, 2025

A Repo For Document AI

Python 2,668 146 Updated Jan 14, 2025

A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.

Python 160 11 Updated May 23, 2024

Code for the paper "UVDoc: Neural Grid-based Document Unwarping"

C++ 110 21 Updated Jul 28, 2024

To try TextIn document parsing, visit https://cc.co/16YSIy

JavaScript 115 19 Updated Dec 3, 2024

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 33,051 5,687 Updated Nov 29, 2024

An LLM-based chatbot and autonomous AI agent that uses Function, Tools, and Agent to let the LLM work autonomously.

Python 6 2 Updated May 22, 2024

A TensorRT + YOLO-series example of parallel video analysis across multiple streams, multiple GPUs, and multiple instances.

C++ 264 50 Updated Aug 1, 2024

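A curated collection of vulnerability EXPs/POCs, mostly sourced from the internet. More than 1,400 POCs/EXPs have been collected so far, with long-term updates.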

4,748 1,038 Updated Jan 7, 2025

A clip-pytorch model that can be trained on your own dataset.

Python 206 26 Updated Apr 5, 2023

Detection and rectification of ID cards, certificates, and documents.

Python 40 11 Updated Sep 18, 2024

MOWA: Multiple-in-One Image Warping Model

Python 106 17 Updated Oct 14, 2024

A way to rectify curve text images using spatial transformer by pairs of points.

Python 34 7 Updated Dec 9, 2020

Build your own C++ server in 30 days, with tutorials and source code.

C++ 6,009 775 Updated Jan 21, 2025

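C++ learning materials, newly compiled in 2021, covering new features of C++ 11 / 14 / 17 / 20 / 23, beginner tutorials, recommended books, quality articles, study notes, tutorial videos, and more.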

C++ 5,307 1,119 Updated Jun 8, 2022

A web-based tool for online image data annotation.

TypeScript 32 7 Updated Jan 3, 2025

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 986 131 Updated Jan 20, 2025

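A curated collection of authoritative audio/video streaming resources: 500+ articles, papers, videos, hands-on projects, protocols, and a list of industry experts.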

5,541 1,241 Updated May 20, 2024

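The latest audio/video learning materials, compiled in 2023: projects (debugged and working), an ffmpeg command manual, articles, codec papers, video walkthroughs, and a full set of interview questions.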

C 1,997 579 Updated May 20, 2024

SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181.

C++ 26,219 5,429 Updated Jan 14, 2025

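This project is an end-to-end intelligent traffic intersection monitoring system based on computer vision. Its architecture consists of three parts: an SRS streaming media server, a GPU server, and a Local client. Remote video streams are sent to the streaming media server over the rtmp protocol, analyzed by a series of algorithms such as object detection, and the results are then viewed on the local client. The project is implemented mainly in Python; the streaming media server is built on the open-source SRS real-time video server, and the GPU server uses a YOLO model to handle road objects such as people, vehicles, and traffic lights…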

Python 338 75 Updated Jul 16, 2022