Skip to content
View leo3349's full-sized avatar

Block or report leo3349

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

智能电话外呼系统 呼叫中心系统 freeswitch webrtc

Java 464 245 Updated Jun 24, 2024

A blazing fast inference solution for text embeddings models

Rust 3,084 210 Updated Jan 21, 2025

PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)

125 10 Updated Nov 12, 2024

PDF to Markdown with vision models

Python 9,153 584 Updated Dec 18, 2024

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

C++ 4,443 508 Updated Jan 26, 2025

Measures Width of Finger to select ring size Using Image Processing and Hand Landmarks

Python 1 Updated Nov 1, 2023

Get the width of fingers according the photo with a hand and a coin besides the hand.

Python 10 Updated Dec 16, 2023

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,198 1,225 Updated Jan 22, 2025

Awesome Opencv Project

Python 506 166 Updated Jun 17, 2022

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 2,136 181 Updated Sep 23, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 6,632 581 Updated Jan 11, 2025

[WIP] Streaming Audio Models Examples in JS

JavaScript 11 1 Updated Mar 29, 2024

Get your documents ready for gen AI

Python 19,121 1,011 Updated Jan 26, 2025

Noise supression using deep filtering

Python 2,686 249 Updated Oct 17, 2024

Python text-to-speech library with built-in voice effects and support for multiple TTS engines

Python 19 3 Updated Jun 2, 2024

Lightweight, performant, deep table extraction

Python 391 28 Updated Dec 11, 2024

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…

Python 3,376 524 Updated Jan 25, 2025

Real time interactive streaming digital human

Python 4,385 642 Updated Jan 26, 2025

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Jupyter Notebook 5,095 327 Updated Oct 18, 2023

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,856 468 Updated Dec 26, 2024

实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果

Python 305 42 Updated Dec 31, 2024

Recurrent neural network for audio noise reduction

C 4,279 919 Updated Jan 1, 2025

Table Recognition and Content Extraction in PDF Files

Python 23 7 Updated Apr 22, 2019

OpenCV-Python图像处理教程

Python 2 1 Updated Nov 30, 2018

darknet text detect and darknet cnn ocr

C 1,145 287 Updated Oct 12, 2021

yolo3+ocr

Python 5,979 1,733 Updated Aug 29, 2022

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 19,962 1,132 Updated Jan 25, 2025
Python 604 53 Updated Jun 7, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,486 432 Updated Jan 3, 2025
Next