Starred repositories
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice, the LLM models are QWen2.5-0.5B/1.5B, and there are three …
API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Robust Speech Recognition via Large-Scale Weak Supervision
CapsWriter 的离线版,一个好用的 PC 端的语音输入工具
带有详细注释的 Redis 3.0 代码(annotated Redis 3.0 source code)。
听说C与Linux更搭配哦~ 内容包括:C基础 C++面向对象编程 基础数据结构 linux系统编程以及一些操作系统的相关知识
🔥🔥超过1000本的计算机经典书籍、个人笔记资料以及本人在各平台发表文章中所涉及的资源等。书籍资源包括C/C++、Java、Python、Go语言、数据结构与算法、操作系统、后端架构、计算机系统知识、数据库、计算机网络、设计模式、前端、汇编以及校招社招各种面经~
Curated list of project-based tutorials
RT-GENE: Real-Time Eye Gaze and Blink Estimation in Natural Environments
A streaming media project based on IPv4
❤️这是一条汇总网上许多资料,而不是资料的纯粹堆砌,让人眼花缭乱的复制粘贴,这不是帮你在总结所有的知识点,而是根据实际的计算机系课程来安排学习路线,并且配合上面向就业的学习,与完全跟着学校课程相比,做到了不和工业界面试不脱节,非常实际、非常认真、非常掉头发,真心求个视频三连!
VSCode插件:自动生成,自动更新VSCode文件头部注释, 自动生成函数注释并支持提取函数参数,支持所有主流语言,文档齐全,使用简单,配置灵活方便,持续维护多年。
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
The official PyTorch implementation of L2CS-Net for gaze estimation and tracking
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
基于deeplabv3plus网络实现了虹膜图像分割以及水果图像分割
RepVGG: Making VGG-style ConvNets Great Again
This repository contains all links of my work on gaze estimation. All updates will be shown in this page.
🚀An automatic configuration program for vim
Make your vim more power and much easer. 最实用的vim配置🔥