Lists (4)
Sort Name ascending (A-Z)
Stars
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Use OpenAI's realtime API for a chatting with your documents
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Blazing fast whisper turbo for ASR (speech-to-text) tasks
Fast and accurate automatic speech recognition (ASR) for edge devices
📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from comp…
Implementation of Nougat Neural Optical Understanding for Academic Documents
Convert PDF to markdown + JSON quickly with high accuracy
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
2025年3月更新,目前国内可用Docker镜像源汇总,DockerHub国内镜像加速列表,🚀DockerHub镜像加速器
📚 本代码库是作者小傅哥多年从事一线互联网 Java 开发的学习历程技术汇总,旨在为大家提供一个清晰详细的学习教程,侧重点更倾向编写Java核心内容。如果本仓库能为您提供帮助,请给予支持(关注、点赞、分享)!
LinkOS 公益运营的 docker.io、gcr.io、ghcr.io、quay.io、registry.k8s.io 镜像仓库加速服务。
CodeReadingNote pro supports jetbrains22.1.4+, code remark, custom tags, tags grouping topic, ongoing maintenance
阿里x82y231已还原纯算,阿里滑块x82y227,228,140算法,瑞数456vmp全套,京东登录cookie、m端wskey,cookie,小红书x-s,x-comm,头条、巨量、抖音a-b,x-b,补环境dom框架,京粉sign,拼多多(PDD)的anti-content、boss直聘zptoken协议、易盾全套,阿里2.0盾识别算法,爱马仕的datahome,魔改浏览器过cdp检…
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。
各种 app 逆向爬虫数据接口。抖音,小红书, 快手 ,京东, 美团 ,饿了么 ,大众点评, douyin xiaohongshu kuaishou jingdong meituan eleme dianping 抖音数据 ,美团数据 ,小红书数据, 快手数据, 点评数据。爬虫。抖音爬虫。小红书爬虫。饿了么爬虫。快手爬虫。点评爬虫。 得物。得物爬虫
A generative speech model for daily dialogue.