Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 58,874 8,728 Updated Jan 14, 2025

johnma2006 / mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,685 198 Updated Mar 8, 2024

huggingface / huggingface_hub

The official Python client for the Huggingface Hub.

Python 2,226 582 Updated Jan 16, 2025

databricks / dbrx

Code examples and resources for DBRX, a large language model developed by Databricks

Python 2,528 240 Updated May 1, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,386 1,871 Updated Jan 16, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,685 2,205 Updated Jan 15, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 36,874 4,560 Updated Aug 16, 2024

Executedone / Chinese-FastSpeech2

基于标贝数据继续训练，同时对原本的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏

Python 253 42 Updated Sep 10, 2023

ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,907 548 Updated Oct 27, 2023

redb17 / youtube-shorts-generator

YouTube Shorts videos generator using ChatGPT, Bing, GTTS, OpenAI's Whisper, MoviePy and Python

Python 65 8 Updated Aug 4, 2023

pndurette / gTTS

Python library and CLI tool to interface with Google Translate's text-to-speech API

Python 2,371 363 Updated Dec 23, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,368 1,452 Updated Jan 14, 2025

nateshmbhat / pyttsx3

Offline Text To Speech synthesis for python

Python 2,200 344 Updated Jan 5, 2025

Renovamen / Speech-and-Text

Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字（PocketSphinx、百度 API、科大讯飞 API）和文字转语音（pyttsx3）

Python 314 73 Updated Jun 3, 2019

jianchang512 / stt

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具，输出json、srt字幕、纯文字格式

Python 2,826 309 Updated Dec 5, 2024

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Peng Sun sunpeng981712364

Starred repositories

cityscapes

kaggle

lane-detection

tracking-by-detection

multi-object-tracking

mot

toolbox

active-learning

Machine learning

optical-flow