Evenchow

Evenchow Evenchow

open and happy

china shenzhen

Stars

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,018 4,171 Updated Dec 26, 2024

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

795 48 Updated Dec 21, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python 17,756 1,328 Updated Dec 25, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,410 1,075 Updated Dec 27, 2024

crewAIInc / crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 22,969 3,172 Updated Dec 27, 2024

toeverything / AFFiNE

There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…

TypeScript 43,626 2,849 Updated Dec 27, 2024

hankinghu / literature-books

书籍txt

994 356 Updated Nov 11, 2019

deepset-ai / haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

Python 18,298 1,965 Updated Dec 25, 2024

HKUDS / LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,289 1,666 Updated Dec 27, 2024

geekan / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 46,193 5,491 Updated Dec 18, 2024

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 17,202 1,753 Updated Oct 15, 2024

OpenSPG / KAG

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 1,520 111 Updated Dec 27, 2024

lihuithe / podlm-public

Python 531 67 Updated Oct 26, 2024

xinchen-ai / Westlake-Omni

Python 187 17 Updated Sep 24, 2024

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,779 760 Updated Jun 24, 2024

wzpan / wukong-robot

🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，支持ChatGPT多轮对话能力，还可能是首个支持脑机交互的开源智能音箱项目。

Python 6,463 1,352 Updated Oct 25, 2024

heshengtao / comfyui_LLM_party

LLM Agent Framework in ComfyUI includes Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces, such as…

Python 1,154 102 Updated Dec 27, 2024

rany2 / edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 6,638 653 Updated Dec 26, 2024

kyutai-labs / moshi

Python 7,051 550 Updated Dec 20, 2024

MadcowD / ell

A language model programming library.

Python 5,479 324 Updated Dec 18, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,687 185 Updated Nov 14, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 7,492 798 Updated Dec 26, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 4,821 496 Updated Dec 10, 2024

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,637 389 Updated Dec 4, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 37,386 4,247 Updated Dec 19, 2024

t41372 / Open-LLM-VTuber

Talk to any LLM with hands-free voice interaction, voice interruption, and Live2D taking face running locally across platforms

Python 1,672 161 Updated Dec 27, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 33,198 3,613 Updated Dec 3, 2024

6drf21e / ChatTTS_colab

🚀 一键部署（含离线整合包）！基于 ChatTTS ，支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用，无需复杂安装。

Python 2,174 274 Updated Jul 2, 2024

run-llama / llama_parse

Parse files for optimal RAG

Python 3,424 334 Updated Dec 18, 2024

hwchase17 / auto-openai-prompter

Python 246 33 Updated Dec 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly