Skip to content
View mawwalker's full-sized avatar

Block or report mawwalker

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,600 669 Updated Mar 3, 2025

Faster Whisper transcription with CTranslate2

Python 14,497 1,222 Updated Jan 1, 2025

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 963 158 Updated Jan 22, 2025

Speech-to-text, text-to-speech, speaker diarization, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC…

C++ 5,086 570 Updated Mar 3, 2025

数字人资料整理

709 82 Updated Jan 8, 2025

AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊…

Python 3,555 549 Updated Feb 26, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,034 355 Updated Feb 27, 2025

Pseudo Streaming SenseVoice with Hotwords

Python 201 25 Updated Feb 26, 2025

A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's CoT reasoning traces with Anthropic Claude models.

Rust 4,612 361 Updated Feb 4, 2025

Memory-Guided Diffusion for Expressive Talking Video Generation

Python 153 8 Updated Dec 18, 2024

Open Source framework for voice and multimodal conversational AI

Python 4,979 574 Updated Mar 3, 2025

Taming Stable Diffusion for Lip Sync!

Python 2,769 409 Updated Jan 19, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 18,658 2,125 Updated Feb 27, 2025

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Python 4,666 536 Updated Mar 3, 2025

AIOS: AI Agent Operating System

Python 3,879 474 Updated Mar 3, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 64,224 15,254 Updated Mar 3, 2025

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,614 5,345 Updated Jan 28, 2025

real time face swap and one-click video deepfake with only a single image

Python 44,395 6,536 Updated Feb 19, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 18,320 2,259 Updated Mar 4, 2025

Build Multimodal AI Agents with memory, knowledge and tools. Simple, fast and model-agnostic.

Python 19,765 2,648 Updated Mar 3, 2025

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 3,590 448 Updated Nov 27, 2024

Darwin/macOS emulation layer for Linux

Objective-C 11,755 448 Updated Feb 16, 2025

Simple Python version management

Roff 41,017 3,124 Updated Feb 28, 2025

Truly independent web browser

C++ 33,260 1,384 Updated Mar 3, 2025

Low-code platform for building business applications. Connect to databases, cloud storages, GraphQL, API endpoints, Airtable, Google sheets, OpenAI, etc and build apps using drag and drop applicati…

JavaScript 34,871 4,469 Updated Mar 3, 2025

Investment Research for Everyone, Everywhere.

Python 36,600 3,318 Updated Mar 3, 2025

PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis

Python 14,274 752 Updated Feb 26, 2025

🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用

Rust 36,154 6,438 Updated Mar 1, 2025

Collection of publicly available IPTV channels from all over the world

JavaScript 90,689 3,202 Updated Mar 4, 2025

There can be more than Notion and Miro. AFFiNE(pronounced [ə‘fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable an…

TypeScript 46,198 3,060 Updated Mar 4, 2025
Next