mapleee

Amos Gee mapleee

Stars

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 4,347 243 Updated Jan 9, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,725 67 Updated Jan 2, 2025

hkchengrex / MMAudio

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 911 94 Updated Jan 9, 2025

THUDM / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,208 953 Updated Jan 8, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,178 546 Updated Jan 2, 2025

xszyou / Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

JavaScript 9,670 1,844 Updated Jan 8, 2025

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 19,278 1,362 Updated Dec 31, 2024

lllyasviel / Paints-UNDO

Understand Human Behavior to Align True Needs

Python 3,630 325 Updated Jul 20, 2024

ZCSSR / url

ZCSSR新地址发布页

777 101 Updated Dec 25, 2024

SunoAI-API / Suno-API

Create Music in Seconds with SunoAPI.

Python 1,535 231 Updated Nov 25, 2024

gcui-art / suno-api

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

TypeScript 1,643 390 Updated Nov 27, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 33,511 3,637 Updated Jan 7, 2025

fishaudio / fish-speech

SOTA Open Source TTS

Python 18,212 1,362 Updated Jan 4, 2025

yiyizym / lrc_editor

一个简单的在线制作歌词的小工具

JavaScript 106 14 Updated Dec 29, 2022

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 38,463 4,346 Updated Jan 2, 2025

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,248 117 Updated Jul 11, 2024

cumulo-autumn / StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,850 719 Updated Dec 4, 2024

scrapy / scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python 53,737 10,612 Updated Jan 8, 2025

Boris-code / feapder

🚀🚀🚀feapder is an easy to use, powerful crawler framework | feapder是一款上手简单，功能强大的Python爬虫框架。内置AirSpider、Spider、TaskSpider、BatchSpider四种爬虫解决不同场景的需求。且支持断点续爬、监控报警、浏览器渲染、海量数据去重等功能。更有功能强大的爬虫管理系统feaplat为其提…

Python 3,071 494 Updated Nov 6, 2024

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,546 112 Updated Jul 5, 2024

timedomain-tech / ACE_sequence_file

Open-source file format designed for high-quality, customizable singing synthesis.

Python 12 5 Updated Dec 19, 2024

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 2,819 274 Updated Jan 9, 2025

CNChTu / Diffusion-SVC

Python 429 60 Updated Nov 18, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 8,163 1,153 Updated Jan 9, 2025

labuladong / fucking-algorithm

刷算法全靠套路，认准 labuladong 就够了！English version supported! Crack LeetCode, not only how, but also why.

Markdown 126,458 23,282 Updated Sep 22, 2024

hellojxt / NeuralAE

Python 1 1 Updated Sep 7, 2023

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,280 2,201 Updated Nov 11, 2024

yy1lab / Lyrics-Conditioned-Neural-Melody-Generation

Jupyter Notebook 425 61 Updated May 10, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,552 2,579 Updated Jan 7, 2025

yt-dlp / yt-dlp

A feature-rich command-line audio/video downloader

Python 95,764 7,513 Updated Dec 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly