Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 4,347 243 Updated Jan 9, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,725 67 Updated Jan 2, 2025

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 911 94 Updated Jan 9, 2025

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,208 953 Updated Jan 8, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,178 546 Updated Jan 2, 2025

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

JavaScript 9,670 1,844 Updated Jan 8, 2025

Official inference repo for FLUX.1 models

Python 19,278 1,362 Updated Dec 31, 2024

Understand Human Behavior to Align True Needs

Python 3,630 325 Updated Jul 20, 2024

ZCSSR new address release page

777 101 Updated Dec 25, 2024

Create Music in Seconds with SunoAPI.

Python 1,535 231 Updated Nov 25, 2024

Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.

TypeScript 1,643 390 Updated Nov 27, 2024

A generative speech model for daily dialogue.

Python 33,511 3,637 Updated Jan 7, 2025

SOTA Open Source TTS

Python 18,212 1,362 Updated Jan 4, 2025

A simple online tool for creating lyrics

JavaScript 106 14 Updated Dec 29, 2022

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 38,463 4,346 Updated Jan 2, 2025

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,248 117 Updated Jul 11, 2024

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,850 719 Updated Dec 4, 2024

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python 53,737 10,612 Updated Jan 8, 2025

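🚀🚀🚀 feapder is an easy-to-use, powerful Python crawler framework. It has four built-in spiders (AirSpider, Spider, TaskSpider, BatchSpider) to cover different scenarios, and supports resumable crawling, monitoring and alerting, browser rendering, and large-scale data deduplication. The powerful crawler management system feaplat additionally provides it with…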

Python 3,071 494 Updated Nov 6, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,546 112 Updated Jul 5, 2024

Open-source file format designed for high-quality, customizable singing synthesis.

Python 12 5 Updated Dec 19, 2024

Generative models for conditional audio generation

Python 2,819 274 Updated Jan 9, 2025
Python 429 60 Updated Nov 18, 2024

vits2 backbone with multilingual-bert

Python 8,163 1,153 Updated Jan 9, 2025

Algorithm practice is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Markdown 126,458 23,282 Updated Sep 22, 2024
Python 1 1 Updated Sep 7, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,280 2,201 Updated Nov 11, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,552 2,579 Updated Jan 7, 2025

A feature-rich command-line audio/video downloader

Python 95,764 7,513 Updated Dec 26, 2024