Skip to content
View tuteng0915's full-sized avatar
  • Tsinghua U
  • Beijing

Block or report tuteng0915

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 21,108 6,117 Updated Mar 23, 2025

MT3: Multi-Task Multitrack Music Transcription

Python 1,502 200 Updated Mar 14, 2025
Python 2 Updated Jan 19, 2025

Towards Modality Generalization: A Benchmark and Prospective Analysis

Python 21 1 Updated Feb 9, 2025

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 3,145 196 Updated Mar 25, 2025

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Python 313 38 Updated Apr 8, 2024

[ICLR2023] Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation (CDCD).

Python 155 11 Updated Apr 5, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,703 2,278 Updated Mar 13, 2025

ImageBind One Embedding Space to Bind Them All

Python 8,564 801 Updated Jul 31, 2024

MU-LLaMA: Music Understanding Large Language Model

Python 272 21 Updated Mar 25, 2024

Evaluation functions for music/audio information retrieval/signal processing algorithms.

Python 636 117 Updated Feb 25, 2025

A curated list of Video to Audio Generation

33 1 Updated Oct 17, 2024

Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model

Python 160 22 Updated Jul 30, 2024

Manually annotated chord data set of US pop songs and Popular Music Collection of RWC Music Database

Python 87 13 Updated Apr 9, 2013

SD-Trainer. LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.

Python 5,109 619 Updated Mar 24, 2025

A large-scale dataset of caption-annotated MIDI files.

Python 60 3 Updated Jul 23, 2024
Jupyter Notebook 166 10 Updated Jul 5, 2024

This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.

Python 218 12 Updated Jul 25, 2024

The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.

Jupyter Notebook 150 5 Updated Dec 22, 2023

Stable Diffusion web UI

Python 149,902 27,945 Updated Mar 4, 2025
Python 20 2 Updated Jan 16, 2025

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Python 38,260 3,931 Updated Mar 11, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 45,231 5,530 Updated Mar 25, 2025

A curated list of awesome 3d generation papers

1,130 56 Updated Mar 9, 2023
Python 2 Updated Nov 24, 2023

Responsive Resume Cv Website Using HTML CSS And JavaScript

HTML 296 172 Updated Mar 31, 2024

A modern static resume template and theme. Powered by Jekyll and GitHub pages.

HTML 2,146 1,435 Updated Jun 15, 2024

[ICCV 2023] Online Clustered Codebook

Python 162 10 Updated Sep 19, 2024
Next