Skip to content
View AaronZ345's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report AaronZ345

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
AaronZ345/README.md

Hi there 👋

I am a PhD student at the College of Computer Science and Technology, Zhejiang University (浙江大学计算机学院).

I am now working on the Audio Research Team at Zhejiang University, under the supervision of Prof. Zhou Zhao (赵洲). My current research focuses on spatial audio generation based on multi-modal prompts.

I graduated from Chu Kochen Honors College, Zhejiang University (浙江大学竺可桢学院), with dual bachelor's degrees in Computer Science and Automation.
I also worked as a visiting scholar at University of Massachusetts Amherst, collaborating with Prof. Przemyslaw Grabowicz.

My research interests primarily focus on Multi-Modal Generative AI, specifically in Singing and Music Synthesis, and Spatial Audio Generation. I have published first-author papers at top international AI conferences, including NeurIPS, AAAI, and EMNLP.

I am actively seeking postdoctoral positions and research collaborations. Please feel free to contact me via email at [email protected].

📎 Homepages

💻 Research Papers

🎙 Singing Voice Synthesis


Anurag's GitHub stats Top Langs

Pinned Loading

  1. StyleSinger StyleSinger Public

    PyTorch Implementation of StyleSinger(AAAI 2024): Style Transfer for Out-of-Domain Singing Voice Synthesis

    Python 357 37

  2. GTSinger/GTSinger GTSinger/GTSinger Public

    Dataset and code of GTSinger(NeurIPS 2024 Spotlight): A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

    Python 256 9

  3. TCSinger TCSinger Public

    PyTorch Implementation of TCSinger(EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control

    Python 286 49