Spherical residual vector quantization (SRVQ)

yluo42/SRVQ



This repository contains a minimalist PyTorch implementation of the spherical residual vector quantization (SRVQ) module used in our Gull neural audio codec framework. Find the demo page here.

SRVQ modifies standard residual vector quantization (RVQ) to better quantize unit-norm inputs. The first hierarchy (R=1) uses unit-norm codebooks with the standard VQ-VAE selection and update scheme, while the remaining hierarchies (R>1) use rotation matrices defined by Householder transformations, treating learnable reflection matrices as learnable rotation matrices.
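The two building blocks described above can be illustrated with a short PyTorch sketch. This is not the code from this repository: the helper names `nearest_unit_code` and `householder`, the tensor sizes, and the closed-form choice of the reflection vector are assumptions made for illustration only. In SRVQ the reflections are learnable, whereas here the vector is computed directly from the target, just to show that a Householder matrix moves one unit vector onto another while staying on the unit sphere.

```python
import torch
import torch.nn.functional as F


def nearest_unit_code(x, codebook):
    """For each unit-norm row of x, return the closest unit-norm codebook row."""
    codebook = F.normalize(codebook, dim=-1)
    # For unit vectors, maximum cosine similarity equals minimum Euclidean distance.
    idx = (x @ codebook.t()).argmax(dim=-1)
    return codebook[idx], idx


def householder(v):
    """Householder matrix H = I - 2 v v^T / (v^T v) for a 1-D vector v."""
    v = v / v.norm()
    return torch.eye(v.numel(), dtype=v.dtype) - 2.0 * torch.outer(v, v)


torch.manual_seed(0)
dim, num_codes = 8, 16  # hypothetical sizes
x = F.normalize(torch.randn(4, dim, dtype=torch.float64), dim=-1)
codebook = torch.randn(num_codes, dim, dtype=torch.float64)

# First hierarchy (R=1): nearest unit-norm code.
q, idx = nearest_unit_code(x, codebook)

# Later hierarchies (R>1) refine with reflections. Illustration only:
# the reflection built from v = q[0] - x[0] maps q[0] exactly onto x[0].
H = householder(q[0] - x[0])
rotated = H @ q[0]
```

Because `H` is orthogonal, the refined vector keeps unit norm, which is why reflections are a natural fit for residual refinement on the sphere. A learned reflection would generally only move the first-stage code closer to the input rather than recover it exactly.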

Reference

If you use SRVQ in your project, please consider citing the following paper:

@article{luo2024gull,
  title={Gull: A Generative Multifunctional Audio Codec},
  author={Luo, Yi and Yu, Jianwei and Chen, Hangting and Gu, Rongzhi and Weng, Chao},
  journal={arXiv preprint arXiv:2404.04947},
  year={2024}
}
