-
Institute of Acoustics, CAS
- 中国
-
18:26
(UTC +08:00) - E-mail: [email protected]
Starred repositories
High-efficiency floating-point neural network inference operators for mobile, server, and Web
An opensource OpenWrt variant for mainland China users.
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
Denoising Diffusion Probabilistic Models
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
[NeurIPS 2022] Denoising Diffusion Restoration Models -- Official Code Repository
Production First and Production Ready End-to-End Speech Recognition Toolkit
The official implementation of GTCRN, an ultra-lite speech enhancement model.
Official Repository of "Unpaired Image-to-Image Translation via Neural Schrödinger Bridge" (ICLR 2024)
[ICCV 2023] Code for "Deformable Model-Driven Neural Rendering for High-Fidelity 3D Reconstruction of Human Heads Under Low-View Settings"
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
Python implementation of performance metrics in Loizou's Speech Enhancement book
Structured state space sequence models
SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting (NeurIPS@2023 Spotlight, TPAMI@2024)
A collection of resources and papers on Diffusion Models
Likelihood Training of Schrödinger Bridge using FBSDEs Theory, ICLR 2022
Speech Enhancement Generative Adversarial Network in TensorFlow
On-device noise suppression powered by deep learning