Stars
No fortress, purely open ground. OpenManus is Coming.
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seaml…
A cross-compilation environment Docker image that can be used like a Docker multiarch image without a speed penalty.
Robust Speech Recognition via Large-Scale Weak Supervision
Sparsity-aware deep learning inference runtime for CPUs
Acceptance rates for the major AI conferences
Code for STFT Transformer used in BirdCLEF 2021 competition.
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Exclude machine-translated sites of Stack Exchange from Google search results
A state-of-the-art time-domain network for speech separation that also performs well on speech enhancement and music separation
An open-source speech separation and enhancement library
Unofficial complex tensor and scalar support for PyTorch
The code used for TASLP 2019. The latest version is available in SoundSourceSeparation repository.
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Real-time GCC-NMF Blind Speech Separation and Enhancement
Code to do blind source separation with more microphones than sources using auxiliary-based independent vector analysis.