Stars
Deep Neural Network for Speaker Count Estimation
"DeepDPM: Deep Clustering With An Unknown Number of Clusters" [Ronen, Finder, and Freifeld, CVPR 2022]
“A Robust Learning Membership Scaling Fuzzy C-Means Algorithm Based on New Belief Peak“. Fuzzy clustering algorithm including RL_MFCM,RL_FCM,DI-FCM,DI-FSC,OFCM,SPFCM...
Code of Entropy-k-means: Entropy K-Means Clustering With Feature Reduction Under Unknown Number of Clusters
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.