-
GOKE
- Hunan, Changsha
-
01:10
(UTC +08:00)
Lists (5)
Sort Name ascending (A-Z)
Stars
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
ESC-50: Dataset for Environmental Sound Classification
🐌 useful scripts for making developer's everyday life easier and happier, involved java, shell etc.
Manipulate audio with a simple and easy high level interface
Task 4 Large-scale weakly supervised sound event detection for smart cars
A PyTorch Implementation of End-to-End Models for Speech-to-Text
Large, modern dataset for speech recognition
A 10000+ hours dataset for Chinese speech recognition
Production First and Production Ready End-to-End Speech Recognition Toolkit
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.
🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Models and examples built with TensorFlow
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Audio processing by using pytorch 1D convolution network
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
The Munich Open-Source Large-Scale Multimedia Feature Extractor
speech enhancement\speech seperation\sound source localization