Skip to content
View shipley223's full-sized avatar
🎯
Focusing
🎯
Focusing
  • GOKE
  • Hunan, Changsha
  • 01:10 (UTC +08:00)

Block or report shipley223

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 14,577 1,663 Updated Feb 14, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,160 4,845 Updated Feb 14, 2025

ESC-50: Dataset for Environmental Sound Classification

Python 1,471 291 Updated Mar 20, 2024

🐌 useful scripts for making developer's everyday life easier and happier, involved java, shell etc.

Shell 7,364 2,811 Updated Sep 3, 2024

Manipulate audio with a simple and easy high level interface

Python 9,177 1,071 Updated Jul 25, 2024

Task 4 Large-scale weakly supervised sound event detection for smart cars

Python 65 31 Updated Dec 20, 2021

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python 756 177 Updated Jul 6, 2023

PyTorch CTC Decoder bindings

C++ 831 247 Updated Apr 4, 2024

Large, modern dataset for speech recognition

Shell 662 62 Updated Feb 26, 2024

A 10000+ hours dataset for Chinese speech recognition

Shell 517 49 Updated Jul 3, 2023
Shell 10 1 Updated Mar 25, 2024

End-to-End Speech Processing Toolkit

Python 8,775 2,215 Updated Feb 5, 2025

KenLM: Faster and Smaller Language Model Queries

C++ 2,556 514 Updated Jul 30, 2024

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,301 1,099 Updated Feb 10, 2025

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 996 91 Updated Jan 15, 2025

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,949 193 Updated Feb 14, 2025

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,491 1,880 Updated Feb 8, 2025

High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.

C++ 532 134 Updated Sep 23, 2022

🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.

3,532 753 Updated Jun 28, 2024

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,828 232 Updated Jun 6, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,116 841 Updated Feb 13, 2025

Models and examples built with TensorFlow

Python 77,371 45,698 Updated Feb 11, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,517 5,647 Updated Feb 15, 2025

Audio processing by using pytorch 1D convolution network

Python 1,052 91 Updated Feb 13, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 139,390 27,939 Updated Feb 14, 2025

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,511 440 Updated Jan 3, 2025

The Munich Open-Source Large-Scale Multimedia Feature Extractor

C++ 632 80 Updated Oct 19, 2023

speech enhancement\speech seperation\sound source localization

1,087 223 Updated Nov 14, 2023