shipley223

🎯

Focusing

shipley shipley223

🎯

Focusing

2 followers · 3 following

GOKE
Hunan, Changsha
01:10 (UTC +08:00)

Lists (5)

Sort

Stars

albumentations-team / albumentations

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 14,577 1,663 Updated Feb 14, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,160 4,845 Updated Feb 14, 2025

karolpiczak / ESC-50

ESC-50: Dataset for Environmental Sound Classification

Python 1,471 291 Updated Mar 20, 2024

oldratlee / useful-scripts

🐌 useful scripts for making developer's everyday life easier and happier, involved java, shell etc.

Shell 7,364 2,811 Updated Sep 3, 2024

jiaaro / pydub

Manipulate audio with a simple and easy high level interface

Python 9,177 1,071 Updated Jul 25, 2024

ankitshah009 / Task-4-Large-scale-weakly-supervised-sound-event-detection-for-smart-cars

Task 4 Large-scale weakly supervised sound event detection for smart cars

Python 65 31 Updated Dec 20, 2021

awni / speech

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Python 756 177 Updated Jul 6, 2023

parlance / ctcdecode

PyTorch CTC Decoder bindings

C++ 831 247 Updated Apr 4, 2024

SpeechColab / GigaSpeech

Large, modern dataset for speech recognition

Shell 662 62 Updated Feb 26, 2024

wenet-e2e / WenetSpeech

A 10000+ hours dataset for Chinese speech recognition

Shell 517 49 Updated Jul 3, 2023

pkufool / open-commands

Shell 10 1 Updated Mar 25, 2024

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,775 2,215 Updated Feb 5, 2025

kpu / kenlm

KenLM: Faster and Smaller Language Model Queries

C++ 2,556 514 Updated Jul 30, 2024

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,301 1,099 Updated Feb 10, 2025

asteroid-team / torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 996 91 Updated Jan 15, 2025

iver56 / audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Python 1,949 193 Updated Feb 14, 2025

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,491 1,880 Updated Feb 8, 2025

PaddlePaddle / Anakin

High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.

C++ 532 134 Updated Sep 23, 2022

christianversloot / machine-learning-articles

🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.

3,532 753 Updated Jun 28, 2024

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,828 232 Updated Jun 6, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 8,116 841 Updated Feb 13, 2025

tensorflow / models

Models and examples built with TensorFlow

Python 77,371 45,698 Updated Feb 11, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,517 5,647 Updated Feb 15, 2025

KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

Python 1,052 91 Updated Feb 13, 2024

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 139,390 27,939 Updated Feb 14, 2025

LCAV / pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,511 440 Updated Jan 3, 2025

audeering / opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

C++ 632 80 Updated Oct 19, 2023

WenzheLiu-Speech / awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

1,087 223 Updated Nov 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly