Skip to content

v-perfilev/speech_denoiser

Repository files navigation

Speech Denoiser

This app utilizes PyTorch to denoise human speech, distinguishing it from background noise.

For training the model, speech datasets from Mozilla Common Voice and environmental sounds from UrbanSound8K were used.

Quick Start

  1. Clone the repository:
git clone https://github.com/v-perfilev/speech_denoiser.git
  1. Install the required packages:
pip install -r requirements.txt
  1. Copy dataset with clean and noisy sound samples into the ../_datasets/ directory. To generate datasets you can use my another project https://github.com/v-perfilev/audio_dataset_handler.git.

  2. Train the model by running the model_training.ipynb notebook.

  3. Run the app:

python usage_example.py

Features

  • Real-time speech detection using a pretrained neural network model.
  • Supports multiple microphone inputs.
  • Lightweight and easy to deploy.

Requirements

  • ffmpeg (!!!)
  • numpy
  • matplotlib
  • torchaudio
  • pyaudio
  • soundfile
  • torch

About

Speech denoising app leveraging PyTorch

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published