Wavoice

Pytorch implementation of our proposed system for noise-resistant multi-modal speech recognition system via fusing mmWave and audio signals.

Wavoice: A Noise-resistant Multi-modal Speech Recognition System Fusing mmWave and Audio Signals.

Tiantian Liu,Gao Ming,Feng Lin,Chao Wang,Zhongjie Ba,Jinsong Han,Wenyao Xu,Kui Ren in SenSys2021

Prerequisites

Linux
Python 3.8
NVIDIA GPU

Getting Started

Installation

Install PyTorch 1.8.0 and dependencies from http://pytorch.org
Install some python libraries

git clone https://github.com/TitaniumLiu/Wavoice.git
pip install -r requirements.txt

Preprocessing

Before training the model, please preprocess the mmWave and speech signal.The dataset comprises: train_mmwave,train_voice,test_mmwave,and test_voice. Please run the stript util/prepare_Wavoice.py after seting the right path to the dataset in the stript. This will create a processed/ folder containing three csvs files that save training paths, testing paths,and character label files

Training

Modify parameters in config/Wavoice-config.yaml to train and test the model. Make sure the path to the processed/ right in the -config.yaml.
Tain a model

python train.py --config_path config/Wavoice-config.yaml

Training with your own dataset

Please prepare the mmWave/audio dataset and the corresponding groundtruth according to LibriSpeech

Citation

If you find this useful for your research, please use the following.

@inproceedings{liu2021Wavoice,
  title={A Noise-resistant Multi-modal Speech Recognition System Fusing mmWave and Audio Signals},
  author={Tiantian Liu,Gao Ming,Feng Lin,Chao Wang,Zhongjie Ba,Jinsong Han,Wenyao Xu,Kui Ren}, 
  booktitle={Proceedings of the 19th Conference on Embedded Networked Sensor Systems (SenSys)},
  year={2021}
}

Reference

The codes for Lisen, Attend, and Speller(LAS) in the system is borrowed from https://github.com/jiwidi/las-pytorch

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
config		config
model		model
solver		solver
utils		utils
README.md		README.md
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wavoice

Prerequisites

Getting Started

Installation

Preprocessing

Training

Training with your own dataset

Citation

Reference

About

Releases

Packages

Languages

TitaniumLiu/Wavoice

Folders and files

Latest commit

History

Repository files navigation

Wavoice

Prerequisites

Getting Started

Installation

Preprocessing

Training

Training with your own dataset

Citation

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages