Speaker Recognition

Data comes from the LibriSpeech ASR corpus

http://www.openslr.org/12/

File: train-clean-100.tar.gz (6.3G)(training set of 100 hours "clean" speech)

5 male speakers and 5 female speakers.

Deep Feed Forward Neural Network built with PyBrain:

Mean accuracy (over 10 speakers) is 92% with 1 second of voice data, compared to 10% for random guessing. With 3 seconds of voice data, the mean accuracy is 99%. With 200 milliseconds of data, the accuracy is 74%.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
LibriSpeech		LibriSpeech
networks		networks
README.md		README.md
min_max_values.dat		min_max_values.dat
speakerId.ipynb		speakerId.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speaker Recognition

About

Releases

Packages

Languages

aravindnatarajan/SpeakerRecognition

Folders and files

Latest commit

History

Repository files navigation

Speaker Recognition

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages