We aim to build a web-based app that can perform visual lip reading to help vocally impaired people communicate.
For this project, we are employing HCNN(Hahn-based Convolutional Neural Networks) in learning lip shape to character translation (letters and digits). Hahn orthogonal moments were used for feature extraction and input layer before being passed to the convolution layers
In this version, we use Python 3.9, PyTorch 1.9.0 and dlib 19.22.1
pip install requirements.txt
For MacOS
Once you have Homebrew installed install ffmpeg from the terminal with the following:
brew install ffmpeg
For Windows
For Linux
sudo apt install ffmpeg
Using Kaggle
kaggle datasets download -d codebreaker619/face-landmark-shape-predictor
Copy file to lip-reading-service/src/ALR/resources
Hahn Discrete Orthogonal Polynomials