GitHub - Demfier/pmup: App to cheer you up with some awesome quotes when depressed using deep learning

An app to cheer you up in those hard times

About

Got the Monday blues?

Pmup (pronounced pŭmp) is the perfect "Pick Me Up" app for you. It uses advanced machine learning and artificial intelligence methods to detect the slightest hint of sadness in your voice and motivate you. In short, it listens, analyses and appreciates.

Text alone can be ambiguous (for example, "I am okay" can be spoken in a variety of ways), so we also derive features from the way you speak. Behind the scenes, Pmup uses two models to analyze the speech input:

Acoustic: It parses the audio data from the microphone and extracts the Mel-Frequency Cepstrum (MFC). The MFC captures various properties of the sound, such as loudness and pitch. It can be thought of as a set of features derived from the spectrogram, which is the de facto visualisation for spoken audio data. We feed the MFC features to a Convolutional Neural Network (CNN), which classifies the data into various emotions by generating a probability distribution over the classes. The model is trained on two open-source datasets, RAVDESS and SAVEE, and has an accuracy of ~75%.
Text: Next, it uses the HTML5 speech-to-text engine to convert the captured audio to text. Here we've implemented an optimized version of "Compositional coding capsule network with k-means routing for text classification", one of the state-of-the-art methods for sentiment analysis in text. It uses capsule networks for multi-class classification of the text data. This model is trained on the Yelp reviews and Amazon polarity datasets, and has a test accuracy of ~94%.
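The last step of the acoustic model, turning raw class scores into a probability distribution over emotions, can be illustrated with a softmax. This is only a minimal sketch and not the repository's code: the emotion labels and the scores are hypothetical, since the exact classes used by Pmup are not listed here.

```python
import math

# Hypothetical emotion labels; the classes Pmup actually predicts may differ.
EMOTIONS = ["happy", "sad", "angry", "neutral"]


def softmax(scores):
    """Convert raw class scores (logits) into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def classify(scores, labels=EMOTIONS):
    """Return the most likely label and the full {label: probability} map."""
    dist = dict(zip(labels, softmax(scores)))
    return max(dist, key=dist.get), dist


if __name__ == "__main__":
    # Made-up scores standing in for the CNN's final-layer output.
    label, dist = classify([0.2, 2.1, -0.5, 1.0])
    print(label, dist)
```

Keeping the full distribution, rather than only the top label, is what makes it possible to combine this output with a second model later.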

Finally, we combine predictions from both models to estimate emotions in real time. Pmup then uses this inferred emotion to tell you jokes, share motivational quotes, or just be your pal.
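One simple way to combine two classifiers is late fusion: a weighted average of their probability distributions. The sketch below assumes both models score the same set of labels, which may not match how Pmup actually merges its models. The function names, the weight, and the emotion-to-response mapping are all hypothetical.

```python
# Hypothetical mapping from inferred emotion to the kind of response.
RESPONSES = {"sad": "motivational quote", "happy": "joke"}
DEFAULT_RESPONSE = "friendly chat"


def fuse(acoustic, text, acoustic_weight=0.5):
    """Weighted average of two {emotion: probability} maps over the same labels."""
    if acoustic.keys() != text.keys():
        raise ValueError("both models must score the same emotion labels")
    if not 0.0 <= acoustic_weight <= 1.0:
        raise ValueError("acoustic_weight must be between 0 and 1")
    w = acoustic_weight
    return {k: w * acoustic[k] + (1.0 - w) * text[k] for k in acoustic}


def pick_response(acoustic, text, acoustic_weight=0.5):
    """Fuse both predictions and choose a response type for the top emotion."""
    fused = fuse(acoustic, text, acoustic_weight)
    emotion = max(fused, key=fused.get)
    return emotion, RESPONSES.get(emotion, DEFAULT_RESPONSE)


if __name__ == "__main__":
    acoustic = {"sad": 0.7, "happy": 0.2, "neutral": 0.1}  # made-up values
    text = {"sad": 0.2, "happy": 0.5, "neutral": 0.3}      # made-up values
    print(pick_response(acoustic, text))
```

The weight lets the more reliable model dominate; an equal split is just a neutral starting point.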

Setup instructions

Although we plan to host the app online soon, you can follow the steps below to run the app locally:

1. Clone this repository with git clone git@github.com:Demfier/pmup.git and switch to its root directory with cd pmup
2. Install and set up the following dependencies on your system (using virtualenv is highly recommended):
   - NodeJS
   - Python3
   - Flask
   - PyTorch
   - TensorFlow
   - NumPy
   - Pandas
   - Keras (<=2.1.3)
   - Scikit-learn
   - Librosa
3. Download the pretrained model for text from here and put the files yelp.pth and sentence_encoder inside the webserver folder
4. Run node index.js from the ux folder to start your node server
5. Open another terminal and run flask run -p 8080 from the webserver folder
6. Navigate to https://localhost:5000 in your browser
7. Tap the mic and start speaking
8. Once you are finished, tap the mic again

Depending on the emotion our models detect, you'll hear either a motivational quote (if you sound sad) or a super-funny joke 😉

Privacy

Out of respect for your privacy, we never store any voice data or do any voice fingerprinting. All voice data is analysed instantaneously and then deleted.
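An analyse-then-delete policy like this can be enforced in code by removing the audio in a finally block, so it is discarded even if analysis fails. The sketch below is an illustration under assumptions, not Pmup's actual server code: analyse_and_discard and the analyse callback are hypothetical names.

```python
import os
import tempfile


def analyse_and_discard(audio_bytes, analyse):
    """Write audio to a temporary file, run `analyse(path)`, then delete the file.

    `analyse` is a hypothetical callback standing in for the emotion models.
    """
    fd, path = tempfile.mkstemp(suffix=".wav")
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(audio_bytes)
        return analyse(path)
    finally:
        # Runs on success and on error, so no recording is left behind.
        if os.path.exists(path):
            os.remove(path)


if __name__ == "__main__":
    size = analyse_and_discard(b"\x00" * 16, os.path.getsize)
    print(size)
```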

Next steps

We found that combining results from both audio and text helps, but the network still seems to be very shallow. We will therefore attempt to build an end-to-end deep learning model that processes text and audio at the same time, using fusion techniques such as LMF to learn a shared representation for the different modalities.
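To give a feel for what such a fusion layer computes, here is a small pure-Python sketch of low-rank bilinear fusion of an audio vector and a text vector. It assumes "LMF" refers to Low-rank Multimodal Fusion. The dimensions, the rank, and the random (untrained) factors are hypothetical, and this is not the model the team plans to build.

```python
import random


def project(z, factor):
    """Multiply vector z (length d) by a d x out matrix given as nested lists."""
    out = len(factor[0])
    return [sum(z[i] * factor[i][j] for i in range(len(z))) for j in range(out)]


def low_rank_fusion(audio_feat, text_feat, audio_factors, text_factors):
    """Fuse two modalities as sum over rank r of (z_a W_a[r]) * (z_t W_t[r]).

    A constant 1 is appended to each input so that single-modality terms
    survive the elementwise product.
    """
    z_a = list(audio_feat) + [1.0]
    z_t = list(text_feat) + [1.0]
    out = len(audio_factors[0][0])
    fused = [0.0] * out
    for w_a, w_t in zip(audio_factors, text_factors):
        p_a = project(z_a, w_a)
        p_t = project(z_t, w_t)
        for j in range(out):
            fused[j] += p_a[j] * p_t[j]
    return fused


def random_factors(rank, dim_in, dim_out, rng):
    """Untrained factors of shape rank x (dim_in + 1) x dim_out, for illustration."""
    return [
        [[rng.uniform(-1.0, 1.0) for _ in range(dim_out)] for _ in range(dim_in + 1)]
        for _ in range(rank)
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    audio_factors = random_factors(rank=2, dim_in=3, dim_out=4, rng=rng)
    text_factors = random_factors(rank=2, dim_in=2, dim_out=4, rng=rng)
    print(low_rank_fusion([0.1, -0.2, 0.3], [0.5, 0.4], audio_factors, text_factors))
```

The result equals applying a full audio-by-text interaction tensor built from these factors, without ever materialising that tensor, which is the main appeal of a low-rank formulation.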

Team members

Gaurav, Aseem, Rishav

