Skip to content

Latest commit

 

History

History
26 lines (18 loc) · 741 Bytes

File metadata and controls

26 lines (18 loc) · 741 Bytes

Real or Not? NLP with Disaster Tweets Challenge

Capstone project for Udacity Machine Learning Engineer Nanodegree

Dataset

The training and testing datasets for the NLP with Disaster Tweets can be downloaded from Kaggle's competition webpage.

Note: The notebook assumes data files are stored in data directory.

Software Requirements

  • Python >= 3.6

The libraries used are:

  • numpy
  • pandas
  • seaborn
  • plotly.express
  • wordcloud
  • re
  • nltk
  • sklearn
  • xgboost

It is recommended to use XGBoost with GPU support for better performance. If you need more information on that, please read here.