A list of practical Machine Learning or AI projects for learners
These projects are divided in multiple categories, same problem may appear in more than one categories.
Tags:
🐳 Deep learning suitable ideas
💻 Data will probably fit on your laptop
☁️ Might need cloud service. Recommended for those with programming experience
Problems are motivated by the ones shared at:
- Text - include NLP tasks here for now
- Vision - includes image and video processing
- Forecasting - mostly Time Series and similar forecasting challenges
- Recommender Systems
💻 Can you classify the text of an e-mail message to decide who sent it? The Enron E-mail data set contains about 500,000 e-mails from about 150 users. The data set is available here: Enron Data
☁️ 🐳 Autonomous Tagging of Stack Overflow Questions Make a multi-label classification system that automatically assigns tags for questions posted on a forum such as Stackoverflow or Quora. Find StackLite for your 💻 or 10% sample for ☁️
☁️ 🐳 💻 Sentiment Analysis on Tweets or Long text such as news items, Quora answers
☁️ 🐳 Identify keywords from millions of text questions such as those on Stackoverflow for a Facebook Data Science Recruitment Challenge? Find the data on Kaggle
💻 🐳 Automated Essay Grading The purpose of this project is to implement and train machine learning algorithms to automatically assess and grade essay responses. These grades from the automatic grading system should match the human grades consistently
💻 🐳 Quora Question Pairs Can you identify question pairs that have the same intent? Data via Kaggle
🐳 🐋 Machine Translation
🐳 🐋 Text Generation Generate plausible text sequences for a given problem. You can use the Alice in Wonderland text from Project Gutenberg. Check mlm/blog for further details
Classify/tag images from a dataset such as CIFAR-10, ImageNet or MS Coco into classes such as dog, cat, horse. You can try to create an object recognition system which can identify which object category is the best match for a given test image. Apply clustering to learn object categories without supervision
💻 🐳 MNIST Handwritten Digit Classification Challenge is the classic entry point. The MNIST data is beginner-friendly and is small enough to fit on one computer.
💻 🐳 Character recognition (digits) is the good old Optical Character Recognition problem. Stanford dataset is a good starting point.
🐳 Bird Species Identification from an Image using the Caltech-UCSD Birds dataset
🐳 ☁️ Diagnosing and Segmenting Brain Tumors and Phenotypes using MRI Scans using the MICCAI Machine Learning Challenge aka MLC 2014 dataset
Rainfall prediction Learn a probabilistic model to predict rain levels. 45 years of daily precipitation data from the Northwest of the US mentioned here is good for for getting started.
💻 Pollution Level Forecasting using Multi-variate Time Series forecasting on the Air Quality dataset
💻 Home Electricity Forecasting Find a short term forecast on electricity consumption of a single home. Find the dataset here
☁️ Movie Recommender Can you predict the rating a user will give on a movie? Do this using the movies that user has rated in the past, as well as the ratings similar users have given similar movies. The data is available here: Netflix Prize
💻 Best Buy Search + Recommendation System Predict which Xbox game a visitor will be most interested in based on their search query using the BestBuy dataset on Kaggle
💻 Can you predict Influencers in the Social Network? In the Facebook/Twitter era, it's extremely useful to find influencers for targeting advertising. Trying finding them via the PeerIndex dataset
-
Can I use the ideas here for my thesis? Yeah, totally. I'd love to know how it went.
-
Do you want to share my solution/code to a problem here? Yeah, sure - why not? Go to Github issues in the repository and let me know there.
-
How can I add my ideas here? Just send a pull request and we'll discuss?
-
Hey @NirantK, something is wrong here! Yikes, I am sorry. Please tell me by raising a Github issue. I'll try to fix it as soon as possible.
This repo was compiled by Nirant Kasliwal