Movie-recommender

Phase 4 project

Group Members

Elsie Juma
Iain Mosima
Ibrahim Hafiz
Elsie Kiprop
Davis Obatsa
Peter Kigotho
Oscar Karuga

Business Understanding

Cinemy, a movie streaming company, asked its users to rate the service on the Google Play Store. The feedback showed that the movies recommended to users did not match their interests, leaving most customers dissatisfied. Cinemy has approached us, a data analytics company, to help solve this problem. We will therefore build a movie recommender system that suggests the top 5 movies to each user of the streaming site, based on their ratings and the genres they prefer.

Metrics for success

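We will use RMSE (root mean squared error) and MAE (mean absolute error) as our metrics for success. Both measure how far predicted ratings fall from actual ratings, so the model with the lowest scores will be our best model.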
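As a minimal sketch of how these two metrics are computed, the functions below follow the standard definitions of RMSE and MAE. The function names and the sample ratings are illustrative assumptions, not code or data from this project.

```python
import math

def rmse(actual, predicted):
    """Root mean squared error: penalises large misses more heavily."""
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))

def mae(actual, predicted):
    """Mean absolute error: the average size of a miss, in rating stars."""
    absolute_errors = [abs(a - p) for a, p in zip(actual, predicted)]
    return sum(absolute_errors) / len(absolute_errors)

# Hypothetical ratings, for illustration only.
actual = [4.0, 3.5, 5.0, 2.0]
predicted = [3.5, 3.0, 4.0, 2.5]

print(f"RMSE: {rmse(actual, predicted):.4f}")
print(f"MAE:  {mae(actual, predicted):.4f}")
```

RMSE is never smaller than MAE on the same predictions, and the gap between them widens when a few predictions are badly off.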

Data Understanding

The data is sourced from the MovieLens dataset published by the GroupLens research lab at the University of Minnesota. It contains 100836 ratings and 3683 tag applications across 9742 movies, created by 610 users. The dataset is distributed among four CSV files:

1. links.csv
2. movies.csv
3. ratings.csv
4. tags.csv

1.Movies.csv

Each line of this file after the header row represents one movie, and has the following columns:

movieId: Unique id for each movie
title: Name of the movie followed by its year of release
genres: Categories that a movie might fall into, separated by |
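Because the title column carries the release year and the genres column packs several categories into one string, both usually need to be split before use. The sketch below shows one way to do that with the standard library. It assumes the year appears as four digits in parentheses at the end of the title; `parse_movie` and the sample values are hypothetical and are not taken from the project's functions.py.

```python
import re

# Assumption: titles end with a four-digit year in parentheses, e.g. "Name (1995)".
TITLE_YEAR = re.compile(r"^(?P<name>.*?)\s*\((?P<year>\d{4})\)\s*$")

def parse_movie(title, genres):
    """Split a raw title into (name, year) and a genre string into a list."""
    match = TITLE_YEAR.match(title)
    if match:
        name, year = match.group("name"), int(match.group("year"))
    else:
        # No recognisable year: keep the title as-is and leave the year unknown.
        name, year = title.strip(), None
    return name, year, genres.split("|")

# Illustrative inputs only.
print(parse_movie("Toy Story (1995)", "Adventure|Animation|Children"))
print(parse_movie("Some Untitled Film", "Drama"))
```

Returning `None` for a missing year keeps such rows in the data rather than dropping them silently.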

2.Links.csv

The file links.csv contains identifiers that can be used to link this data to other data sources such as IMDb. Each line of this file after the header row represents one movie, and has the following columns:

movieId: Unique id for each movie as used by https://movielens.org.
imdbId: Unique id for each movie as used by http://www.imdb.com.
tmdbId: Unique id for each movie as used by https://www.themoviedb.org.

3.Tags.csv

Each line of this file after the header row represents one tag applied to one movie by one user, and has the following columns:

userId: Unique id for each user
movieId: Unique id for each movie
tag: User-generated metadata about the movie, in the form of a short, meaningful phrase
timestamp: Time when tag was provided by user

4.Ratings.csv

Each line of this file after the header row represents one rating, and has the following columns:

userId: Unique id for each user
movieId: Unique id for each movie
rating: Rating given by userId for movieId. Ratings are made on a 5-star scale with 0.5 increments.
timestamp: Time when rating was given
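The columns above can be read and sanity-checked with a few lines of standard-library Python. This is a sketch under stated assumptions: that ratings range from 0.5 to 5.0, and that the timestamp is stored as seconds since the Unix epoch in UTC, as in the MovieLens distribution. `load_ratings` and the inline sample rows are illustrative, not the project's own loader.

```python
import csv
import io
from datetime import datetime, timezone

# Small inline sample in the ratings.csv layout, for illustration only.
SAMPLE = """userId,movieId,rating,timestamp
1,1,4.0,964982703
1,3,4.5,964981247
"""

def load_ratings(handle):
    """Read rating rows, check the rating scale, and decode the timestamp."""
    rows = []
    for record in csv.DictReader(handle):
        rating = float(record["rating"])
        # Assumption: valid ratings are 0.5 to 5.0 in steps of 0.5.
        if not (0.5 <= rating <= 5.0 and (rating * 2).is_integer()):
            raise ValueError(f"unexpected rating value: {rating}")
        rows.append({
            "userId": int(record["userId"]),
            "movieId": int(record["movieId"]),
            "rating": rating,
            # Assumption: timestamp is seconds since the Unix epoch, UTC.
            "rated_at": datetime.fromtimestamp(int(record["timestamp"]), tz=timezone.utc),
        })
    return rows

ratings = load_ratings(io.StringIO(SAMPLE))
for row in ratings:
    print(row["userId"], row["movieId"], row["rating"], row["rated_at"].isoformat())
```

To run this against the real file, pass an open file handle for ratings.csv in place of the `io.StringIO` sample.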
