GROUP MEMBERS | GITHUB | |
---|---|---|
1. | MAUREEN WAMBUGU | https://github.com/Mau-Wambugu |
2. | STEPHEN WAWERU | https://github.com/stendewa |
3. | LYDIA NJERI | https://github.com/lydiahsherry23 |
4. | LILIAN MULI | https://github.com/mwikali24 |
5. | PETER MAINA | https://github.com/Mr-PeterMaina |
- Recommendation systems are powerful tools that use machine learning algorithms to provide relevant suggestions to users based on behaviour patterns or user data.
- A Book Recommendation System is a recommendation system where we recommend similar books to readers based on their interests. Recommendation systems help drive engagement, increase sales, increases revenue and this in return brings in loyal clients as the customer experience is elevated promoting customer satisfaction. We have 2 main recommendation system models:
-
- Collaborative filtering
-
- Content-based Filtering
BUSINESS PROBLEM
- Over the past years, there has been rise in huge ecommerce and online services leading clients facing difficulty when searching for the right products.
- Clients looking to purchase books also face the same struggle when trying to match the right books with their taste and preferences.
- The Business Problem is to develop a recommendation system that recommends books that are tailored to our users preferences inorder to improve customer experience and engagement.
PROJECT OBJECTIVE
- To build a book recommendation system that provides personalized suggestions to our users.
- Improve sales by showcasing books a user is most likely to buy.
- Offer relevant books to users inorder to improve customer retention.
- Increase customer engagement
The analysis aims to achieve these objectives by considering the following analysis questions:
-
Which authors consistently receive higher ratings from users?
-
How does the year of publication affect the average ratings of books? Are newer books preferred over older ones?
-
How accurate are the recommendations generated by the collaborative filtering model especially in terms of precision?
-
How does class imbalance in ratings affect the performance of the recommendation model?
DATA SOURCE
- We used data obtained from Kaggle mined by Cai-Nicolas Ziegler
- It contains 3 CSV Files:
- Books.csv - contains information about books{ISBN;Title;Author;Year;Publisher}
- Ratings.csv - contains book ratings provided by users that range from 0 to 10. {{User-ID;ISBN;Rating}}
- Users.csv - contains information about the users {User-ID;Age}
STAKEHOLDERS
- Customers
- As the end user, they expect accurate book suggestions based on personal interests.
- Marketing team
- They would want to do targeted advertising on specific books and also promote personalized offers.
- Data scientist
- Interested in ensuring the recommendation system models are accurate and scalable.
- Book Authors
- They would be interested in knowing how their books are recommended inorder to learn and understand their readers taste and preferences.
- Executive {CEO}
- They would want to understand how recommendation systems impact revenue and customer retention comparing it to the budget allocated to the project.
METHODOLOGY
- Our project will focus on the CRISP-DM:
- Business Understanding
- Data Understanding
- Data Preparation
- Modeling
- Evaluation
- Deployment
DATA SOURCE
- We used data obtained from Kaggle mined by Cai-Nicolas Ziegler
- It contains 3 CSV Files:
- Books.csv - contains information about books{ISBN;Title;Author;Year;Publisher}
- Ratings.csv - contains book ratings provided by users that range from 0 to 10. {{User-ID;ISBN;Rating}}
- Users.csv - contains information about the users {User-ID;Age}
STAKEHOLDERS
- Customers
- As the end user, they expect accurate book suggestions based on personal interests.
- Marketing team
- They would want to do targeted advertising on specific books and also promote personalized offers.
- Data scientist
- Interested in ensuring the recommendation system models are accurate and scalable.
- Book Authors
- They would be interested in knowing how their books are recommended inorder to learn and understand their readers taste and preferences.
- Executive {CEO}
- They would want to understand how recommendation systems impact revenue and customer retention comparing it to the budget allocated to the project.
METHODOLOGY
- Our project will focus on the CRISP-DM:
- Business Understanding
- Data Understanding
- Data Preparation
- Modeling
- Evaluation
- Deployment
- Univariate data analysis
- Bivariate analysis
In our analysis we used collaborative filtering to build a recommendation system.
In our evaluation we focused majorly on precision score,recall score,f1 score,accuracy score
- Effectiveness of collaborative filtering
- Collaborative filtering using Singular Value Decomposition {SVD} can effectively predict book ratings for users based on their historical preferences and those of similar users.
- Model performance evaluation
- The model has a resonable predictive accuracy score since RMSE= 3.4219 and MAE = 2.7470
- The model met our goal of having a precision score of atleast 75 % as it had a Precision of 79% measures the proportion of relevant recommendations that were actually correct i.e how many of the recommended books were truly relevant.
- However the model has a low recall score of 0.0814, this indicates most relevant books are not captured by the model.Thus it requires further attention to enhance the model quality.
- User - Item based collaborative filtering
- User-User method effectively identifies similar users and their preferences.
- Item based approach using SVD provided personalized recommendations based on predicted rating for unseen books.
- Our analysis shows that the same user has different recommendations based on the two different approaches.
- Item - based approach
- Based on this approach it is evident that highly rated books are preferred by most users.
- Enhanced personalization and feedback mechanism.
- Utilize data for targeted marketing.
- Business should feature highly rated books.
- Use top rated labels to highlight highly rated books in marketing campaigns.
-
Explore Alternative Algorithms: Investigate the performance of other recommendation algorithms, such as neural collaborative filtering or hybrid approaches, to compare their effectiveness against the current SVD model.
-
Content-Based Recommendations: Explore the potential for a content-based recommendation system that utilizes book attributes, such as author, to complement collaborative filtering methods and provide more holistic recommendations.