Recommender_Yahoo_Music

Explorations in collaborative filtering and recommender systems with Yahoo! Music ratings.

Acquiring the Data

I use the following dataset, which can be acquired from Yahoo:

Yahoo! Music User Ratings of Musical Artists, version 1.0 (423 MB)

This dataset represents a snapshot of the Yahoo! Music community's preferences for various musical artists. It contains over ten million ratings of musical artists given by Yahoo! Music users over the course of a one-month period sometime prior to March 2004. Users are represented as meaningless anonymous numbers so that no identifying information is revealed. The dataset may be used by researchers to validate recommender systems or collaborative filtering algorithms, and it may serve as a testbed for matrix and graph algorithms, including PCA and clustering algorithms.

Creating the Database

The steps used to create the database and import the data into it are in the scripts/initialize_db.sh file of this repository. It's short enough that I've pasted it below:

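#!/bin/bash

# Edit the line below to reflect your own path
# (bash assignments must not have spaces around "=")
REPODIR=/home/btq/GitHub/Recommender_Yahoo_Music
cd "$REPODIR" || exit 1

# Create the database
createdb ymusic_data

# Create the tables and define the schema
psql ymusic_data -f scripts/create_ymusic_schema.sql

# Drop the first two lines of the artist-names file by keeping the last 97954 lines
tail -n 97954 data/ydata-ymusic-artist-names-v1_0.txt > data/ydata-ymusic-artist-names-v2_0.txt

# Import the data files into the tables
psql ymusic_data -f scripts/import_ymusic_data.sql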

The steps performed in this script are:

1. Create the ymusic_data database with the createdb command
2. Create 3 tables and define the schema
3. Remove the first two lines of the artist-names file
4. Import the data files into the respective psql tables
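
The header-removal step depends on knowing the exact number of data lines (97954). As a minimal sketch of an alternative, `tail -n +3` starts output at the third line and so drops the first two lines whatever the file length. The `strip_header` helper, the temporary file names, and the tab-separated sample rows below are all hypothetical and only illustrate the idea; they are not part of the repository's script.

```shell
#!/bin/bash
# Hypothetical helper: copy a file without its first N lines (default 2).
strip_header() {
    local src="$1" dst="$2" skip="${3:-2}"
    # "tail -n +K" starts output at line K, so skipping N lines means K = N + 1
    tail -n "+$((skip + 1))" "$src" > "$dst"
}

# Demo on a throwaway file (made-up rows), not on the real dataset
tmpdir="$(mktemp -d)"
printf 'header 1\nheader 2\n1\tArtist A\n2\tArtist B\n' > "$tmpdir/names-v1.txt"
strip_header "$tmpdir/names-v1.txt" "$tmpdir/names-v2.txt"
cat "$tmpdir/names-v2.txt"
```

With the real file, the equivalent call would take data/ydata-ymusic-artist-names-v1_0.txt as the source and data/ydata-ymusic-artist-names-v2_0.txt as the destination.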

NOTE: The avg_rating column in the ym_artist and ym_user tables is deceptive. A rating of 255 means "never play again", but it is averaged in as if it were a very high rating, so a very disliked artist could have an avg_rating greater than 100. Recoding 255 as 0 before averaging would probably give a range that makes more sense.
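
To make the effect of the 255 value concrete, here is a small sketch that computes a per-artist average twice: once naively and once with 255 recoded to 0. The sample rows are made up, and the column layout (user id, artist id, rating, separated by tabs) is an assumption for illustration; the `artist_averages` function is hypothetical and is not how the repository computes avg_rating.

```shell
#!/bin/bash
# Made-up sample: user_id <TAB> artist_id <TAB> rating (layout is an assumption)
ratings="$(mktemp)"
printf '1\t10\t90\n2\t10\t255\n3\t10\t255\n1\t20\t50\n2\t20\t70\n' > "$ratings"

# Hypothetical helper: print artist_id, naive average, average with 255 -> 0
artist_averages() {
    awk -F'\t' '
        {
            n[$2]++                              # number of ratings per artist
            raw[$2] += $3                        # naive sum, 255 counted as-is
            adj[$2] += ($3 == 255 ? 0 : $3)      # adjusted sum, 255 counted as 0
        }
        END {
            for (a in n) printf "%s\t%.1f\t%.1f\n", a, raw[a] / n[a], adj[a] / n[a]
        }
    ' "$1" | sort -n
}

artist_averages "$ratings"
```

In this sample, artist 10 has one rating of 90 and two of 255, so its naive average is 200.0 while the adjusted average is 30.0. Artist 20 has no 255 ratings, so both averages are 60.0.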

Making Predictions

See the Yahoo_Music_User_Based_Recommender.ipynb notebook for an analysis of the data and an explanation of how to make predictions.
