How Can We Compare Song Audio Files and Classify them?

Research Question

Since we’ve never worked with audio data or classification of audio data we wanted to try working with data that is structured as such. We ask the question: how do the audio features from songs, specifically Spotify Tracks compare to each other?

Is there a relationship between the some of these features such as tempo correlating with danceability/energy/liveness and if so how are they correlated. Additionally, how can we use these features to cluster songs based on these audio tracks of songs being coverted to numeric features?

Hypothesis

Certain audio features will be statistically different between the distribution of certain genres. These differences in distributions will allow us to perform Unsupervised Learning on the data to cluster the songs into different groups / listening personas.

For example, the mean "tempo" of Pop Artists will be higher than that of Ballad Singers since Pop songs tend to be more upbeat and fast.

If numerical data is extracted from the songs then models can be trained to cluster / classify songs into different groups since there will be enough difference between certain features between certain groups. This approach of comparing audio features between two groups can then be applied to other projects such as comparisons of living beings/objects to classify the two.

Overview

This project consists of the following components:

Audio Feature Extraction: We extract audio features on the Top Charting Songs from Spotify's API
Model Selection: Performing different Supervised and Unsupervised Learning algorithms
Evaluation: We evaluate the supervised model's performance using accuracy, precision, recall, and F1-score metrics.
Adaptability: The project is designed to be easily adaptable for other audio classification / clustering tasks.

Requirements

Python 3.8+
librosa
TensorFlow
Keras
NumPy
pandas
scikit-learn
Optional: Cuda (if training on an Nvidia GPU)

Installation

To install the project and its dependencies, follow these steps:

Clone the repository:

git clone https://github.com/COGS108/Group_Sp23_The_group_chat

Change to the project directory:

cd Group_Sp23_The_group_chat

Install the required dependencies:

pip install -r requirements.txt

Further Applications

This project can be easily adapted to other audio classification tasks, such as:

Identifying speakers in a conversation
Detecting emotions in speech
Recognizing animal sounds or bird calls

To adapt the project, simply prepare your data and follow the usage instructions outlined above.

Contributors

We appreciate all contributions to this project. If you would like to contribute, please open an issue or submit a pull request on the project's GitHub repository.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
src		src
COGS 108 Final Slides.pdf		COGS 108 Final Slides.pdf
README.md		README.md
checklist.md		checklist.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How Can We Compare Song Audio Files and Classify them?

Research Question

Hypothesis

Table of Contents

Overview

Requirements

Installation

Further Applications

Contributors

About

Releases

Packages

Languages

mateoign/Spotify-Recommender

Folders and files

Latest commit

History

Repository files navigation

How Can We Compare Song Audio Files and Classify them?

Research Question

Hypothesis

Table of Contents

Overview

Requirements

Installation

Further Applications

Contributors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages