Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
preprocessor.py		preprocessor.py

README.md

LOFI Dataset

The training dataset is synthesized using multiple sources.

Chords and melodies are obtained from Hooktheory
Lyrics are obtained by scraping Google or Musixmatch
Audio features are obtained using the Spotify API

To build the training set and add lyrics and audio features:

Download the Hooktheory dataset from this repo and copy the event folder into this directory, renaming it hooktheory.
Register application at the Spotify Developer Dashboard.
Write client id into the file spotify_client_id and secret into spotify_client_secret.
Set add_lyrics (true/false), add_spotify (true/false) and lyrics_provider (google/musixmatch) inside prepocessor.py.
Run python prepocessor.py.
The dataset will be built into the folder processed.

You don't need lyrics if you are running Lofi2Lofi only. This will create a larger dataset, as tracks with no lyrics will get discarded if add_lyrics is true

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dataset

dataset

README.md

LOFI Dataset

Files

dataset

Directory actions

More options

Directory actions

More options

Latest commit

History

dataset

Folders and files

parent directory

README.md

LOFI Dataset