Skip to content

marianaossilva/DSW2019

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 

Repository files navigation

DSW 2019 - DATASET SHOWCASE WORKSHOP GitHub repo size

MusicOSet - An Enhanced Music Dataset for Music Data Mining

Dataset Information

This repository stores an open and enhanced dataset of musical elements (music, albums, and artists) suitable for music data mining.

The attractive features of MusicOSet include:

  • Integration and centralization of different musical data sources
  • Calculation of popularity scores and classification of hits and non-hits musical elements, varying from 1962 to 2018
  • Enriched metadata for music, artists, and albums from the US popular music industry
  • Availability of acoustic and lyrical resources
  • Unrestricted access in two formats: SQL database and compressed .csv files

Dataset Statistics


Data # Records
Songs 20,405
Artists 11,518
Albums 26,522
Lyrics 19,664
Acoustic Features 20,405
Genres 1,561

Schema

Format and Usage

MusicOSet is available in a public repository in two different formats

  1. Relational Database
    • musicoset.sql: SQL file that will create the relational database and subsequently loads all the information in the tables by a MySQL installation (233MB)
  2. .csv Tables

Applicability

Source (citation)

@InProceedings{silva2019musicoset,
title     = {{MusicOSet: An Enhanced Open Dataset for Music Data Mining}},
author    = {Silva, Mariana O. and Rocha, La\'{\i}s M. and Moro, Mirella M.},
booktitle = {{XXXIV} Simp{\'{o}}sio Brasileiro de Banco de Dados: Dataset Showcase Workshop, {SBBD} 2019 Companion},
address   = {Fortaleza, CE, Brazil},
year      = {2019}
}

License

  • The dataset is meant for research purposes.

Acknowledgments

The work is supported by CNPq, Brazil.

About

MusicOSet - An Enhanced Music Dataset for Music Data Mining

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published