SAMBASET: a dataset of historical samba de enredo recordings for computational music analysis

Maia, Lucas S. - Fuentes, Magdalena - Biscainho, Luiz W. P. - Rocamora, Martín - Essid, Slim

Abstract:

In the last few years, several datasets have been released to meet the requirements of “hungry” yet promising data-driven approaches in music technology research. Since, for historical reasons, most investigations conducted in the field still revolve around music of the so-called “Western” tradition, the corresponding data, methodology and conclusions carry a strong cultural bias. Music of non-“Western” background, whenever present, is usually underrepresented, poorly labeled, or even mislabeled, the exception being projects that aim specifically at describing such music. In this paper we present SAMBASET, a dataset of Brazilian samba music that contains over 40 hours of historical and modern samba de enredo commercial recordings. To the best of our knowledge, this is the first dataset of this genre. We describe the collection of metadata (e.g. artist, composer, release date) and outline our semi-automatic approach to the challenging task of annotating beats in this large dataset, which includes assessing the performance of state-of-the-art beat tracking algorithms for this specific case. Finally, we present a study on tempo and beat tracking that illustrates SAMBASET's value, and we comment on other tasks for which it could be used.


Bibliographic Details
2019
English
Universidad de la República
COLIBRI
https://hdl.handle.net/20.500.12008/21751
Open access
Creative Commons Attribution License (CC BY)