Selective annotation of few data for beat tracking of Latin American music using rhythmic features.

Maia, Lucas Simões - Rocamora, Martín - Biscainho, Luiz W. P. - Fuentes, Magdalena

Resumen:

Training state-of-the-art beat tracking models usually requires large amounts of annotated data. It is widely known that data annotation is a time-consuming process and generally involves expert knowledge in the context of MIR. This can be particularly challenging if we consider culture-specific datasets. Previous research has shown that, under certain homogeneity conditions, it is possible to obtain good tracking results with these models using few training datapoints. However, this shifts the problem to that of the selection of these data. In this paper, we propose a methodology for selectively annotating meaningful samples from a dataset with the objective of training a beat tracker. We extract a rhythmic feature from each track and apply selection methods in the feature space limited by a budget of samples to be annotated. We then train a TCN-based state-of-the-art model using the selected data. The trained model is shown to perform well on the remainder of the dataset when compared to random selection. We hope that our study will alleviate the annotation process of culture-specific datasets and ultimately help build a more culturally diverse perspective in the field of Music Information Retrieval.


Detalles Bibliográficos
2024
Este trabajo fue parcialmente apoyado por la Coordinación para el Perfeccionamiento del Personal de Educación Superior – Brasil (CAPES) – Código de Finanzas 001
El Consejo Nacional de Desarrollo Científico y Tecnológico (CNPq) – números de subvención 141356/2018-9 y 311146/2021-0
El Sistema Nacional de Investigadores – Agencia Nacional de Investigación e Innovación (SNI-ANII)
Beat tracking
Selective annotation
Rhythmic description
Inglés
Universidad de la República
COLIBRI
https://transactions.ismir.net/articles/10.5334/tismir.170
https://hdl.handle.net/20.500.12008/43862
Acceso abierto
Licencia Creative Commons Atribución (CC - By 4.0)
Resumen:
Sumario:Training state-of-the-art beat tracking models usually requires large amounts of annotated data. It is widely known that data annotation is a time-consuming process and generally involves expert knowledge in the context of MIR. This can be particularly challenging if we consider culture-specific datasets. Previous research has shown that, under certain homogeneity conditions, it is possible to obtain good tracking results with these models using few training datapoints. However, this shifts the problem to that of the selection of these data. In this paper, we propose a methodology for selectively annotating meaningful samples from a dataset with the objective of training a beat tracker. We extract a rhythmic feature from each track and apply selection methods in the feature space limited by a budget of samples to be annotated. We then train a TCN-based state-of-the-art model using the selected data. The trained model is shown to perform well on the remainder of the dataset when compared to random selection. We hope that our study will alleviate the annotation process of culture-specific datasets and ultimately help build a more culturally diverse perspective in the field of Music Information Retrieval.