Selective annotation of few data for beat tracking of Latin American music using rhythmic features.
Resumen:
Training state-of-the-art beat tracking models usually requires large amounts of annotated data. It is widely known that data annotation is a time-consuming process and generally involves expert knowledge in the context of MIR. This can be particularly challenging if we consider culture-specific datasets. Previous research has shown that, under certain homogeneity conditions, it is possible to obtain good tracking results with these models using few training datapoints. However, this shifts the problem to that of the selection of these data. In this paper, we propose a methodology for selectively annotating meaningful samples from a dataset with the objective of training a beat tracker. We extract a rhythmic feature from each track and apply selection methods in the feature space limited by a budget of samples to be annotated. We then train a TCN-based state-of-the-art model using the selected data. The trained model is shown to perform well on the remainder of the dataset when compared to random selection. We hope that our study will alleviate the annotation process of culture-specific datasets and ultimately help build a more culturally diverse perspective in the field of Music Information Retrieval.
2024 | |
Este trabajo fue parcialmente apoyado por la Coordinación para el Perfeccionamiento del Personal de Educación Superior – Brasil (CAPES) – Código de Finanzas 001 El Consejo Nacional de Desarrollo Científico y Tecnológico (CNPq) – números de subvención 141356/2018-9 y 311146/2021-0 El Sistema Nacional de Investigadores – Agencia Nacional de Investigación e Innovación (SNI-ANII) |
|
Beat tracking Selective annotation Rhythmic description |
|
Inglés | |
Universidad de la República | |
COLIBRI | |
https://transactions.ismir.net/articles/10.5334/tismir.170
https://hdl.handle.net/20.500.12008/43862 |
|
Acceso abierto | |
Licencia Creative Commons Atribución (CC - By 4.0) |
Sumario: | Training state-of-the-art beat tracking models usually requires large amounts of annotated data. It is widely known that data annotation is a time-consuming process and generally involves expert knowledge in the context of MIR. This can be particularly challenging if we consider culture-specific datasets. Previous research has shown that, under certain homogeneity conditions, it is possible to obtain good tracking results with these models using few training datapoints. However, this shifts the problem to that of the selection of these data. In this paper, we propose a methodology for selectively annotating meaningful samples from a dataset with the objective of training a beat tracker. We extract a rhythmic feature from each track and apply selection methods in the feature space limited by a budget of samples to be annotated. We then train a TCN-based state-of-the-art model using the selected data. The trained model is shown to perform well on the remainder of the dataset when compared to random selection. We hope that our study will alleviate the annotation process of culture-specific datasets and ultimately help build a more culturally diverse perspective in the field of Music Information Retrieval. |
---|