Indian Art Music Raga Recognition Dataset

Introduction

Rāga recognition datasets comprise two sizable datasets, one for each music tradition, Carnatic music dataset (CMD) and, Hindustani music dataset (HMD).These datasets comprise full length audio recordings and their associated rāga labels. These two datasets can be used to develop and evaluate approaches for performing automatic rāga recognition in Indian art music. To the best of our knowledge, these are the largest
and the most comprehensive (in terms of the available metadata) datasets ever used for studying this task. For more information about the dataset we refer to Chapter 3 of this thesis.

These datasets are derived from the CompMusic corpora of Indian art music, for which each recording is associated with a MBID.

Please cite the following publications if you use the material shared here in your research work.

[1]. Gulati, S., Serrà, J., Ganguli, K. K., ¸Sentürk, S., & Serra, X. (2016). Time-delayed melody surfaces for raga recognition. In Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR), pp. 751–757. New York, USA.
[Postprint PDF]

[2]. Gulati, S., Serrà, J., Ishwar, V., ¸Sentürk, S., & Serra, X. (2016). Phrase-based raga recognition using vector space modeling. In Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 66–70. Shanghai, China.
Postprint PDF]

Datasets

[Dataset (features)]

[Dataset (audios)]

Annotation Format

We provide both tsv files and json files that contain information about each audio recording in terms of its mbid, the path of the audio/feature files and the associated rāga identifier. Each rāga is assigned a unique identifier by Dunya, which is similar to the mbid in terms of purpose. We also provide a mapping of the rāga id to its transliterated name.