Carnatic Music Rhythm Dataset
CompMusic Carnatic Rhythm Dataset is a rhythm annotated test corpus for automatic rhythm analysis tasks in Carnatic Music. The collection consists of audio excerpts from the CompMusic Carnatic research corpus, manually annotated time aligned markers indicating the progression through the taala cycle, and the associated taala related metadata. A brief description of the dataset is provided below. For a brief overview and audio examples of taalas in Carnatic music, please see http://compmusic.upf.edu/examples-taala-carnatic
Reference:
Please cite the following publications if you use the dataset in your work:
- Srinivasamurthy, A., Holzapfel, A., Cemgil, A. T., & Serra, X. (2015, October). Particle Filters for Efficient Meter Tracking with Dynamic Bayesian Networks. In Proceedings of the 16th International Society for Music Information Retrieval Conference (ISMIR 2015) (pp. 197–203). Malaga, Spain. (Subset)
- Srinivasamurthy, A., & Serra, X. (2014, May). A Supervised Approach to Hierarchical Metrical Cycle Tracking from Audio Music Recordings. In Proceedings of the 39th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014) (pp. 5237–5241). Florence, Italy. (Full dataset)
Download
This dataset can be donwloaded here.
The Dataset
Tāla | Beats, Aksharas |
# Pieces |
Total minutes (hours) | Median length of a piece (min) | # Annotated beats | # Samas | Annotated Positions in Tala Cycle |
Adi | 8, 32 | 50 | 252.78 (4.21) | 4.85 | 22793 | 2882 | 1,2,3,4,5,6,7,8 |
Rupaka | 3, 12 | 50 | 267.45 (4.45) | 4.62 | 22668 | 7582 | 1,2,3 |
Mishra Chapu | 7, 14 | 48 | 342.13 (5.7) | 6.59 | 31055 | 7795 | 1,2,3,4,5,6,7 |
Khanda Chapu | 5, 10 | 28 | 134.62 (2.24) | 4.41 | 13111 | 4387 | 1,2,3,4,5 |
Total | 176 | 996.98 (16.62) | 5.06 | 89627 | 22646 |
Audio music content
Annotations
1. Adi taala (32 aksharas)
2. Rupaka taala (12 aksharas) 3. Mishra Chapu taala (14 aksharas) 4. Khanda Chapu taala (10 aksharas)
Taala related metadata: For each excerpt, the taala of the piece, edupu (offset of the start of the piece, relative to the sama, measured in aksharas) of the composition, and the kalai (the cycle length scaling factor) are recorded. Each excerpt can be uniquely identified and located with the MBID of the recording, and the relative start and end times of the excerpt within the whole recording. A separate 5 digit taala based unique ID is also provided for each excerpt as a double check. The artist, release, the lead instrument, and the raaga of the piece are additional editorial metadata obtained from the release. A flag indicates if the excerpt is a full piece or only a part of a full piece. There are optional comments on audio quality and annotation specifics.
Possible uses of the dataset
Dataset organization
Data Subset
Tāla | Beats, Aksharas |
# Pieces |
Total minutes | Median length of a piece (min) | # Annotated beats | # Samas | Annotated Positions in Tala Cycle |
Adi | 8, 32 | 30 | 58.87 | 2 | 5452 | 696 | 1,2,3,4,5,6,7,8 |
Rupaka | 3, 12 | 30 | 60 | 2 | 5148 | 1725 | 1,2,3 |
Mishra Chapu | 7, 14 | 30 | 60 | 2 | 8992 | 1299 | 1,2,3,4,5,6,7 |
Khanda Chapu | 5, 10 | 28 | 55.93 | 2 | 9133 | 1840 | 1,2,3,4,5 |
Total | 118 | 234.8 | 2 | 28725 | 5560 |