Dataset for Interspeech 2018 submission: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions

doi:10.5281/zenodo.1185124

Published February 27, 2018 | Version v1

Dataset Open

Dataset for Interspeech 2018 submission: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions

1. Music Technology Group - Universitat Pompeu Fabra

This dataset contains the materials for training, testing the joint and HSMM models mentioned in the paper "Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions".

The filename list of this dataset can be found in the function get_train_test_recordings_joint() of ./general/trainTestSeparation.py file. The dataset contains the Praat TextGrids and .wavs of the variables: train_primary_school, val_primary_school and test_primary_school. For accessing other datasets such as train_nacta_2017, train_nacta and train_sepa, please download them from the links:

jingju dataset part1: https://zenodo.org/record/1185154

jingju dataset part2: https://doi.org/10.5281/zenodo.842229

Once you have downloaded these three datasets, you need to set the paths in ./general/filePathShared.py.

Set path_jingju_dataset to the parent path of these three datasets.

Set primarySchool_dataset_root_path to the path of the interspeech2018 dataset (the current dataset).

Set nacta_dataset_root_path to the path of the jingju dataset part1.

Set nacta2017_dataset_root_path to the path the jingju dataset part2.

For more information on this paper, please refer to the Github page: https://github.com/ronggong/interspeech2018_submission01

Files

interspeech2018.zip

Files (641.6 MB)

Name	Size	Download all
interspeech2018.zip md5:07ca96cfc56f46cd3253330979b5c61d	641.6 MB	Preview Download

Additional details

COMPMUSIC – Computational models for the discovery of the world's music 267583: European Commission

	All versions	This version
Views	772	771
Downloads	113	113
Data volume	467.1 GB	467.1 GB

Dataset for Interspeech 2018 submission: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions

Creators

Description

Files

interspeech2018.zip

Files (641.6 MB)

Additional details

Funding