Score-Informed Syllable Segmentation for Jingju a Cappella Singing Voice with Mel-Frequency Intensity Profiles

Rong Gong; Nicolas Obin; Georgi Dzhambazov; Xavier Serra

Communication Dans Un Congrès Année : 2017

Score-Informed Syllable Segmentation for Jingju a Cappella Singing Voice with Mel-Frequency Intensity Profiles

(1) , (2) , (1) , (1)

1
2

Rong Gong

Fonction : Auteur
PersonId : 1006850

Music Technology Group

Nicolas Obin

Fonction : Auteur

Analyse et synthèse sonores [Paris]

Georgi Dzhambazov

Fonction : Auteur
PersonId : 1006852

Music Technology Group

Xavier Serra

Fonction : Auteur
PersonId : 1006853

Music Technology Group

Résumé

This paper introduces a new unsupervised and score-informed method for the segmentation of singing voice into syllables. The main idea of the proposed method is to detect the syllable onset on a probability density function by incorporating a priori syllable duration derived from the score. Firstly, intensity profiles are used to exploit the characteristics of singing voice depending on the Mel-frequency regions. Then, the syllable onset probability density function is obtained by selecting candidates over the intensity profiles and weighted for the purpose of emphasizing the onset regions. Finally, the syllable duration distribution shaped by the score is incorporated into Viterbi decoding to determine the optimal sequence of onset time positions. The proposed method outperforms conventional methods for the segmentation of syllable on a jingju (also known as Peking or Beijing opera) a cappella dataset. An analysis is conducted on precision errors to provide direction for future improvement.

Domaines

Traitement du signal et de l'image [eess.SP] Machine Learning [stat.ML]

Nicolas Obin : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-01513160

Soumis le : lundi 24 avril 2017-17:00:22

Dernière modification le : vendredi 24 mars 2023-14:53:04

Dates et versions

hal-01513160 , version 1 (24-04-2017)

Identifiants

HAL Id : hal-01513160 , version 1

Citer

Rong Gong, Nicolas Obin, Georgi Dzhambazov, Xavier Serra. Score-Informed Syllable Segmentation for Jingju a Cappella Singing Voice with Mel-Frequency Intensity Profiles. International Workshop on Folk Music Analysis, Jun 2017, Malaga, Spain. ⟨hal-01513160⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS IRCAM STMS SORBONNE-UNIVERSITE SU-SCIENCES

87 Consultations

0 Téléchargements

Score-Informed Syllable Segmentation for Jingju a Cappella Singing Voice with Mel-Frequency Intensity Profiles

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager