Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables - Sorbonne Université
Communication Dans Un Congrès Année : 2013

Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables

Résumé

This paper introduces novel paradigms for the segmentation of speech into syllables. The main idea of the proposed method is based on the use of a time-frequency representation of the speech signal, and the fusion of intensity and voicing measures through various frequency regions for the automatic selection of pertinent information for the segmentation. The time-frequency representation is used to exploit the speech characteristics depending on the frequency region. In this representation, intensity profiles are measured to provide in- formation into various frequency regions, and voicing profiles are measured to determine the frequency regions that are pertinent for the segmentation. The proposed method outperforms conventional methods for the detection of syllable landmark and boundaries on the T I M I T database of American-English, and provides a promising paradigm for the segmentation of speech into syllables.
Fichier principal
Vignette du fichier
ICASSP2013_NO_FL_AR.pdf (3.09 Mo) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00943799 , version 1 (08-02-2014)

Identifiants

  • HAL Id : hal-00943799 , version 1

Citer

Nicolas Obin, François Lamare, Axel Roebel. Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013, Vancouver, Canada. ⟨hal-00943799⟩
335 Consultations
348 Téléchargements

Partager

More