Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables - Sorbonne Université Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables

Résumé

This paper introduces novel paradigms for the segmentation of speech into syllables. The main idea of the proposed method is based on the use of a time-frequency representation of the speech signal, and the fusion of intensity and voicing measures through various frequency regions for the automatic selection of pertinent information for the segmentation. The time-frequency representation is used to exploit the speech characteristics depending on the frequency region. In this representation, intensity profiles are measured to provide in- formation into various frequency regions, and voicing profiles are measured to determine the frequency regions that are pertinent for the segmentation. The proposed method outperforms conventional methods for the detection of syllable landmark and boundaries on the T I M I T database of American-English, and provides a promising paradigm for the segmentation of speech into syllables.
Fichier principal
Vignette du fichier
ICASSP2013_NO_FL_AR.pdf (3.09 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00943799 , version 1 (08-02-2014)

Identifiants

  • HAL Id : hal-00943799 , version 1

Citer

Nicolas Obin, François Lamare, Axel Roebel. Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013, Vancouver, Canada. ⟨hal-00943799⟩
311 Consultations
328 Téléchargements

Partager

Gmail Facebook X LinkedIn More