Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables - Sorbonne Université
Conference paper. Year: 2013

Abstract

This paper introduces novel paradigms for the segmentation of speech into syllables. The proposed method relies on a time-frequency representation of the speech signal and on the fusion of intensity and voicing measures across frequency regions, so that the information pertinent to the segmentation is selected automatically. The time-frequency representation is used to exploit speech characteristics that depend on the frequency region. In this representation, intensity profiles are measured to provide information in the various frequency regions, and voicing profiles are measured to determine which frequency regions are pertinent for the segmentation. The proposed method outperforms conventional methods for the detection of syllable landmarks and boundaries on the TIMIT database of American English, and provides a promising paradigm for the segmentation of speech into syllables.
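The general idea in the abstract (band-wise intensity profiles, weighted by a per-band voicing measure, then fused into one profile whose peaks serve as syllable landmarks) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify the representation, the voicing measure, or the fusion rule. The sketch assumes a plain short-time Fourier transform, uses spectral flatness as a crude stand-in for voicing, and every name, band edge, and threshold below (`syllable_landmarks`, `voicing_weight`, the 0.1 threshold, the 100 ms minimum gap) is hypothetical.

```python
import numpy as np

def stft_power(x, sr, win=0.025, hop=0.010):
    """Power spectrogram with a Hann window; returns (frames x bins), bin freqs, frame times."""
    n, h = int(win * sr), int(hop * sr)
    n_frames = 1 + (len(x) - n) // h
    idx = np.arange(n)[None, :] + h * np.arange(n_frames)[:, None]
    frames = np.asarray(x, dtype=float)[idx] * np.hanning(n)
    spec = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    freqs = np.fft.rfftfreq(n, 1.0 / sr)
    times = (h * np.arange(n_frames) + n / 2) / sr  # frame centres in seconds
    return spec, freqs, times

def voicing_weight(band_power, eps=1e-12):
    """Assumed voicing proxy: 1 - spectral flatness (near 1 for harmonic frames, near 0 for flat ones)."""
    p = band_power + eps
    flatness = np.exp(np.log(p).mean(axis=1)) / p.mean(axis=1)
    return np.clip(1.0 - flatness, 0.0, 1.0)

def pick_landmarks(profile, times, threshold=0.1, min_gap=0.1):
    """Local maxima above `threshold`, kept greedily by height with a minimum spacing in seconds."""
    mid = profile[1:-1]
    is_peak = (mid > profile[:-2]) & (mid >= profile[2:]) & (mid > threshold)
    candidates = np.flatnonzero(is_peak) + 1
    chosen = []
    for i in candidates[np.argsort(profile[candidates])[::-1]]:
        if all(abs(times[i] - times[j]) >= min_gap for j in chosen):
            chosen.append(i)
    return np.sort(times[np.array(chosen, dtype=int)])

def syllable_landmarks(x, sr, edges=(0, 500, 1000, 2000, 4000, 8000)):
    """Fuse voicing-weighted band intensities and return landmark times (hypothetical band edges, in Hz)."""
    spec, freqs, times = stft_power(x, sr)
    fused = np.zeros(len(times))
    for lo, hi in zip(edges[:-1], edges[1:]):
        band = spec[:, (freqs >= lo) & (freqs < hi)]
        if band.shape[1] == 0:  # band lies above the Nyquist frequency
            continue
        fused += voicing_weight(band) * band.sum(axis=1)
    fused = np.convolve(fused, np.ones(5) / 5, mode="same")  # 50 ms moving average
    peak = fused.max()
    if peak > 0:
        fused = fused / peak
    return pick_landmarks(fused, times)
```

Calling `syllable_landmarks(x, sr)` on a mono signal `x` sampled at `sr` Hz returns an array of candidate landmark times in seconds. The default band edges assume 16 kHz audio, which is the sampling rate of TIMIT; detecting syllable boundaries, as opposed to landmarks, is not covered by this sketch.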
Main file: ICASSP2013_NO_FL_AR.pdf (3.09 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-00943799, version 1 (08-02-2014)

Identifiers

  • HAL Id: hal-00943799, version 1

Cite

Nicolas Obin, François Lamare, Axel Roebel. Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013, Vancouver, Canada. ⟨hal-00943799⟩