Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables

Nicolas Obin; François Lamare; Axel Roebel

Communication Dans Un Congrès Année : 2013

Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables

(1) , (1) , (1)

Nicolas Obin

Fonction : Auteur
PersonId : 7042
IdHAL : nicolas-obin
ORCID : 0000-0002-5236-5306
IdRef : 157523799

Sciences et Technologies de la Musique et du Son

François Lamare

Fonction : Auteur
PersonId : 952255

Sciences et Technologies de la Musique et du Son

Axel Roebel

Fonction : Auteur
PersonId : 4527
IdHAL : axel-roebel
ORCID : 0000-0001-6136-4391
IdRef : 227186079

Sciences et Technologies de la Musique et du Son

Résumé

This paper introduces novel paradigms for the segmentation of speech into syllables. The main idea of the proposed method is based on the use of a time-frequency representation of the speech signal, and the fusion of intensity and voicing measures through various frequency regions for the automatic selection of pertinent information for the segmentation. The time-frequency representation is used to exploit the speech characteristics depending on the frequency region. In this representation, intensity profiles are measured to provide in- formation into various frequency regions, and voicing profiles are measured to determine the frequency regions that are pertinent for the segmentation. The proposed method outperforms conventional methods for the detection of syllable landmark and boundaries on the T I M I T database of American-English, and provides a promising paradigm for the segmentation of speech into syllables.

Mots clés

speech segmentation syllable segmentation time- frequency representation information fusion

Domaines

Son [cs.SD] Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP] Linguistique

Fichier principal

ICASSP2013_NO_FL_AR.pdf (3.09 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Nicolas Obin : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-00943799

Soumis le : samedi 8 février 2014-20:55:01

Dernière modification le : vendredi 24 mars 2023-14:52:58

Archivage à long terme le : lundi 12 mai 2014-12:40:30

Dates et versions

hal-00943799 , version 1 (08-02-2014)

Identifiants

HAL Id : hal-00943799 , version 1

Citer

Nicolas Obin, François Lamare, Axel Roebel. Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013, Vancouver, Canada. ⟨hal-00943799⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS IRCAM STMS SORBONNE-UNIVERSITE SU-SCIENCES

335 Consultations

348 Téléchargements

Syll-O-Matic: an Adaptive Time-Frequency Representation for the Automatic Segmentation of Speech into Syllables

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager