Reports

Adaptation au locuteur pour la séparation de la parole par NMF (Speaker adaptation for speech separation by NMF)

Guillaume Doras 1
1 Analyse et synthèse sonores [Paris]
STMS - Sciences et Technologies de la Musique et du Son
Abstract : This master's thesis addresses the use of semi-supervised NMF (non-negative matrix factorization) for audio source separation, and speech separation in particular. Its main contribution is a speech separation method that adapts a model trained beforehand on a known speaker to an unknown speaker. First, it gives an overview of speech separation by semi-supervised NMF, including the current limits of these methods for an unknown speaker, together with the state of the art in proposed improvements. After discussing these approaches, it presents a new method for adapting the training performed with a known speaker to an unknown speaker, along with several constraints aimed at improving separation quality. Finally, the proposed adaptation model and the different constraints are evaluated and compared against the results obtained without speaker adaptation.

https://hal.sorbonne-universite.fr/hal-01482183
Contributor : Nicolas Obin
Submitted on : Friday, March 3, 2017 - 12:26:29 PM
Last modification on : Thursday, March 21, 2019 - 2:39:34 PM

Identifiers

  • HAL Id : hal-01482183, version 1

Citation

Guillaume Doras. Adaptation au locuteur pour la séparation de la parole par NMF. [Stage] STMS - Sciences et Technologies de la Musique et du Son UMR 9912 IRCAM-CNRS-UPMC. 2016. ⟨hal-01482183⟩
