Vocal cues in emotion encoding and decoding, Motivation and Emotion, vol.15, pp.123-148, 1991. ,
Text-to-Speech Synthesis, 2009. ,
Pysfc-a system for prosody analysis based on the superposition of functional contours prosody model, International Conference on Speech Prosody, pp.774-778, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01821214
Modelling and Synthesising F0 contours with the Discrete Cosine Transform, International Conference on Acoustics, Speech, and Signal Processing, pp.3973-3976, 2008. ,
Sparse coding of pitch contours with deep auto-encoders, International Conference on Speech Prosody, pp.799-803, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01722007
A variational prosody model for the decomposition and synthesis of speech prosody, Speech Prosody, 2018. ,
Emotional voice conversion using neural networks with arbitrary scales F0 based on wavelet transform, EURASIP Journal on Audio, Speech, and Music Processing, vol.2017, issue.1, 2017. ,
Statistical parametric speech synthesis, International Conference on Audio, Speech, and Signal Processing, pp.1229-1232, 2007. ,
Multilevel parametric-base F0 model for speech synthesis, pp.2274-2277, 2008. ,
Stylization and Trajectory Modelling of Short and Long Term Speech Prosody Variations, Interspeech, pp.2029-2032, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00598144
MeLos: Analysis and Modelling of Speech Prosody and Speaking Style, 2011. ,
URL : https://hal.archives-ouvertes.fr/tel-00694687
Intonation conversion from neutral to expressive speech, pp.2765-2768, 2011. ,
Modeling F0 trajectories in hierarchically structured deep neural networks, Speech Communication, vol.76, pp.82-92, 2016. ,
Statistical parametric speech synthesis: from HMM to LSTM-RNN, 2015. ,
, TTS synthesis with bidirectional LSTM based recurrent neural networks," Interspeech, 2014.
Unidirectional long short-term memory recurrent neural network with recurrent output layer for lowlatency speech synthesis, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015. ,
Deep bidirectional LSTM modeling of timbre and prosody for emotional voice conversion, 2016. ,
DBLSTM-based multitask learning for pitch transformation in voice conversion, 2016 10th International Symposium on Chinese Spoken Language Processing, 2016. ,
An RNN-Based quantized F0 model with Multi-Tier feedback links for Text-toSpeech synthesis, 2017. ,
A TemplateBased approach for speech synthesis intonation generation using LSTMs, 2016. ,
Sequence to sequence learning with neural networks, Advances in Neural Information Processing Systems (NIPS), 2014. ,
Tacotron: A fully end-to-end text-to-speech synthesis model, 2017. ,
Google's Next-Generation Real-Time Unit-Selection synthesizer using Sequence-to-Sequence LSTM-Based autoencoders, 2017. ,
WaveNet: A generative model for raw audio, 2016. ,
Learning phrase representations using rnn encoder-decoder for statistical machine translation, Empirical Methods in Natural Language Processing (EMNLP), 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01433235
Neural machine translation by jointly learning to align and translate, International Conference on Learning Representations (ICLR), 2014. ,
Online and linear-time attention by enforcing monotonic alignments, International Conference on Machine Learning (ICML), 2017. ,
Recurrent neural aligner: An encoder-decoder neural network model for sequence to sequence mapping, pp.1298-1302, 2017. ,
Automatic Phoneme Segmentation with Relaxed Textual Constraints, International Conference on Language Resources and Evaluation, pp.2403-2407, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-01161385