B. Schuller, A. Batliner, S. Steidl, and D. Seppi, Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge, Speech Communication, vol.53, issue.9, pp.1062-1087, 2011.

B. Schuller, S. Steidl, A. Batliner, F. Burkhardt, L. Devillers et al., Paralinguistics in Speech and Language - State-of-the-Art and the Challenge, Computer Speech and Language, Special Issue on Paralinguistics in Naturalistic Speech and Language, vol.27, issue.1, pp.4-39, 2013.

B. Schuller, S. Steidl, A. Batliner, F. Schiel, J. Krajewski et al., Medium-Term Speaker States - A Review on Intoxication, Sleepiness and the First Challenge, Computer Speech and Language.

B. Schuller, S. Steidl, A. Batliner, E. Nöth, A. Vinciarelli et al., The INTERSPEECH 2012 Speaker Trait Challenge, Proc. Interspeech, 2012.

M. Przybocki and A. Martin, NIST Speaker Recognition Evaluation Chronicles, Proc. Odyssey, pp.12-22, 2004.

J. Downie, A. Ehmann, M. Bay, and M. Jones, The Music Information Retrieval Evaluation eXchange: Some Observations and Insights, Advances in Music Information Retrieval, pp.93-115, 2010.

T. Mandl, Recent developments in the evaluation of information retrieval systems: Moving towards diversity and practical relevance, Informatica, vol.32, pp.27-38, 2008.

B. Schuller, The Computational Paralinguistics Challenge, IEEE Signal Processing Magazine, vol.29, issue.4, pp.97-101, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01993250

A. Vinciarelli, M. Pantic, and H. Bourlard, Social signal processing: Survey of an emerging domain, Image and Vision Computing, vol.27, pp.1743-1759, 2009.

W. Roth and K. Tobin, Solidarity and conflict: aligned and misaligned prosody as a transactional resource in intra- and intercultural communication involving power differences, Cultural Studies of Science Education, vol.5, p.807, 2010.

J. McCann and S. Peppé, Prosody in autism spectrum disorders: a critical review, International Journal of Language and Communication Disorders, vol.38, pp.325-350, 2003.

J. van Santen, E. Prud'hommeaux, L. Black, and M. Mitchell, Computational Prosodic Markers for Autism, Autism, vol.14, pp.215-236, 2010.

D. Bone, M. P. Black, C. Lee, M. E. Williams, P. Levitt et al., Spontaneous-Speech Acoustic-Prosodic Features of Children with Autism and the Interacting Psychologist, Proc. Interspeech, 2012.

A. Batliner, K. Fischer, R. Huber, J. Spilker, and E. Nöth, Desperately Seeking Emotions: Actors, Wizards, and Human Beings, Proc. ISCA Workshop on Speech and Emotion, pp.195-200, 2000.

T. Vogt and E. André, Comparing Feature Sets for Acted and Spontaneous Speech in View of Automatic Emotion Recognition, Proc. International Conference on Multimedia and Expo (ICME), pp.474-477, 2005.

B. Schuller and A. Batliner, Computational Paralinguistics: Emotion, Affect and Personality in Speech and Language Processing, 2013.

J. Bachorowski, M. Smoski, and M. Owren, The acoustic features of human laughter, Journal of the Acoustical Society of America, vol.110, pp.1581-1597, 2001.

J. Vettin and D. Todt, Laughter in Conversation: Features of Occurrence and Acoustic Structure, Journal of Nonverbal Behavior, vol.28, issue.2, pp.93-115, 2004.

H. Tanaka and N. Campbell, Acoustic Features of Four Types of Laughter in Natural Conversational Speech, Proc. 17th International Congress of Phonetic Sciences (ICPhS), pp.1958-1961, 2011.

H. Clark and J. Fox Tree, Using "uh" and "um" in spontaneous speaking, Cognition, vol.84, issue.1, pp.73-111, 2002.

A. Vinciarelli, H. Salamin, A. Polychroniou, G. Mohammadi, and A. Origlia, From nonverbal cues to perception: Personality and social attractiveness, Cognitive Behavioural Systems, pp.60-72, 2012.

S. Young, G. Evermann, M. Gales, T. Hain, D. Kershaw et al., The HTK book (v3.4), 2006.

S. Kim, M. Filippone, F. Valente, and A. Vinciarelli, Predicting the conflict level in television political debates: an approach based on crowdsourcing, nonverbal communication and Gaussian processes, Proc. ACM International Conference on Multimedia, pp.793-796, 2012.

S. Meignier and T. Merlin, LIUM SpkDiarization: An open source toolkit for diarization, Proc. CMU SPUD Workshop, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01433518

T. Bänziger, M. Mortillaro, and K. Scherer, Introducing the Geneva Multimodal expression corpus for experimental research on emotion perception, Emotion, vol.12, pp.1161-1179, 2012.

F. Ringeval, J. Demouy, G. Szaszák, M. Chetouani, L. Robel et al., Automatic intonation recognition for the prosodic assessment of language impaired children, IEEE Transactions on Audio, Speech & Language Processing, vol.19, pp.1328-1342, 2011.
URL : https://hal.archives-ouvertes.fr/hal-02423449

F. Eyben, M. Wöllmer, and B. Schuller, openSMILE - The Munich Versatile and Fast Open-Source Audio Feature Extractor, Proc. ACM Multimedia, pp.1459-1462, 2010.

I. H. Witten and E. Frank, Data mining: Practical machine learning tools and techniques, 2005.

M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann et al., The WEKA Data Mining Software: An Update, SIGKDD Explorations, vol.11, 2009.