Dang-Khoa Mac, Laboratory of Informatics of Grenoble (LIG), CNRS
Véronique Aubergé, Laboratory of Informatics of Grenoble (LIG), CNRS
Albert Rilliard, LIMSI-CNRS, Orsay
Eric Castelli, International Research Center MICA, CNRS-UMI 2954, Hanoi University of Technology
Prosodic attitudes (social affects) are highly linked to the language through the culture, and are a main part of face to face interaction. Therefore, for description and modeling, as well as for applications like translation, language learning or synthesis, a cross-cultural approach is relevant. This paper presents a cross-perception of Audio-Visual prosodic attitudes in Vietnamese, an under-resourced tonal language. Based on an audio-visual corpus of 16 attitudes, perception experiments were carried out with Vietnamese and French participants: firstly, to understand the contribution of audio and visual modalities to affective communication; secondly, to perceptually measure how the native and non-native listeners recognize and confuse the Vietnamese attitudes. The results reveal cultural specificities and cross-cultural common attitudes in Vietnamese.