emotion recognition

how machines recognize emotions in speech

Speech emotion recognition by machines is a challenging task due to the subjective nature of emotions, contrasting with humans’ seemingly effortless perception of emotions in speech. Two critical questions arise: How does the classification of emotional utterances by human raters differ from that by machine learning models? Which emotion types demonstrate a strong correlation between human raters’ assessments and modeling results?

To address these questions, this project not only collects human ratings of emotional utterances but also utilizes the openSMILE Python package to extract a comprehensive set of features. A carefully selected subset of these features is then employed to train classifiers, including support vector machines (SVM) and convolutional neural networks (CNN), to predict the emotional labels of utterances.

More findings from this research will be available soon. Please stay tuned for updates.

Related Publications

2023

  1. ASA
    Acoustic properties of vocal emotions in American English and Mandarin Chinese
    Fenqi Wang, and Ratree Wayland
    In The 184th Meeting of Acoustical Society of America, 2023