Supervised Vocal-Based Emotion Recognition Using Multiclass Support Vector Machine, Random Forests, and Adaboost

Yükleniyor...
Küçük Resim

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

AUDIO ENGINEERING SOC

Erişim Hakkı

info:eu-repo/semantics/embargoedAccess

Özet

This paper investigates and compares three different classifiers-multi-class Support Vector Machine, Adaboost, and random forests-for the purpose of vocal emotion recognition. Additionally, the decisions of all classifiers are combined using majority voting. The proposed method has been applied to two different emotional databases, which are the Surrey Audio-Visual Expressed Emotion Database and the Polish Emotional Speech Database. Fourteen features, namely pitch, intensity, first through fourth formants and their bandwidths, mean autocorrelation, mean noise-to-harmonics ratio, mean harmonics-to-noise ratio, and standard deviation have been extracted from both databases. Best recognition rate on the Surrey Audio-Visual Expressed Emotion Database has been achieved for the random forest algorithm, which is 75.71%. On the Polish Emotional Speech Database the best recognition rate has been achieved by applying the Adaboost classifier, which is 87.5%. The achieved recognition rates are higher than presented by Yuncu et al. on the same databases (73.81% on the Surrey Audio-Visual Expressed Emotion Database and 71.30% on the Polish Emotional Speech database).

Açıklama

Anahtar Kelimeler

SPEECH; MODELS; CLASSIFICATION; FEATURES

Kaynak

JOURNAL OF THE AUDIO ENGINEERING SOCIETY

WoS Q Değeri

Scopus Q Değeri

Cilt

65

Sayı

7-8

Künye

Noroozi, F., Kaminska, D., Sapinski, T., & Anbarjafari, G. (August 15, 2017). Supervised Vocal-Based Emotion Recognition Using Multiclass Support Vector Machine, Random Forests, and Adaboost. Journal of the Audio Engineering Society, 65, 562-572.

Onay

İnceleme

Ekleyen

Referans Veren