Audio-Visual Emotion Recognition in Video Clips

Anbarjafari, Gholamreza; Noroozi, Fatemeh; Marjanovic, Marina; Njegus, Angelina; Escalera, Sergio

doi:10.1109/TAFFC.2017.2713783

Audio-Visual Emotion Recognition in Video Clips

dc.contributor.author	Anbarjafari, Gholamreza
dc.contributor.author	Noroozi, Fatemeh
dc.contributor.author	Marjanovic, Marina
dc.contributor.author	Njegus, Angelina
dc.contributor.author	Escalera, Sergio
dc.date.accessioned	2019-11-05T12:14:35Z
dc.date.available	2019-11-05T12:14:35Z
dc.date.issued	2019-01
dc.department	HKÜ, Mühendislik Fakültesi, Elektirik Elektronik Mühendisliği Bölümü	en_US
dc.description.abstract	This paper presents a multimodal emotion recognition system, which is based on the analysis of audio and visual cues. From the audio channel, Mel-Frequency Cepstral Coefficients, Filter Bank Energies and prosodic features are extracted. For the visual part, two strategies are considered. First, facial landmarks' geometric relations, i.e., distances and angles, are computed. Second, we summarize each emotional video into a reduced set of key-frames, which are taught to visually discriminate between the emotions. In order to do so, a convolutional neural network is applied to key-frames summarizing videos. Finally, confidence outputs of all the classifiers from all the modalities are used to define a new feature space to be learned for final emotion label prediction, in a late fusion/stacking fashion. The experiments conducted on the SAVEE, eNTERFACE'05, and RML databases show significant performance improvements by our proposed system in comparison to current alternatives, defining the current state-of-the-art in all three databases.	en_US
dc.identifier.citation	Noroozi, F., Marjanovic, M., Njegus, A., Escalera, S., & Anbarjafari, G. (January 01, 2019). Audio-Visual Emotion Recognition in Video Clips. Ieee Transactions on Affective Computing, 10, 1, 60-75.	en_US
dc.identifier.doi	10.1109/TAFFC.2017.2713783
dc.identifier.endpage	75	en_US
dc.identifier.issn	1949-3045
dc.identifier.issue	1	en_US
dc.identifier.scopus	2-s2.0-85050265218
dc.identifier.scopusquality	Q1
dc.identifier.startpage	60	en_US
dc.identifier.uri	https://doi.org/10.1109/TAFFC.2017.2713783
dc.identifier.uri	https://hdl.handle.net/20.500.11782/574
dc.identifier.volume	10	en_US
dc.identifier.wos	WOS:000461333200008
dc.identifier.wosquality	Q1
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	en_US
dc.relation.ispartof	IEEE TRANSACTIONS ON AFFECTIVE COMPUTING
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı	en_US
dc.rights	info:eu-repo/semantics/embargoedAccess	en_US
dc.subject	Multimodal emotion recognition; classifier fusion; data fusion; convolutional neural networks	en_US
dc.title	Audio-Visual Emotion Recognition in Video Clips
dc.type	Article

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1

İsim:: 000461333200008.pdf
Boyut:: 800.97 KB
Biçim:: Adobe Portable Document Format
Açıklama:: Makale Dosyası

İndir

Lisans paketi

Listeleniyor 1 - 1 / 1

İsim:: license.txt
Boyut:: 1.56 KB
Biçim:: Item-specific license agreed upon to submission
Açıklama:

İndir

Koleksiyon

MF - EEM Makale Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu
WoS İndeksli Yayınlar Koleksiyonu