TY - GEN
T1 - Emotion Recognition from Raw Speech Signals Using 2D CNN with Deep Metric Learning
AU - Mocanu, Bogdan
AU - Tapu, Ruxandra
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - In this paper we have introduced a novel emotion recognition framework from raw speech signals. The system is based on ResNet architecture fed with spectrogram inputs. The CNN is further extended with a GhostVLAD feature aggregation layer that extracts a single, fixed size descriptor constructed at the level of the utterance. The system adopts a sentiment metric loss that integrates the relations between various classes of emotions. The experimental evaluation conducted on two publicly available databases: RAVDESS and CREMA-D validates the proposed methodology with average accuracy scores of 82% and 63%, respectively.
AB - In this paper we have introduced a novel emotion recognition framework from raw speech signals. The system is based on ResNet architecture fed with spectrogram inputs. The CNN is further extended with a GhostVLAD feature aggregation layer that extracts a single, fixed size descriptor constructed at the level of the utterance. The system adopts a sentiment metric loss that integrates the relations between various classes of emotions. The experimental evaluation conducted on two publicly available databases: RAVDESS and CREMA-D validates the proposed methodology with average accuracy scores of 82% and 63%, respectively.
KW - GhostVLAD aggregation layer
KW - multi-stage training
KW - sentiment metric learning
KW - speech emotion recognition
U2 - 10.1109/ICCE53296.2022.9730534
DO - 10.1109/ICCE53296.2022.9730534
M3 - Conference contribution
AN - SCOPUS:85127059517
T3 - Digest of Technical Papers - IEEE International Conference on Consumer Electronics
BT - 2022 IEEE International Conference on Consumer Electronics, ICCE 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Consumer Electronics, ICCE 2022
Y2 - 7 January 2022 through 9 January 2022
ER -