A Novel Multi-Task and Ensembled Optimized Parallel Convolutional Autoencoder and Transformer for Speech Emotion Recognition
   
Authors: Sharifzadeh Jafari, Zahra; Seyedin, Sanaz
Source: AUT Journal of Electrical Engineering, 2024, Vol. 56, No. 2, pp. 213-226
Abstract: Recognizing emotions from speech signals is important in many applications of human-computer interaction (HCI). In this paper, we present a novel model for speech emotion recognition (SER) based on new multi-task parallel convolutional autoencoders (PCAEs) and transformer networks. The PCAEs are proposed to generate high-level, informative, harmonic sparse features from the input. With the proposed parallel CAEs, we can extract nonlinear sparse features in an ensemble manner, improving the accuracy and generalization of the model. These PCAEs also address the loss of initial sequential information during convolution operations in SER tasks. We also propose using a transformer in parallel with the PCAEs to capture long-term dependencies between speech samples and to exploit its self-attention mechanism. Finally, we propose a multi-task loss function made up of two terms: a classification loss and an AE mapper loss. This multi-task loss aims to reduce not only the classification error but also the regression error of the PCAEs, which also work as mappers between the input and output mel-frequency cepstral coefficients (MFCCs). Thus, we can focus both on finding accurate features with the PCAEs and on improving the classification results. We evaluated the proposed method on the RAVDESS SER dataset in terms of accuracy, precision, recall, and F1-score. The average accuracy of the proposed model on eight emotions outperforms all the recent baselines.
Keywords: speech emotion recognition, mel-frequency cepstral coefficients, autoencoder, transformer, multi-task deep learning
Address: Amirkabir University of Technology (Tehran Polytechnic), Department of Electrical Engineering, Iran; Amirkabir University of Technology (Tehran Polytechnic), Department of Electrical Engineering, Iran
Email: sseyedin@aut.ac.ir
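
The abstract describes a multi-task loss with two terms: a classification loss and an AE mapper (regression) loss between the input and output MFCCs. The sketch below shows one plausible form of such a loss for a single example. It is not the authors' implementation: the use of cross-entropy and mean squared error, the additive combination, the weight `alpha`, and all function names are assumptions made for illustration.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of class scores.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, label):
    # Classification term (assumed): negative log-probability of the true class.
    return -math.log(softmax(logits)[label])

def mse(reconstructed, target):
    # Mapper term (assumed): mean squared error between output and input MFCCs.
    return sum((r - t) ** 2 for r, t in zip(reconstructed, target)) / len(target)

def multi_task_loss(logits, label, reconstructed, target, alpha=1.0):
    # Weighted sum of the two terms; alpha is a hypothetical balancing weight.
    return cross_entropy(logits, label) + alpha * mse(reconstructed, target)
```

For example, `multi_task_loss([0.0] * 8, 3, [1.0, 2.0], [1.0, 4.0], alpha=0.5)` combines the cross-entropy of a uniform prediction over eight emotion classes with half of the reconstruction error of a two-element toy MFCC vector.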
 
     
   
Copyright 2023
Islamic World Science Citation Center
All Rights Reserved