Evaluation of wavelet measures on automatic detection of emotion in noisy and telephony speech signals
Fecha
2014-01-01
Autores
Vasquez-Correa, J. C.
Garcia, N.
Vargas-Bonilla, J. F.
Orozco-Arroyave, J. R.
Arias-Londono, J. D.
Título de la revista
ISSN de la revista
Título del volumen
Editor
IEEE
Resumen
Detection of emotion in humans from speech signals is a recent research field. One of the scenarios where this field has been applied is in situations where the human integrity and security are at risk. In this paper we are propossing a set of features based on the Teager energy operator, and several entropy measures obtained from the decomposition signals from discrete wavelet transform to characterize different types of negative emotions such as anger, anxiety, disgust, and desperation. The features are measured in three different conditions: (1) the original speech signals, (2) the signals that are contaminated with noise, or are affected by the presence of a phone channel, and (3) the signals that are obtained after processing using an algorithm for Speech Enhancement based on Karhunen-Love Transform. According to the results, when the speech enhancement is applied, the detection of emotion in speech is increased in up to 22% compared to results obtained when the speech signal is highly contaminated with noise. © 2014 IEEE.