Acoustic Characteristics of Emotional Speech Using Spectrogram Image Classification

Stolar, M, Lech, M, Bolia, R and Skinner, M 2018, 'Acoustic Characteristics of Emotional Speech Using Spectrogram Image Classification', in 2018 12th International Conference on Signal Processing and Communication Systems (ICSPCS), Cairns, Australia, 17-19 December 2018, pp. 1-6.


Document type: Conference Paper
Collection: Conference Papers

Title Acoustic Characteristics of Emotional Speech Using Spectrogram Image Classification
Author(s) Stolar, M
Lech, M
Bolia, R
Skinner, M
Year 2018
Conference name 2018 12th International Conference on Signal Processing and Communication Systems (ICSPCS)
Conference location Cairns, Australia
Conference dates 17-19 December 2018
Proceedings title 2018 12th International Conference on Signal Processing and Communication Systems (ICSPCS)
Publisher IEEE
Place of publication USA
Start page 1
End page 6
Total pages 6
Abstract One of the problems limiting the accuracy of speech emotion recognition (SER) is difficulty in the differentiation between acoustically-similar emotions. Since it is not clear how emotions differ in acoustic terms, it is difficult to design new, more efficient SER strategies. In this study, amplitude-frequency analysis of emotional speech was performed to determine relative differences between seven emotional categories of speech in the Berlin Emotional Speech (EMO-DB) database. The analysis transformed short J-second blocks of speech into RGB images of spectrograms using four different frequency scales. The images were used to train a convolutional neural network (CNN) to recognize emotions. By training the network with different combinations of frequency scales and color components of the RGB images that emphasized different frequency and spectral amplitude values, links between different emotions and corresponding amplitude-frequency characteristics of speech were determined.
Subjects Signal Processing
Keyword(s) speech analysis
emotion recognition
speech classification
neural networks
Copyright notice © 2018 IEEE
ISBN 9781538656020
Versions
Version Filter Type
Access Statistics: 18 Abstract Views  -  Detailed Statistics
Created: Tue, 26 Mar 2019, 09:36:00 EST by Catalyst Administrator
© 2014 RMIT Research Repository • Powered by Fez SoftwareContact us