Optimal classifier selection in Turkish speech emotion detection Türkçe sesli duygu tespitinde etkin siniflandirici seçimi

Ozsonmez D. B., ACARMAN T., PARLAK İ. B.

29th IEEE Conference on Signal Processing and Communications Applications, SIU 2021, Virtual, Istanbul, Türkiye, 9 - 11 Haziran 2021, (Tam Metin Bildiri)

Yayın Türü: Bildiri / Tam Metin Bildiri
Doi Numarası: 10.1109/siu53274.2021.9477785
Basıldığı Şehir: Virtual, Istanbul
Basıldığı Ülke: Türkiye
Anahtar Kelimeler: Artificial neural networks, Deep learning, Emotion classification, Speech processing
Galatasaray Üniversitesi Adresli: Evet

Özet

Emotion detection comprises various signal processing steps since emotion perception is subjective in speech. Signals used in emotion detection would be extracted using acoustic, visual and textual data. The quality of data acquisition and labelling has positive effects on emotion classification. Emotion extraction varies depending on factors such as spoken language, emotion polarity, age, and gender. Thus, it is crucial to use relevant features of spoken language in aural sentiment detection. In this study, acoustic feature extraction methods; MFCC, LFCC, PLP-RASTA, LPC, Mel-Spectrogram are used. Feature selection is performed with Principal Component Analysis method. The architecture of artificial neural network models is applied on emotion classification using Turkish speech datasets (TurES and TurEV-DB). The benchmark analysis is studied through different features and comparative results are obtained. Grid Search and Randomized Search are used to choose the best parameters for models.