Iranian Journal of Electrical and Electronic Engineering

Showing 3 results for Emotion Recognition

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

M. Bashirpour, M. Geravanchizadeh
Volume 12, Issue 3 (9-2016)

Abstract

In recent years, automatic recognition of emotional states from speech in noisy conditions has become an important research topic in speech emotion recognition. This paper considers the recognition of emotional states from speech in real environments. For this task, we employ power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate their performance on clean and noisy speech and compare it with that of the well-known MFCC, LPCC, RASTA-PLP, and TEMFCC features. Speech samples are taken from the Berlin emotional speech database (Emo DB) and the Persian emotional speech database (Persian ESD) and are corrupted with four different noise types at various SNR levels. The experiments use a clean-train/noisy-test scenario to simulate practical conditions with noise sources. Simulation results show that PNCC achieves higher recognition rates than the conventional features under noisy conditions.
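As an illustration of how test speech can be corrupted at a chosen SNR, the following is a minimal sketch and not the authors' procedure. The function names `mix_at_snr` and `measured_snr_db`, the synthetic signals, and the choice to loop the noise over the utterance are all assumptions made for this example.

```python
import numpy as np

def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech, scaled so the mixture has the requested SNR (dB)."""
    speech = np.asarray(speech, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    # Assumption: loop the noise recording, then trim it to the utterance length.
    reps = int(np.ceil(len(speech) / len(noise)))
    noise = np.tile(noise, reps)[: len(speech)]
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    # Solve 10*log10(speech_power / (gain**2 * noise_power)) = snr_db for gain.
    gain = np.sqrt(speech_power / (noise_power * 10.0 ** (snr_db / 10.0)))
    return speech + gain * noise

def measured_snr_db(clean, noisy):
    """SNR of a mixture, given the clean signal it was built from."""
    residual = np.asarray(noisy) - np.asarray(clean)
    return 10.0 * np.log10(np.mean(np.asarray(clean) ** 2) / np.mean(residual ** 2))

# Synthetic stand-ins for a clean utterance and a noise recording.
rng = np.random.default_rng(0)
clean = np.sin(2 * np.pi * 220 * np.arange(16000) / 16000)
noise = rng.standard_normal(4000)
for snr in (0, 5, 10):
    noisy = mix_at_snr(clean, noise, snr)
    print(snr, round(measured_snr_db(clean, noisy), 3))
```

In a clean-train/noisy-test setup, a classifier would be trained on features from the clean signals only and evaluated on features extracted from mixtures like `noisy`.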

A Novel Nonparametric Kernel for Speech Emotion Recognition

Mohammad Hasheminejad
Volume 19, Issue 4 (12-2023)

Abstract

This study presents the Nonparametric Speech Kernel (NSK), a nonparametric kernel technique, as a novel way to improve Speech Emotion Recognition (SER). The method aims to reduce the dimensionality of speech features in order to improve recognition accuracy, addressing the need for efficient, compact, low-dimensional features for SER. Acknowledging the intrinsic differences between speech and image data, we refine the Kernel Nonparametric Weighted Feature Extraction (KNWFE) formulation to obtain NSK, which is designed specifically for speech emotion recognition. The output of NSK can be used as input features for deep learning models such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), or hybrid architectures. NSK can also be used as a kernel function for kernel-based methods such as kernelized support vector machines (SVM) or kernelized neural networks. Our tests show that NSK outperforms current techniques: it achieves an average accuracy of 96.568% on the Persian speech emotion dataset and 82.56% on the Berlin speech emotion dataset, exceeding the best-tested approach by 5.02% and 3.05%, respectively.

Performance Comparison of Facial Emotion Recognition: Introducing a Model within the Driver Assistance Framework based on Deep Learning with LBP Feature Extraction for In-Vehicle Applications

Ehsan Ghasemi, Seyyed Mohammad Razavi, Sajad Mohamadzadeh
Volume 20, Issue 4 (11-2024)

Abstract

This study proposes a descriptor-based approach combined with deep learning that recognizes facial emotions for safe driving. Paying attention to the driver's facial expressions is crucial to addressing the increasing number of road accidents. This project aims to develop a Facial Emotion Recognition (FER) system that monitors the driver's facial expressions to identify emotions and provide instant assistance for safety control. In the first stage, Viola-Jones face detection locates the facial region; Butterworth high-pass filtering then enhances the detected region, and Viola-Jones detection is applied again to locate the eye, nose, and mouth regions. In the second stage, the Local Binary Patterns (LBP) feature descriptor extracts features from the identified eye, nose, and mouth regions. Using 3 RGB channels, the features extracted from these three regions are fed into ResNet-50 and EfficientNet deep networks. The outputs of the two deep learning models' classifiers are combined using two ensemble methods: ensemble maximum voting and ensemble mean. Based on these combining rules, the performance was evaluated on the JAFFE and KMU-FED databases. The experimental results demonstrate that the proposed method recognizes emotions in the JAFFE and KMU-FED datasets effectively and with higher accuracy than competing methods. The novelty and originality of this paper lie in its significant application in the automotive industry. Implementing the proposed method in a system capable of high accuracy and precision can help mitigate numerous driving hazards. Our approach achieves 99% and 98% accuracy on the JAFFE and KMU-FED databases, respectively. This high level of accuracy, coupled with its practical relevance, underscores the innovative nature of our work.
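The two combining rules named in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not define "ensemble maximum voting", so it is read here as the max rule (take the class with the highest probability reported by either model), and the function names and probability values are hypothetical.

```python
import numpy as np

def ensemble_mean(prob_a, prob_b):
    """Average the two models' class probabilities, then pick the top class."""
    return np.argmax((np.asarray(prob_a) + np.asarray(prob_b)) / 2.0, axis=1)

def ensemble_max(prob_a, prob_b):
    """Max rule (assumed reading of 'maximum voting'): for each class keep the
    larger of the two probabilities, then pick the top class."""
    return np.argmax(np.maximum(prob_a, prob_b), axis=1)

# Hypothetical softmax outputs of two classifiers: 2 images, 3 emotion classes.
probs_model_a = np.array([[0.60, 0.30, 0.10],
                          [0.55, 0.45, 0.00]])
probs_model_b = np.array([[0.10, 0.80, 0.10],
                          [0.00, 0.40, 0.60]])

print(ensemble_mean(probs_model_a, probs_model_b))  # class index per image
print(ensemble_max(probs_model_a, probs_model_b))
```

For the second image the two rules disagree: averaging favours the class both models rate moderately, while the max rule follows the single most confident model.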

© 2022 by the authors. Licensee IUST, Tehran, Iran. This is an open access journal distributed under the terms and conditions of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license.

Iran University of Science and Technology
