Rezwan Matin - Publications

Research

An Audio Processing Approach using Ensemble Learning for Speech-Emotion Recognition for Children with ASD

IEEE World AI IoT Congress (AIIoT), Seattle, WA, USA, May 2021.

Children with Autism Spectrum Disorder (ASD) find it difficult to detect human emotions in social interactions. This work develops a speech emotion recognition system that aims to help these children better identify the emotions of their communication partner. The system was built using machine learning and deep learning techniques. Through ensemble learning, multiple machine learning algorithms were combined to produce a final prediction for each recorded input utterance. The ensemble includes a Support Vector Machine (SVM), a Multi-Layer Perceptron (MLP), and a Recurrent Neural Network (RNN). All three models were trained on the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), the Toronto Emotional Speech Set (TESS), and the Crowd-sourced Emotional Multimodal Actors Dataset (CREMA-D). A fourth dataset was created by adding background noise to the clean speech files from these three datasets. The paper describes the audio processing of the samples, the techniques used to add the background noise, and the feature extraction coefficients considered for developing and training the models. The study evaluates each individual model on each dataset, on the noise-augmented data, and on the combination of all samples from all three datasets. This evaluation was used to select the optimal hyperparameter configuration for each model and then to assess the ensemble learning approach, which combines the models through majority voting. The ensemble reached a peak accuracy of 66.5%, higher than the 65.7% reached by the MLP model.
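The majority-voting step mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function name `majority_vote`, the tie-breaking rule, and the example labels are all assumptions made for illustration, and the three prediction lists stand in for the outputs of already-trained SVM, MLP, and RNN models.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label chosen by the most models.

    Ties go to the label that appears first in `predictions`
    (an assumed rule; the abstract does not state how ties are resolved).
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


# Hypothetical per-utterance predictions from the three models.
svm_out = ["happy", "sad", "angry"]
mlp_out = ["happy", "angry", "angry"]
rnn_out = ["sad", "sad", "fearful"]

# Combine the three votes for each utterance into one ensemble label.
ensemble_out = [majority_vote(votes) for votes in zip(svm_out, mlp_out, rnn_out)]
print(ensemble_out)  # ['happy', 'sad', 'angry']
```

With three voters, a label wins as soon as two models agree, so the ensemble can correct an error made by any single model on a given utterance.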

A Speech Emotion Recognition Solution-based on Support Vector Machine for Children with Autism Spectrum Disorder to Help Identify Human Emotions

IEEE Intermountain Engineering, Technology, and Computing Conference (I-ETC), Orem, UT, USA, October 2020.

Children on the autism spectrum have difficulty communicating with others. In this work, a speech emotion recognition model has been developed to help children with Autism Spectrum Disorder (ASD) identify emotions in social interactions. The model is implemented in Python as a machine learning model based on the Support Vector Machine (SVM). SVMs have been shown to yield high accuracy when classifying inputs in speech processing. Several audio databases are designed specifically for training models on the emotion recognition task. One such speech corpus is the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS), which is used to train the model in this work. Acoustic feature extraction is part of the pre-processing step and is performed with the libROSA Python library. The first 26 Mel-frequency Cepstral Coefficients (MFCCs) and the zero-crossing rate (ZCR) are extracted and used as the acoustic features to train the machine learning model. The final SVM model achieved a test accuracy of 77%. The model also performed well when significant background noise was introduced to the RAVDESS audio recordings, yielding a test accuracy of 64%.
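As an illustration of one of the two acoustic features named above, the following is a minimal pure-Python sketch of a frame-wise zero-crossing rate. It is not the paper's code, which uses libROSA; the function names, the frame and hop lengths, and the synthetic test tone are assumptions, and library implementations may differ in padding and normalization. MFCC extraction is not reproduced here.

```python
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs in `frame` whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1
        for prev, curr in zip(frame, frame[1:])
        if (prev >= 0) != (curr >= 0)
    )
    return crossings / (len(frame) - 1)


def framewise_zcr(signal, frame_length, hop_length):
    """ZCR of each full frame of `signal`; trailing partial frames are dropped."""
    last_start = len(signal) - frame_length
    return [
        zero_crossing_rate(signal[start:start + frame_length])
        for start in range(0, last_start + 1, hop_length)
    ]


# Illustrative input: one second of a 100 Hz sine tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 100 * n / sample_rate) for n in range(sample_rate)]

# Assumed framing parameters, chosen only for this example.
zcr_per_frame = framewise_zcr(tone, frame_length=2048, hop_length=512)
print(len(zcr_per_frame), zcr_per_frame[0])
```

A pure tone of frequency f crosses zero about 2f times per second, so the values here sit near 2 * 100 / 8000 = 0.025. Noisy or unvoiced speech segments typically produce a higher rate than voiced ones, which is what makes the feature useful alongside MFCCs.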

Material and Performance Analysis of MEMS Piezoresistive Pressure Sensor

International Journal of Engineering Trends and Technology (IJETT), Volume-31, Number-1, January 2016 issue.

This work focuses on a MEMS piezoresistive pressure sensor. The Motorola MPX100 series sensor was studied and simulated in COMSOL Multiphysics v4.4. The applied pressure was varied from 0 to 100 kPa. To obtain the optimum output, different combinations of materials for the diaphragm and piezoresistor were studied, and the corresponding displacement change, shear stress distribution, and output voltage are shown. The sensitivity of the sensor was also calculated for the different material combinations. The impact of doping concentration on output voltage, for both the diaphragm and the piezoresistor material, was also studied.
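For reference, a common way to express the sensitivity of a pressure sensor is the change in output voltage per unit of applied pressure. The expression below is that general definition applied to the 0 to 100 kPa range stated above; it is an assumption that the paper uses this form, and the symbols S, V_out, and P are introduced here for illustration only.

```latex
S \;=\; \frac{\Delta V_{\text{out}}}{\Delta P}
  \;=\; \frac{V_{\text{out}}(100\,\text{kPa}) - V_{\text{out}}(0\,\text{kPa})}{100\,\text{kPa} - 0\,\text{kPa}}
```

Under this definition, comparing material combinations amounts to comparing the slope of the output-voltage curve over the same pressure range.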
