skip to main content
10.1145/3641142.3641167acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacswConference Proceedingsconference-collections
short-paper
Open access

Emotion Variation Detection in Discrete English Speech: A Wavelet Transform Use Case in Mental Health Monitoring

Published: 13 May 2024 Publication History

Abstract

The increasing complexity in modern society has been leading to a series of emotional shifts and mental pressures for individuals. Emotion detection can assist people in managing stress and monitoring mental health. Consequently, recent works are leveraging advancements in vocal/acoustic signal processing and machine learning models to improve emotion detection from speech signals. A challenge in detecting variations in emotion from speech involves the identification of appropriate features that can accurately represent the underlying phenomenon. This paper proposes a set of features derived from energy content and entropy measures extracted through the decomposition signals of the discrete wavelet transform. These features aim to characterize various negative emotions, encompassing fear, sadness, anger, anxiety, and disgust, within speech signals in non-controlled noise conditions. We employ CNN-based architectures to classify the speech signals to detect the embedded emotions. The results of our experiments on publicly available datasets show that the proposed method performs better than the state-of-the-art methods, which use other time-frequency representations. We achieved an unweighted accuracy (UA) of 83.7 ± 2.5 and a weighted accuracy (WA) of 81.7 ± 5.

References

[1]
Paul S Addison. 2017. The illustrated wavelet transform handbook: introductory theory and applications in science, engineering, medicine and finance. CRC press.
[2]
Mehmet Berkehan Akçay and Kaya Oğuz. 2020. Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers. Speech Communication 116 (2020), 56–76.
[3]
Xu Dong An and Zhou Ruan. 2021. Speech Emotion Recognition algorithm based on deep learning algorithm fusion of temporal and spatial features. Journal of Physics: Conference Series 1861, 1 (mar 2021), 012064. https://doi.org/10.1088/1742-6596/1861/1/012064
[4]
Surekha Reddy Bandela and T Kishore Kumar. 2017. Stressed speech emotion recognition using feature fusion of teager energy operator and MFCC. In 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT). IEEE, 1–5.
[5]
Fani Deligianni, Yao Guo, and Guang-Zhong Yang. 2019. From emotions to mood disorders: A survey on gait analysis methodology. IEEE journal of biomedical and health informatics 23, 6 (2019), 2302–2316.
[6]
Kate Dupuis and M Kathleen Pichora-Fuller. 2010. Toronto emotional speech set (tess)-younger talker_happy. (2010).
[7]
S Lalitha, Deepa Gupta, Mohammed Zakariah, and Yousef Ajami Alotaibi. 2021. Mental Illness Disorder Diagnosis Using Emotion Variation Detection from Continuous English Speech.Computers, Materials & Continua 69, 3 (2021).
[8]
Margaret Lech and Ling He. 2013. Stress and emotion recognition using acoustic speech analysis. In Mental Health Informatics. Springer, 163–184.
[9]
Kapang Legoh, T Tuithung, and U Bhattacharjee. 2015. Features and model adaptation techniques for robust speech recognition: a review. Communications on Applied Electronics (CAE). New York, USA: Foundation of Computer Science FCS (2015), 18–31.
[10]
Yulan Li, Charlesetta Baidoo, Ting Cai, and Goodlet A Kusi. 2019. Speech emotion recognition using 1d cnn with no attention. In 2019 23rd international computer science and engineering conference (ICSEC). IEEE, 351–356.
[11]
Steven R Livingstone and Frank A Russo. 2018. The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS): A dynamic, multimodal set of facial and vocal expressions in North American English. PloS one 13, 5 (2018), e0196391.
[12]
Samaneh Madanian, Talen Chen, Olayinka Adeleye, John Michael Templeton, Christian Poellabauer, Dave Parry, and Sandra L. Schneider. 2023. Speech emotion recognition using machine learning — A systematic review. Intelligent Systems with Applications 20 (2023), 200266. https://doi.org/10.1016/j.iswa.2023.200266
[13]
Samaneh Madanian, David Parry, Olayinka Adeleye, Christian Poellabauer, Farhaan Mirza, Shilpa Mathew, and Sandy Schneider. 2022. Automatic Speech Emotion Recognition Using Machine Learning: Digital Transformation of Mental Health. (2022).
[14]
Nivedhitha Mahendran, PM Durai Raj Vincent, Kathiravan Srinivasan, Vishal Sharma, and DushanthaNalin K Jayakody. 2020. Realizing a stacking generalization model to improve the prediction accuracy of major depressive disorder in adults. IEEE Access 8 (2020), 49509–49522.
[15]
Sherri Melrose 2015. Seasonal affective disorder: an overview of assessment and treatment approaches. Depression research and treatment 2015 (2015).
[16]
Hemanta Kumar Palo and Mihir Narayan Mohanty. 2018. Wavelet based feature combination for recognition of emotions. Ain shams engineering journal 9, 4 (2018), 1799–1806.
[17]
A Pramod Reddy and V Vijayarajan. 2017. Extraction of emotions from speech-a survey. International Journal of Applied Engineering Research 12, 16 (2017), 5760–5767.
[18]
Celia Shahnaz, SM Shafiul Hasan, 2016. Emotion recognition based on wavelet analysis of Empirical Mode Decomposed EEG signals responsive to music videos. In 2016 IEEE Region 10 Conference (TENCON). IEEE, 424–427.
[19]
Stanley Smith Stevens, John Volkmann, and Edwin Broomell Newman. 1937. A scale for the measurement of the psychological magnitude pitch. The journal of the acoustical society of america 8, 3 (1937), 185–190.
[20]
Madhukar H Trivedi. 2004. The link between depression and physical symptoms. Primary care companion to the Journal of clinical psychiatry 6, suppl 1 (2004), 12.
[21]
Yue Xie, Ruiyu Liang, Zhenlin Liang, Chengwei Huang, Cairong Zou, and Björn Schuller. 2019. Speech emotion classification using attention-based LSTM. IEEE/ACM Transactions on Audio, Speech, and Language Processing 27, 11 (2019), 1675–1685.
[22]
Rendi EJ Yohanes, Wee Ser, and Guang-bin Huang. 2012. Discrete wavelet transform coefficients for emotion recognition from EEG signals. In 2012 annual international conference of the IEEE engineering in medicine and biology society. IEEE, 2251–2254.
[23]
X Zhu, Y Liu, Z Qin, and J Li. [n. d.]. Data augmentation in emotion classification using generative adversarial networks. arXiv 2017. arXiv preprint arXiv:1711.00648 ([n. d.]).

Index Terms

  1. Emotion Variation Detection in Discrete English Speech: A Wavelet Transform Use Case in Mental Health Monitoring

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      ACSW '24: Proceedings of the 2024 Australasian Computer Science Week
      January 2024
      152 pages
      ISBN:9798400717307
      DOI:10.1145/3641142
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 13 May 2024

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. CNN
      2. Data Analytics
      3. Digital Health
      4. Healthcare Digital Transformation
      5. Mental health
      6. Speech Emotion
      7. Wavelet

      Qualifiers

      • Short-paper
      • Research
      • Refereed limited

      Conference

      ACSW 2024
      ACSW 2024: 2024 Australasian Computer Science Week
      January 29 - February 2, 2024
      NSW, Sydney, Australia

      Acceptance Rates

      Overall Acceptance Rate 61 of 141 submissions, 43%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 246
        Total Downloads
      • Downloads (Last 12 months)246
      • Downloads (Last 6 weeks)46
      Reflects downloads up to 09 Jan 2025

      Other Metrics

      Citations

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format.

      HTML Format

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media