Chakhtouna et al., 2023 - Google Patents

Speaker and gender dependencies in within/cross linguistic Speech Emotion Recognition

Chakhtouna et al., 2023

Document ID: 11106186351508536630
Author: Chakhtouna A; Sekkate S; Adib A
Publication year: 2023
Publication venue: International Journal of Speech Technology

External Links

Cited by

Snippet

In this difficult period and with the great influence of COVID-19 on many aspects of people's lives, many areas have been affected such as economy, tourism and especially issues related to the medical field. For example in healthcare, a lot of people suffered from …

Continue reading at link.springer.com (other versions)

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/30—Medical informatics, i.e. computer-based analysis or dissemination of patient or disease data
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Detecting, measuring or recording for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7235—Details of waveform analysis
- A61B5/7264—Classification of physiological signals or data, e.g. using neural networks, statistical classifiers, expert systems or fuzzy systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems

Similar Documents

Publication	Publication Date	Title
Ancilin et al.	2021	Improved speech emotion recognition with Mel frequency magnitude coefficient
Hegde et al.	2019	A survey on machine learning approaches for automatic detection of voice disorders
Panek et al.	2015	Acoustic analysis assessment in speech pathology detection
Karan et al.	2021	An improved framework for Parkinson’s disease prediction using Variational Mode Decomposition-Hilbert spectrum of speech signal
Upadhya et al.	2018	Thomson Multitaper MFCC and PLP voice features for early detection of Parkinson disease
Zhang et al.	2023	Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review
Benba et al.	2016	Voice assessments for detecting patients with Parkinson’s diseases using PCA and NPCA
Niu et al.	2021	A time-frequency channel attention and vectorization network for automatic depression level prediction
Chakhtouna et al.	2023	Speaker and gender dependencies in within/cross linguistic Speech Emotion Recognition
Shetty et al.	2018	Classification of healthy and pathological voices using MFCC and ANN
Sekkate et al.	2023	A statistical feature extraction for deep speech emotion recognition in a bilingual scenario
Suparatpinyo et al.	2023	Smart voice recognition based on deep learning for depression diagnosis
Nishikawa et al.	2022	Machine learning model for discrimination of mild dementia patients using acoustic features
Gallardo-Antolín et al.	2021	On combining acoustic and modulation spectrograms in an attention LSTM-based system for speech intelligibility level classification
Majda-Zdancewicz et al.	2021	Deep learning vs feature engineering in the assessment of voice signals for diagnosis in Parkinson’s disease
Gidaye et al.	2020	Wavelet sub-band features for voice disorder detection and classification
Deepa et al.	2022	Speech technology in healthcare
Mishra et al.	2024	Improvement of emotion classification performance using multi-resolution variational mode decomposition method
Xing et al.	2022	2-level hierarchical depression recognition method based on task-stimulated and integrated speech features
Selvan et al.	2023	Emotion detection on phone calls during emergency using ensemble model with hyper parameter tuning
Mishra et al.	2024	Speech emotion recognition using a combination of variational mode decomposition and Hilbert transform
Al Dujaili et al.	2023	Automatic speech emotion recognition based on hybrid features with ANN, LDA and K_NN classifiers
Meng et al.	2022	A lightweight CNN and Transformer hybrid model for mental retardation screening among children from spontaneous speech
Chakhtouna et al.	2022	Improving speaker-dependency/independency of wavelet-based speech emotion recognition
Singhal et al.	2023	Voice signal-based disease diagnosis using iot and learning algorithms for healthcare