Madruga et al., 2020 - Google Patents
Multicondition training for noise-robust detection of benign vocal fold lesions from recorded speechMadruga et al., 2020
View PDF- Document ID
- 5783334005009098250
- Author
- Madruga M
- Campos-Roca Y
- Perez C
- Publication year
- Publication venue
- Ieee Access
External Links
Snippet
This study evaluates the effects of Multicondition Training (MCT) on computer aided diagnosis systems for voice quality assessment associated to exudative lesions of Reinke's space. This technique adds various noise conditions to the speech recordings in order to …
- 230000003902 lesions 0 title abstract description 24
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by the preceding groups
- G01N33/48—Investigating or analysing materials by specific methods not covered by the preceding groups biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/5005—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by the preceding groups
- G01N33/48—Investigating or analysing materials by specific methods not covered by the preceding groups biological material, e.g. blood, urine; Haemocytometers
- G01N33/483—Physical analysis of biological material
- G01N33/487—Physical analysis of biological material of liquid biological material
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Karan et al. | Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson's disease prediction | |
Mekyska et al. | Robust and complex approach of pathological speech signal analysis | |
Arora et al. | Developing a large scale population screening tool for the assessment of Parkinson's disease using telephone-quality voice | |
Hadjitodorov et al. | A computer system for acoustic analysis of pathological voices and laryngeal diseases screening | |
Arias-Vergara et al. | Parkinson’s disease and aging: analysis of their effect in phonation and articulation of speech | |
Dibazar et al. | Feature analysis for automatic detection of pathological speech | |
He et al. | Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech | |
Ngo et al. | Computerized analysis of speech and voice for Parkinson's disease: A systematic review | |
Travieso et al. | Detection of different voice diseases based on the nonlinear characterization of speech signals | |
Hemmerling et al. | Voice data mining for laryngeal pathology assessment | |
Panek et al. | Acoustic analysis assessment in speech pathology detection | |
Gómez-García et al. | On the design of automatic voice condition analysis systems. Part II: Review of speaker recognition techniques and study on the effects of different variability factors | |
Madruga et al. | Multicondition training for noise-robust detection of benign vocal fold lesions from recorded speech | |
Jothilakshmi | Automatic system to detect the type of voice pathology | |
Godino-Llorente et al. | Pathological likelihood index as a measurement of the degree of voice normality and perceived hoarseness | |
Ding et al. | Deep connected attention (DCA) ResNet for robust voice pathology detection and classification | |
Zakariah et al. | [Retracted] An Analytical Study of Speech Pathology Detection Based on MFCC and Deep Neural Networks | |
Ren et al. | The acoustic dissection of cough: diving into machine listening-based COVID-19 analysis and detection | |
Kiss et al. | Language independent detection possibilities of depression by speech | |
Sabir et al. | Improved algorithm for pathological and normal voices identification | |
Khaskhoussy et al. | Speech processing for early Parkinson’s disease diagnosis: machine learning and deep learning-based approach | |
Xie et al. | A voice disease detection method based on MFCCs and shallow CNN | |
Sharma et al. | Audio texture and age-wise analysis of disordered speech in children having specific language impairment | |
Reddy et al. | Exemplar-Based Sparse Representations for Detection of Parkinson's Disease From Speech | |
Deepa et al. | Speech technology in healthcare |