Speech Communication, Volume 54
Volume 54, Number 1, January 2012
- Abhishek Jaywant, Marc D. Pell:
Categorical processing of negative emotions from speech prosody. 1-10
- Elisabetta Fersini, Enza Messina, Francesco Archetti:
Emotional states in judicial courtrooms: An experimental investigation. 11-22
- Mouloud Djamah, Douglas D. O'Shaughnessy:
Fine granularity scalable speech coding using embedded tree-structured vector quantization. 23-39
- Abhijeet Sangwan, John H. L. Hansen:
Automatic analysis of Mandarin accented English using phonological features. 40-54
- Deepu Vijayasenan, Fabio Valente, Hervé Bourlard:
Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features. 55-67
- Máire Ní Chiosáin, Pauline Welby, Robert Espesser:
Is the syllabification of Irish a typological exception? An experimental study. 68-91
- Silke Paulmann, Debra Titone, Marc D. Pell:
How emotional prosody guides your way: Evidence from eye movements. 92-107
- Peter Jancovic, Xin Zou, Münevver Köküer:
Speech enhancement based on Sparse Code Shrinkage employing multiple speech models. 108-118
- Cong-Thanh Do, Dominique Pastor, André Goalic:
A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech. 119-133
- Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. 134-146
- Ying-Yee Kong, Ala Mullangi:
On the development of a frequency-lowering system that enhances place-of-articulation perception. 147-160
Volume 54, Number 2, February 2012
- Nigel G. Ward, Alejandro Vega, Timo Baumann:
Prosodic and temporal features for language modeling for dialog. 161-174
- J. Sebastian Andersson, Junichi Yamagishi, Robert A. J. Clark:
Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis. 175-188
- Sophie Bouton, Pascale Colé, Willy Serniclaes:
The influence of lexical knowledge on phoneme discrimination in deaf children with cochlear implants. 189-198
- Jón Guðnason, Mark R. P. Thomas, Daniel P. W. Ellis, Patrick A. Naylor:
Data-driven voice source waveform analysis and synthesis. 199-211
- George Saon, Hagen Soltau:
Boosting systems for large vocabulary continuous speech recognition. 212-218
- Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Ariya Rastrow, Nobuyasu Itoh, Masafumi Nishimura:
Acoustically discriminative language model training with pseudo-hypothesis. 219-228
- Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani:
Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection. 229-244
- Vataya Chunwijitra, Takashi Nose, Takao Kobayashi:
A tone-modeling technique using a quantized F0 context to improve tone correctness in average-voice-based speech synthesis. 245-255
- Hamid Reza Tohidypour, Seyyed Ali Seyyedsalehi, Hossein Behbood, Hossein Roshandel:
A new representation for speech frame recognition based on redundant wavelet filter banks. 256-271
- Fei Chen, Philipos C. Loizou:
Impact of SNR and gain-function over- and under-estimation on speech intelligibility. 272-281
- Kuldip K. Paliwal, Belinda Schwerin, Kamil K. Wójcicki:
Speech enhancement using a minimum mean-square error short-time spectral modulation magnitude estimator. 282-305
- Andrew Hines, Naomi Harte:
Speech intelligibility prediction using a Neurogram Similarity Index Measure. 306-320
Volume 54, Number 3, March 2012
- Bert Réveil, Jean-Pierre Martens, Henk van den Heuvel:
Improving proper name recognition by means of automatically learned pronunciation variants. 321-340
- Pandurangarao N. Kulkarni, Prem C. Pandey, Dakshayani S. Jangamashetti:
Multi-band frequency compression for improving speech perception by listeners with moderate sensorineural hearing loss. 341-350
- Antonio Moreno-Daniel, Jay G. Wilpon, Biing-Hwang Juang:
Index-based incremental language model for scalable directory assistance. 351-367
- Daniel Recasens:
A cross-language acoustic study of initial and final allophones of /l/. 368-383
- Takashi Nose, Takao Kobayashi:
Very low bit-rate F0 coding for phonetic vocoders using MSD-HMM with quantized F0 symbols. 384-392
- Amaro A. de Lima, Thiago de M. Prego, Sergio L. Netto, Bowon Lee, Amir Said, Ronald W. Schafer, Ton Kalker, Majid Fozunbal:
On the quality-assessment of reverberated speech. 393-401
- Peng Dai, Ing Yann Soon:
A temporal frequency warped (TFW) 2D psychoacoustic filter for robust speech recognition system. 402-413
- Ioulia Grichkovtsova, Michel Morel, Anne Lacheret:
The role of voice quality and prosodic contour in affective speech perception. 414-429
- Frank Rudzicz:
Using articulatory likelihoods in the recognition of dysarthric speech. 430-444
- Je Hun Jeon, Yang Liu:
Automatic prosodic event detection using a novel labeling and selection method in co-training. 445-458
- Jordi Adell, David Escudero Mancebo, Antonio Bonafonte:
Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence. 459-476
- Jae-Hun Choi, Joon-Hyuk Chang:
On using acoustic environment classification for statistical model-based speech enhancement. 477-490
- Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran:
Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech. 491-502
- Angel M. Gomez, Belinda Schwerin, Kuldip K. Paliwal:
Improving objective intelligibility prediction by combining correlation and coherence based methods with a measure based on the negative distortion ratio. 503-515
Volume 54, Number 4, May 2012
- Anis Ben Aicha, Sofia Ben Jebara:
Perceptual speech quality measures separating speech distortion and additive noise degradations. 517-528
- Meihong Wu, Huahui Li, Zhiling Hong, Xinchi Xian, Jingyu Li, Xihong Wu, Liang Li:
Effects of aging on the ability to benefit from prior knowledge of message content in masked speech recognition. 529-542
- Md. Sahidullah, Goutam Saha:
Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. 543-565
- David Escudero Mancebo, Lourdes Aguilar, María Vanrell, Pilar Prieto:
Analysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system. 566-582
Volume 54, Number 5, June 2012
- William Ricardo Rodríguez, Oscar Saz, Eduardo Lleida:
A prelingual tool for the education of altered voices. 583-600
- Evaldas Vaiciukynas, Antanas Verikas, Adas Gelzinis, Marija Bacauskiene, Virgilijus Uloza:
Exploring similarity-based classification of larynx disorders from human voice. 601-610
- David M. Howard, Evelyn Abberton, Adrian Fourcin:
Disordered voice measurement and auditory analysis. 611-621
- Tiago H. Falk, Wai-Yip Chan, Fraser Shein:
Characterization of atypical vocal source excitation, temporal dynamics and prosody for objective measurement of dysarthric word intelligibility. 622-631
- Marieke de Bruijn, Louis ten Bosch, Dirk J. Kuik, Birgit I. Witte, Johannes A. Langendijk, C. René Leemans, Irma Verdonck-de Leeuw:
Acoustic-phonetic and artificial neural network feature analysis to assess speech quality of stop consonants produced by patients treated for oral or oropharyngeal cancer. 632-640
- Sevasti-Zoi Karakozoglou, Nathalie Henrich, Christophe d'Alessandro, Yannis Stylianou:
Automatic glottal segmentation using local-based active contours and application to glottovibrography. 641-654
- Ali Alpan, Jean Schoentgen, Youri Maryn, Francis Grenez, P. Murphy:
Assessment of disordered voice via the first rahmonic. 655-663
- Alain Ghio, Gilles Pouchoulin, Bernard Teston, Serge Pinto, Corinne Fredouille, Céline De Looze, Danièle Robert, François Viallet, Antoine Giovanni:
How to manage sound, physiological and clinical data of 2500 dysphonic and dysarthric speakers? 664-679
Volume 54, Number 6, July 2012
- Pilar Prieto, María Vanrell, Lluïsa Astruc, Elinor Payne, Brechtje Post:
Phonotactic and phrasal properties of speech rhythm. Evidence from Catalan, English, and Spanish. 681-702
- Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King, Keiichi Tokuda:
Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping. 703-714
- Tobias Kaufmann, Beat Pfister:
Syntactic language modeling with formal grammars. 715-731
- Petr Zelinka, Milan Sigmund, Jiri Schimmel:
Impact of vocal effort variability on automatic speech recognition. 732-742
- Rigas Kotsakis, George Kalliris, Charalampos Dimoulas:
Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification. 743-762
- Mohammad Hossein Moattar, Mohammad Mehdi Homayounpour:
Variational conditional random fields for online speaker detection and tracking. 763-780
- Mirjam Wester:
Talker discrimination across languages. 781-790
- Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Efficient training of discriminative language models by sample selection. 791-800
- Herman Kamper, Félicien Jeje Muamba Mukanya, Thomas Niesler:
Multi-accent acoustic modelling of South African English. 801-813
- Eduardo Pavez, Jorge F. Silva:
Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition. 814-835
- Ronan Flynn, Edward Jones:
Feature selection for reduced-bandwidth distributed speech recognition. 836-843
- David M. Howard, Evelyn Abberton, Adrian Fourcin:
Erratum to "Disordered voice measurement and auditory analysis" [Speech Comm. 54(2012) 611-621]. 844
Volume 54, Number 7, September 2012
- Lan Wang, Hui Chen, Sheng Li, Helen M. Meng:
Phoneme-level articulatory animation in pronunciation training. 845-856
- Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, Keiichi Tokuda:
Impacts of machine translation and speech synthesis on speech-to-speech translation. 857-866
- Shajith Ikbal, Hemant Misra, Hynek Hermansky, Mathew Magimai-Doss:
Phase AutoCorrelation (PAC) features for noise robust speech recognition. 867-880
- Ronan Flynn, Edward Jones:
Reducing bandwidth for robust distributed speech recognition in conditions of packet loss. 881-892
- Thorsten Smit, Friedrich Türckheim, Robert Mores:
Fast and robust formant detection from LP data. 893-902
- Ali Hassan, Robert I. Damper:
Classification of emotional speech using 3DEC hierarchical classifier. 903-916
- Hugo Quené, Gün Refik Semin, Francesco Foroni:
Audible smiles and frowns affect speech comprehension. 917-922
Volume 54, Number 8, October 2012
- Yana Yunusova, Melanie Baljko, Grigore Pintilie, Krista Rudy, Petros Faloutsos, John Daskalogiannakis:
Acquisition of the 3D surface of the palate by in-vivo digitization with Wave. 923-931
- Qinghua Sun, Keikichi Hirose, Nobuaki Minematsu:
A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model. 932-945
- Peggy P. K. Mok:
Effects of consonant cluster syllabification on vowel-to-vowel coarticulation in English. 946-956
- Zhongbo Li, Shenghui Zhao, Stefan Bruhn, Jing Wang, Jingming Kuang:
Comparison and optimization of packet loss recovery methods based on AMR-WB for VoIP. 957-974
Volume 54, Number 9, November 2012
- Okko Räsänen:
Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions. 975-997
- Toshio Irino, Yoshie Aoki, Hideki Kawahara, Roy D. Patterson:
Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination. 998-1013
- Atsunori Ogawa, Atsushi Nakamura:
Joint estimation of confidence and error causes in speech recognition. 1014-1028
- Irene Ayllón Clemente, Martin Heckmann, Britta Wrede:
Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation. 1029-1048
- Khiet P. Truong, David A. van Leeuwen, Franciska M. G. de Jong:
Speech-based recognition of self-reported and observed emotion in a dimensional space. 1049-1063
Volume 54, Number 10, December 2012
- Mohammad Hossein Moattar, Mohammad Mehdi Homayounpour:
A review on speaker diarization systems and approaches. 1065-1103
- Veena Karjigi, Preeti Rao:
Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling. 1104-1120
- Edward Ozimek, Dariusz Kutzner, Pawel Libiszewski:
Speech intelligibility tested by the Pediatric Matrix Sentence test in 3-6 year old children. 1121-1131
- Doris Baum:
Recognising speakers from the topics they talk about. 1132-1142