Prasanta Kumar Ghosh
Person information
- affiliation: Indian Institute of Science, Department of Electrical Engineering, Bangalore, India
2020 – today
- 2024
- [c163]Abhayjeet Singh, Amala Nagireddi, Deekshitha G, Jesuraja Bandekar, Roopa R., Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A. Murthy, Pranaw Kumar, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich:
LIMMITS'24: Multi-Speaker, Multi-Lingual Indic TTS with Voice Cloning. ICASSP Workshops 2024: 61-62 - [c162]Shivani Yadav, Dipanjan Gope, K. Uma Maheswari, Prasanta Kumar Ghosh:
An Unsupervised Segmentation of Vocal Breath Sounds. ICASSP 2024: 9891-9895 - [c161]Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Seena Vengalil, Saraswati Nashi, Madassu Keerthipriya, Yamini Belur, Atchayaram Nalini, Prasanta Kumar Ghosh:
Spectral Analysis of Vowels and Fricatives at Varied Levels of Dysarthria Severity for Amyotrophic Lateral Sclerosis. ICASSP 2024: 12767-12771 - 2023
- [c160]Sathvik Udupa, Jesuraja Bandekar, Deekshitha G, Saurabh Kumar, Prasanta Kumar Ghosh, Sandhya Badiger, Abhayjeet Singh, Savitha Murthy, Priyanka Pai, Srinivasa Raghavan K. M., Raoul Nanavati:
Gated Multi Encoders and Multitask Objectives for Dialectal Speech Recognition in Indian Languages. ASRU 2023: 1-8 - [c159]Tanuka Bhattacharjee, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Prasanta Kumar Ghosh:
Exploring the Role of Fricatives in Classifying Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis and Parkinson's Disease. ICASSP 2023: 1-5 - [c158]Tanuka Bhattacharjee, Chowdam Venkata Thirumala Kumar, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Prasanta Kumar Ghosh:
Static and Dynamic Source and Filter Cues for Classification of Amyotrophic Lateral Sclerosis Patients and Healthy Subjects. ICASSP 2023: 1-5 - [c157]Abhayjeet Singh, Amala Nagireddi, Deekshitha G, Jesuraja Bandekar, Roopa R., Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A. Murthy, Heiga Zen, Pranaw Kumar, Kamal Kant, Amol Bole, Bira Chandra Singh, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich:
Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech. ICASSP 2023: 1-2 - [c156]Sathvik Udupa, Prasanta Kumar Ghosh:
Real-Time MRI Video Synthesis from Time Aligned Phonemes with Sequence-to-Sequence Networks. ICASSP 2023: 1-5 - [c155]Sathvik Udupa, C. Siddarth, Prasanta Kumar Ghosh:
Improved Acoustic-to-Articulatory Inversion Using Representations from Pretrained Self-Supervised Learning Models. ICASSP 2023: 1-5 - [c154]Chowdam Venkata Thirumala Kumar, Tanuka Bhattacharjee, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Prasanta Kumar Ghosh:
Classification of Multi-class Vowels and Fricatives From Patients Having Amyotrophic Lateral Sclerosis with Varied Levels of Dysarthria Severity. INTERSPEECH 2023: 146-150 - [c153]Tanuka Bhattacharjee, Anjali Jayakumar, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Prasanta Kumar Ghosh:
Transfer Learning to Aid Dysarthria Severity Classification for Patients with Amyotrophic Lateral Sclerosis. INTERSPEECH 2023: 1543-1547 - [c152]Varun Belagali, M. V. Achuth Rao, Prasanta Kumar Ghosh:
Weakly supervised glottis segmentation in high-speed videoendoscopy using bounding box labels. INTERSPEECH 2023: 1578-1582 - [c151]Shelly Jain, Priyanshi Pal, Anil Kumar Vuppala, Prasanta Kumar Ghosh, Chiranjeevi Yarra:
An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations. INTERSPEECH 2023: 2608-2612 - [c150]Siddarth Chandrasekar, Arvind Ramesh, Tilak Purohit, Prasanta Kumar Ghosh:
A Study on the Importance of Formant Transitions for Stop-Consonant Classification in VCV Sequence. INTERSPEECH 2023: 4518-4522 - [c149]Jesuraja Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh:
Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion. INTERSPEECH 2023: 5147-5151 - [c148]Mohammad Shaique Solanki, Ashutosh Bharadwaj, Jeevan Kylash, Prasanta Kumar Ghosh:
Do Vocal Breath Sounds Encode Gender Cues for Automatic Gender Classification? INTERSPEECH 2023: 5406-5410 - [c147]Abhayjeet Singh, Charu Shah, Rajashri Varadaraj, Sonakshi Chauhan, Prasanta Kumar Ghosh:
SPIRE-SIES: A Spontaneous Indian English Speech Corpus. O-COCOSDA 2023: 1-6 - [c146]Chowdam Venkata Thirumala Kumar, Meenakshi Sirigiraju, Rakesh Vaideeswaran, Prasanta Kumar Ghosh, Chiranjeevi Yarra:
Can the decoded text from automatic speech recognition effectively detect spoken grammar errors? SLaTE 2023: 41-45 - [c145]Abhayjeet Singh, Anjali Jayakumar, Deekshitha G, Hitesh Kumar, Jesuraja Bandekar, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh:
An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian Language. SPECOM (2) 2023: 164-172 - [c144]Abhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K. S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Sai Praneeth Reddy Mora, Srinivasa Raghavan K. M.:
An ASR Corpus in Chhattisgarhi, a Low Resource Indian Language. SPECOM (2) 2023: 173-181 - [c143]Navneet Kaur, Prasanta Kumar Ghosh:
Curriculum Learning Based Approach for Faster Convergence of TTS Model. SPECOM (2) 2023: 208-221 - [c142]Priyanshi Pal, Shelly Jain, Chiranjeevi Yarra, Prasanta Kumar Ghosh, Anil Kumar Vuppala:
Study of Indian English Pronunciation Variabilities Relative to Received Pronunciation. SPECOM (1) 2023: 339-349 - [i14]Abhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K. S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Savitha, Prasanta Kumar Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Rohan Saxena, Sai Praneeth Reddy Mora, Srinivasa Raghavan K. M.:
Model Adaptation for ASR in low-resource Indian Languages. CoRR abs/2307.07948 (2023) - 2022
- [j25]Prasanta Kumar Ghosh, Amalesh Kumar Manna, Jayanta Kumar Dey, Samarjit Kar:
A deteriorating food preservation supply chain model with downstream delayed payment and upstream partial prepayment. RAIRO Oper. Res. 56(1): 331-348 (2022) - [j24]Chiranjeevi Yarra, Prasanta Kumar Ghosh:
Automatic syllable stress detection under non-parallel label and data condition. Speech Commun. 138: 80-87 (2022) - [c141]Siddharth Subramani, Achuth Rao M. V, Anwesha Roy, Prasanna Suresh Hegde, Prasanta Kumar Ghosh:
SegNet-Based Deep Representation Learning for Dysphagia Classification. ICASSP 2022: 1141-1145 - [c140]Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh:
An Error Correction Scheme for Improved Air-Tissue Boundary in Real-Time MRI Video for Speech Production. ICASSP 2022: 8247-8251 - [c139]Aravind Illa, Aanish Nair, Prasanta Kumar Ghosh:
The impact of cross language on acoustic-to-articulatory inversion and its influence on articulatory speech synthesis. ICASSP 2022: 8267-8271 - [c138]Abinay Reddy Naini, Bhavuk Singhal, Prasanta Kumar Ghosh:
Dual Attention Pooling Network for Recording Device Classification Using Neutral and Whispered Speech. ICASSP 2022: 8487-8491 - [c137]Sathvik Udupa, Aravind Illa, Prasanta Kumar Ghosh:
Streaming model for Acoustic to Articulatory Inversion with transformer networks. INTERSPEECH 2022: 625-629 - [c136]Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh:
Air tissue boundary segmentation using regional loss in real-time Magnetic Resonance Imaging video for speech production. INTERSPEECH 2022: 3113-3117 - [c135]Anish Bhanushali, Grant Bridgman, Deekshitha G, Prasanta Kumar Ghosh, Pratik Kumar, Saurabh Kumar, Adithya Raj Kolladath, Nithya Ravi, Aaditeshwar Seth, Ashish Seth, Abhayjeet Singh, Vrunda N. Sukhadia, Srinivasan Umesh, Sathvik Udupa, Lodagala V. S. V. Durga Prasad:
Gram Vaani ASR Challenge on spontaneous telephone speech recordings in regional variations of Hindi. INTERSPEECH 2022: 3548-3552 - [c134]C. Siddarth, Sathvik Udupa, Prasanta Kumar Ghosh:
Watch Me Speak: 2D Visualization of Human Mouth during Speech. INTERSPEECH 2022: 3667-3668 - [c133]Abinay Reddy Naini, Achuth Rao M. V, Prasanta Kumar Ghosh:
Whisper to Neutral Mapping Using I-Vector Space Likelihood and a Cosine Similarity Based Iterative Optimization for Whispered Speaker Verification. NCC 2022: 130-135 - [c132]Priyanshi Pal, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
Voistutor 2.0: A Speech Corpus with Phonetic Transcription for Pronunciation Evaluation of Indian L2 English Learners. O-COCOSDA 2022: 1-6 - [i13]Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh:
An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production. CoRR abs/2203.06004 (2022) - [i12]Priyanshi Pal, Shelly Jain, Anil Vuppala, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
Study of Indian English Pronunciation Variabilities relative to Received Pronunciation. CoRR abs/2204.06502 (2022) - [i11]Sathvik Udupa, C. Siddarth, Prasanta Kumar Ghosh:
Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models. CoRR abs/2210.16871 (2022) - [i10]Sathvik Udupa, Prasanta Kumar Ghosh:
Real-Time MRI Video synthesis from time aligned phonemes with sequence-to-sequence networks. CoRR abs/2210.16881 (2022) - [i9]Shelly Jain, Priyanshi Pal, Anil Vuppala, Prasanta Kumar Ghosh, Chiranjeevi Yarra:
An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations. CoRR abs/2212.09284 (2022) - 2021
- [j23]Renuka Mannem, Prasanta Kumar Ghosh:
A deep neural network based correction scheme for improved air-tissue boundary prediction in real-time magnetic resonance imaging video. Comput. Speech Lang. 66: 101160 (2021) - [j22]Aparna Srinivasan, Diviya Singh, Chiranjeevi Yarra, Aravind Illa, Prasanta Kumar Ghosh:
A Robust Speaking Rate Estimator Using a CNN-BLSTM Network. Circuits Syst. Signal Process. 40(12): 6098-6120 (2021) - [c131]Achuth Rao M. V, Shailesh BG, Drishti Ramesh Megalmani, Satish S. Jeevannavar, Prasanta Kumar Ghosh:
Noise Robust Detection of Fundamental Heart Sound using Parametric Mixture Gaussian and Dynamic Programming. EMBC 2021: 695-699 - [c130]Drishti Ramesh Megalmani, Shailesh B. G, Achuth Rao M. V, Satish S. Jeevannavar, Prasanta Kumar Ghosh:
Unsegmented Heart Sound Classification Using Hybrid CNN-LSTM Neural Networks. EMBC 2021: 713-717 - [c129]Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh:
Role of breath phase and breath boundaries for the classification between asthmatic and healthy subjects. EMBC 2021: 870-873 - [c128]Tilak Purohit, Achuth Rao M. V, Prasanta Kumar Ghosh:
Impact of Speaking Rate on the Source Filter Interaction in Speech: A Study. ICASSP 2021: 6448-6452 - [c127]Sarthak Kumar Maharana, Aravind Illa, Renuka Mannem, Yamini Belur, Preetie Shetty, Preethish-Kumar Veeramani, Seena Vengalil, Kiran Polavarapu, Atchayaram Nalini, Prasanta Kumar Ghosh:
Acoustic-to-Articulatory Inversion for Dysarthric Speech by Using Cross-Corpus Acoustic-Articulatory Data. ICASSP 2021: 6458-6462 - [c126]Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh:
Effect of Noise and Model Complexity on Detection of Amyotrophic Lateral Sclerosis and Parkinson's Disease Using Pitch and MFCC. ICASSP 2021: 7313-7317 - [c125]Manthan Sharma, Navaneetha Gaddam, Tejas Umesh, Aditya Murthy, Prasanta Kumar Ghosh:
A Comparative Study of Different EMG Features for Acoustics-to-EMG Mapping. Interspeech 2021: 616-620 - [c124]Ananya Muguli, Lancelot Pinto, Nirmala R., Neeraj Kumar Sharma, Prashant Krishnan V, Prasanta Kumar Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda:
DiCOVA Challenge: Dataset, Task, and Baseline System for COVID-19 Diagnosis Using Acoustics. Interspeech 2021: 901-905 - [c123]Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
Estimating Articulatory Movements in Speech Production with Transformer Networks. Interspeech 2021: 1154-1158 - [c122]Chiranjeevi Yarra, Prasanta Kumar Ghosh:
Noise Robust Pitch Stylization Using Minimum Mean Absolute Error Criterion. Interspeech 2021: 1174-1178 - [c121]Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, Srinivasa Raghavan K. M., Shreya Khare, Vinit Unni, Saurabh Vyas, Akash Rajpuria, Chiranjeevi Yarra, Ashish R. Mittal, Prasanta Kumar Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati, Karthik Sankaranarayanan:
MUCS 2021: Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages. Interspeech 2021: 2446-2450 - [c120]Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh:
Source and Vocal Tract Cues for Speech-Based Classification of Patients with Parkinson's Disease and Healthy Subjects. Interspeech 2021: 2961-2965 - [c119]Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
Web Interface for Estimating Articulatory Movements in Speech Production from Acoustics and Text. Interspeech 2021: 4868-4869 - [c118]Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh:
Convolutional Dense Neural Network Based Spirometry Variable FVC Prediction Using Sustained Phonations. MLSP 2021: 1-6 - [c117]Abhayjeet Singh, Achuth Rao MV, Rakesh Vaideeswaran, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
A Study on Native American English Speech Recognition by Indian Listeners with Varying Word Familiarity Level. O-COCOSDA 2021: 13-18 - [c116]Tilak Purohit, Tejas Umesh, Shankar Narayanan, Minulakshmi S, Prasanta Kumar Ghosh:
SPIRE VCV: An Acoustic-Articulatory Corpus with Three Different Speaking Rates. O-COCOSDA 2021: 116-121 - [c115]Bhavuk Singhal, Abinay Reddy Naini, Prasanta Kumar Ghosh:
wSPIRE: A Parallel Multi-Device Corpus in Neutral and Whispered Speech. O-COCOSDA 2021: 146-151 - [i8]Ananya Muguli, Lancelot Pinto, Nirmala R., Neeraj Kumar Sharma, Prashant Krishnan V, Prasanta Kumar Ghosh, Rohit Kumar, Shreyas Ramoji, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Viral Nanda:
DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics. CoRR abs/2103.09148 (2021) - [i7]Anuj Diwan, Rakesh Vaideeswaran, Sanket Shah, Ankita Singh, Srinivasa Raghavan K. M., Shreya Khare, Vinit Unni, Saurabh Vyas, Akash Rajpuria, Chiranjeevi Yarra, Ashish R. Mittal, Prasanta Kumar Ghosh, Preethi Jyothi, Kalika Bali, Vivek Seshadri, Sunayana Sitaram, Samarth Bharadwaj, Jai Nanavati, Raoul Nanavati, Karthik Sankaranarayanan, Tejaswi Seeram, Basil Abraham:
Multilingual and code-switching ASR challenges for low resource Indian languages. CoRR abs/2104.00235 (2021) - [i6]Sathvik Udupa, Anwesha Roy, Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
Estimating articulatory movements in speech production with transformer networks. CoRR abs/2104.05017 (2021) - [i5]Srikanth Raj Chetupalli, Prashant Krishnan V, Neeraj Kumar Sharma, Ananya Muguli, Rohit Kumar, Viral Nanda, Lancelot Mark Pinto, Prasanta Kumar Ghosh, Sriram Ganapathy:
Multi-modal Point-of-Care Diagnostics for COVID-19 Based On Acoustics and Symptoms. CoRR abs/2106.00639 (2021) - [i4]Abhayjeet Singh, Achuth Rao M. V, Rakesh Vaideeswaran, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
A study on native American English speech recognition by Indian listeners with varying word familiarity level. CoRR abs/2112.04151 (2021) - 2020
- [j21]Aravind Illa, Prasanta Kumar Ghosh:
The impact of speaking rate on acoustic-to-articulatory inversion. Comput. Speech Lang. 59: 75-90 (2020) - [j20]Achuth Rao M. V, Prasanta Kumar Ghosh:
SFNet: A Computationally Efficient Source Filter Model Based Neural Speech Synthesis. IEEE Signal Process. Lett. 27: 1170-1174 (2020) - [c114]Siddharth Subramani, M. V. Achuth Rao, Divya Giridhar, Prasanna Suresh Hegde, Prasanta Kumar Ghosh:
Automatic Classification of Volumes of Water Using Swallow Sounds from Cervical Auscultation. ICASSP 2020: 1185-1189 - [c113]Sanjeev Kadagathur Vadiraj, Achuth Rao M. V, Prasanta Kumar Ghosh:
Automatic Identification of Speakers From Head Gestures in a Narration. ICASSP 2020: 6314-6318 - [c112]Jhansi Mallela, Aravind Illa, Suhas B. N., Sathvik Udupa, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh:
Voice based classification of patients with Amyotrophic Lateral Sclerosis, Parkinson's Disease and Healthy Controls with CNN-LSTM using transfer learning. ICASSP 2020: 6784-6788 - [c111]Shivani Yadav, Merugu Keerthana, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh:
Analysis of Acoustic Features for Speech Sound Based Classification of Asthmatic and Healthy Subjects. ICASSP 2020: 6789-6793 - [c110]Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
A Comparative Study of Estimating Articulatory Movements from Phoneme Sequences and Acoustic Features. ICASSP 2020: 7334-7338 - [c109]Avni Rajpal, M. V. Achuth Rao, Chiranjeevi Yarra, Ritu Aggarwal, Prasanta Kumar Ghosh:
Pseudo Likelihood Correction Technique for Low Resource Accented ASR. ICASSP 2020: 7434-7438 - [c108]Aravind Illa, Prasanta Kumar Ghosh:
Speaker Conditioned Acoustic-to-Articulatory Inversion Using x-Vectors. INTERSPEECH 2020: 1376-1380 - [c107]Renuka Mannem, Navaneetha Gaddam, Prasanta Kumar Ghosh:
Air-Tissue Boundary Segmentation in Real Time Magnetic Resonance Imaging Video Using 3-D Convolutional Neural Network. INTERSPEECH 2020: 1396-1400 - [c106]Tilak Purohit, Prasanta Kumar Ghosh:
An Investigation of the Virtual Lip Trajectories During the Production of Bilabial Stops and Nasal at Different Speaking Rates. INTERSPEECH 2020: 1401-1405 - [c105]Renuka Mannem, Hima Jyothi R., Aravind Illa, Prasanta Kumar Ghosh:
Speech Rate Task-Specific Representation Learning from Acoustic-Articulatory Data. INTERSPEECH 2020: 2892-2896 - [c104]Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
Attention and Encoder-Decoder Based Models for Transforming Articulatory Movements at Different Speaking Rates. INTERSPEECH 2020: 2907-2911 - [c103]Abinay Reddy Naini, Malla Satyapriya, Prasanta Kumar Ghosh:
Whisper Activity Detection Using CNN-LSTM Based Attention Pooling Network Trained for a Speaker Identification Task. INTERSPEECH 2020: 2922-2926 - [c102]Jhansi Mallela, Aravind Illa, Yamini Belur, Atchayaram Nalini, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh:
Raw Speech Waveform Based Classification of Patients with ALS, Parkinson's Disease and Healthy Controls Using CNN-BLSTM. INTERSPEECH 2020: 4586-4590 - [c101]Divya Degala, M. V. Achuth Rao, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadharshini, Prakash T. K., Prasanta Kumar Ghosh:
Automatic Glottis Detection and Segmentation in Stroboscopic Videos Using Convolutional Networks. INTERSPEECH 2020: 4801-4805 - [c100]Neeraj Kumar Sharma, Prashant Krishnan V, Rohit Kumar, Shreyas Ramoji, Srikanth Raj Chetupalli, Nirmala R., Prasanta Kumar Ghosh, Sriram Ganapathy:
Coswara - A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis. INTERSPEECH 2020: 4811-4815 - [c99]Renuka Mannem, Hima Jyothi, Aravind Illa, Prasanta Kumar Ghosh:
Speech rate estimation using representations learned from speech with convolutional neural network. SPCOM 2020: 1-5 - [c98]Suhas B. N., Jhansi Mallela, Aravind Illa, B. K. Yamini, Atchayaram Nalini, Ravi Yadav, Dipanjan Gope, Prasanta Kumar Ghosh:
Speech task based automatic classification of ALS and Parkinson's Disease and their severity using log Mel spectrograms. SPCOM 2020: 1-5 - [i3]Neeraj Kumar Sharma, Prashant Krishnan V, Rohit Kumar, Shreyas Ramoji, Srikanth Raj Chetupalli, Nirmala R., Prasanta Kumar Ghosh, Sriram Ganapathy:
Coswara - A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis. CoRR abs/2005.10548 (2020) - [i2]Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
Attention and Encoder-Decoder based models for transforming articulatory movements at different speaking rates. CoRR abs/2006.03107 (2020)
2010 – 2019
- 2019
- [j19]M. V. Achuth Rao, Prakhar Gupta, Prasanta Kumar Ghosh:
P- and T-wave delineation in ECG signals using parametric mixture Gaussian and dynamic programming. Biomed. Signal Process. Control. 51: 328-337 (2019) - [j18]M. V. Achuth Rao, Prasanta Kumar Ghosh:
Glottal Inverse Filtering Using Probabilistic Weighted Linear Prediction. IEEE ACM Trans. Audio Speech Lang. Process. 27(1): 114-124 (2019) - [j17]Anurendra Kumar, Tanaya Guha, Prasanta Kumar Ghosh:
Dirichlet Latent Variable Model: A Dynamic Model Based on Dirichlet Prior for Audio Processing. IEEE ACM Trans. Audio Speech Lang. Process. 27(5): 919-931 (2019) - [c97]Achuth Rao M. V, Prasanta Kumar Ghosh, Tanuka Bhattacharjee, Anirban Dutta Choudhury:
Trend Statistics Network and Channel invariant EEG Network for sleep arousal study. EMBC 2019: 5716-5722 - [c96]C. A. Valliappan, Avinash Kumar, Renuka Mannem, Girija Ramesan Karthik, Prasanta Kumar Ghosh:
An Improved Air Tissue Boundary Segmentation Technique for Real Time Magnetic Resonance Imaging Video Using Segnet. ICASSP 2019: 5921-5925 - [c95]Aravind Illa, Prasanta Kumar Ghosh:
Representation Learning Using Convolution Neural Network for Acoustic-to-articulatory Inversion. ICASSP 2019: 5931-5935 - [c94]Gokul Srinivasan, Aravind Illa, Prasanta Kumar Ghosh:
A Study on Robustness of Articulatory Features for Automatic Speech Recognition of Neutral and Whispered Speech. ICASSP 2019: 5936-5940 - [c93]Renuka Mannem, Prasanta Kumar Ghosh:
Air-tissue Boundary Segmentation in Real Time Magnetic Resonance Imaging Video Using a Convolutional Encoder-decoder Network. ICASSP 2019: 5941-5945 - [c92]Abinay Reddy Naini, M. V. Achuth Rao, Prasanta Kumar Ghosh:
Formant-gaps Features for Speaker Verification Using Whispered Speech. ICASSP 2019: 6231-6235 - [c91]Aravind Illa, Prasanta Kumar Ghosh:
An Investigation on Speaker Specific Articulatory Synthesis with Speaker Independent Articulatory Inversion. INTERSPEECH 2019: 121-125 - [c90]Manoj Kumar Ramanathi, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
ASR Inspired Syllable Stress Detection for Pronunciation Evaluation Without Using a Supervised Classifier and Syllable Level Features. INTERSPEECH 2019: 924-928 - [c89]Renuka Mannem, Jhansi Mallela, Aravind Illa, Prasanta Kumar Ghosh:
Acoustic and Articulatory Feature Based Speech Rate Estimation Using a Convolutional Dense Neural Network. INTERSPEECH 2019: 929-933 - [c88]Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
An Improved Goodness of Pronunciation (GoP) Measure for Pronunciation Evaluation with DNN-HMM System Considering HMM Transition Probabilities. INTERSPEECH 2019: 954-958 - [c87]Atreyee Saha, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
Low Resource Automatic Intonation Classification Using Gated Recurrent Unit (GRU) Networks Pre-Trained with Synthesized Pitch Patterns. INTERSPEECH 2019: 959-963 - [c86]Chiranjeevi Yarra, Aparna Srinivasan, Sravani Gottimukkala, Prasanta Kumar Ghosh:
SPIRE-fluent: A Self-Learning App for Tutoring Oral Fluency to Second Language English Learners. INTERSPEECH 2019: 968-969 - [c85]Abinay Reddy Naini, Achuth Rao M. V, Prasanta Kumar Ghosh:
Whisper to Neutral Mapping Using Cosine Similarity Maximization in i-Vector Space for Speaker Verification. INTERSPEECH 2019: 4340-4344 - [c84]Suhas B. N., Deep Patel, Nithin Rao Koluguri, Yamini Belur, Pradeep Reddy, Atchayaram Nalini, Ravi Yadav, Dipanjan Gope, Prasanta Kumar Ghosh:
Comparison of Speech Tasks and Recording Devices for Voice Based Automatic Classification of Healthy Subjects and Patients with Amyotrophic Lateral Sclerosis. INTERSPEECH 2019: 4564-4568 - [c83]Renuka Mannem, Valliappan Ca, Prasanta Kumar Ghosh:
A SegNet Based Image Enhancement Technique for Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video. NCC 2019: 1-6 - [c82]Shankar Narayanan, Aravind Illa, Nayan Anand, Ganesh Sinisetty, Karthick Narayanan, Prasanta Kumar Ghosh:
An acoustic-articulatory database of VCV sequences and words in Toda at different speaking rates. O-COCOSDA 2019: 1-6 - [c81]Chiranjeevi Yarra, Ritu Aggarwal, Avni Rajpal, Prasanta Kumar Ghosh:
Indic TIMIT and Indic English lexicon: A speech database of Indian speakers using TIMIT stimuli and a lexicon from their mispronunciations. O-COCOSDA 2019: 1-6 - [c80]Chiranjeevi Yarra, Aparna Srinivasan, Chandana Srinivasa, Ritu Aggarwal, Prasanta Kumar Ghosh:
voisTUTOR corpus: A speech corpus of Indian L2 English learners for pronunciation assessment. O-COCOSDA 2019: 1-6 - [c79]Aparna Srinivasan, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
Automatic assessment of pronunciation and its dependent factors by exploring their interdependencies using DNN and LSTM. SLaTE 2019: 30-34 - [c78]Chiranjeevi Yarra, Prasanta Kumar Ghosh:
voisTUTOR: Virtual Operator for Interactive Spoken English TUTORing. SLaTE 2019: 35-36 - [c77]Sweekar Sudhakara, Manoj Kumar Ramanathi, Chiranjeevi Yarra, Anurag Das, Prasanta Kumar Ghosh:
Noise robust goodness of pronunciation measures using teacher's utterance. SLaTE 2019: 69-73 - [c76]Chiranjeevi Yarra, Manoj Kumar Ramanathi, Prasanta Kumar Ghosh:
Comparison of automatic syllable stress detection quality with time-aligned boundaries and context dependencies. SLaTE 2019: 79-83 - [i1]Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh:
A comparative study of estimating articulatory movements from phoneme sequences and acoustic features. CoRR abs/1910.14375 (2019) - 2018
- [j16]Ashok Kumar Pattem, Aravind Illa, Amber Afshan, Prasanta Kumar Ghosh:
Optimal sensor placement in electromagnetic articulography recording for speech production study. Comput. Speech Lang. 47: 157-174 (2018) - [j15]M. V. Achuth Rao, Prasanta Kumar Ghosh:
PSFM - A Probabilistic Source Filter Model for Noise Robust Glottal Closure Instant Detection. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1645-1657 (2018) - [c75]Tanuka Bhattacharjee, Deepan Das, Shahnawaz Alam, M. V. Achuth Rao, Prasanta Kumar Ghosh, Ayush Ranjan Lohani, Rohan Banerjee, Anirban Dutta Choudhury, Arpan Pal:
SleepTight: Identifying Sleep Arousals Using Inter and Intra-Relation of Multimodal Signals. CinC 2018: 1-4 - [c74]Shivani Yadav, Kausthubha NK, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh:
Comparison of Cough, Wheeze and Sustained Phonations for Automatic Classification Between Healthy Subjects and Asthmatic Patients. EMBC 2018: 1400-1403 - [c73]Tanuka Bhattacharjee, Shreyasi Datta, Deepan Das, Anirban Dutta Choudhury, Arpan Pal, Prasanta Kumar Ghosh:
A Heart Rate Driven Kalman Filter for Continuous Arousal Trend Monitoring. EMBC 2018: 3572-3577 - [c72]Raseena K. T, Prasanta Kumar Ghosh:
A Maximum Likelihood Formulation To Exploit Heart Rate Variability for Robust Heart Rate Estimation From Facial Video. EMBC 2018: 5191-5194 - [c71]Urvish Desai, Chiranjeevi Yarra, Prasanta Kumar Ghosh:
Concatenative Articulatory Video Synthesis Using Real-Time MRI Data for Spoken Language Training. ICASSP 2018: 4999-5003 - [c70]Advait Koparkar, Prasanta Kumar Ghosh:
A Supervised Air-Tissue Boundary Segmentation Technique in Real-Time Magnetic Resonance Imaging Video Using a Novel Measure of Contrast and Dynamic Programming. ICASSP 2018: 5004-5008 - [c69]Pavan Karjol, M. Ajay Kumar, Prasanta Kumar Ghosh:
Speech Enhancement Using Multiple Deep Neural Networks. ICASSP 2018: 5049-5052 - [c68]Girija Ramesan Karthik, Prasanta Kumar Ghosh:
Binaural Speech Source Localization Using Template Matching of Interaural Time Difference Patterns. ICASSP 2018: 5164-5168 - [c67]Aravind Illa, Deep Patel, B. K. Yamini, Meera SS, N. Shivashankar, Preethish-Kumar Veeramani, Seena Vengalil, Kiran Polavarapu, Saraswati Nashi, Atchayaram Nalini, Prasanta Kumar Ghosh:
Comparison of Speech Tasks for Automatic Classification of Patients with Amyotrophic Lateral Sclerosis and Healthy Subjects. ICASSP 2018: 6014-6018 - [c66]G. Nisha Meenakshi, Prasanta Kumar Ghosh:
Whispered Speech to Neutral Speech Conversion Using Bidirectional LSTMs. INTERSPEECH 2018: 491-495 - [c65]Anand P. A, Chiranjeevi Yarra, N. K. Kausthubha, Prasanta Kumar Ghosh:
Intonation tutor by SPIRE (In-SPIRE): An Online Tool for an Automatic Feedback to the Second Language Learners in Learning Intonation. INTERSPEECH 2018: 546-547 - [c64]Girija Ramesan Karthik, Parth Suresh, Prasanta Kumar Ghosh:
Subband Weighting for Binaural Speech Source Localization. INTERSPEECH 2018: 861-865 - [c63]Abinay Reddy Naini, M. V. Achuth Rao, G. Nisha Meenakshi, Prasanta Kumar Ghosh:
Reconstructing Neutral Speech from Tracheoesophageal Speech. INTERSPEECH 2018: 1541-1545 - [c62]Chiranjeevi Yarra, Anand P. A, N. K. Kausthubha, Prasanta Kumar Ghosh:
SPIRE-SST: An Automatic Web-based Self-learning Tool for Syllable Stress Tutoring (SST) to the Second Language Learners. INTERSPEECH 2018: 2390-2391 - [c61]Astha Singh, G. Nisha Meenakshi, Prasanta Kumar Ghosh:
Relating Articulatory Motions in Different Speaking Rates. INTERSPEECH 2018: 2992-2996 - [c60]M. V. Achuth Rao, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadharshini, Prasanta Kumar Ghosh:
Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network. INTERSPEECH 2018: 3007-3011 - [c59]Aravind Illa, Prasanta Kumar Ghosh:
Low Resource Acoustic-to-articulatory Inversion Using Bi-directional Long Short Term Memory. INTERSPEECH 2018: 3122-3126 - [c58]Chandana Srinivasan, Chiranjeevi Yarra, Ritu Aggarwal, Sanjeev Kumar Mittal, N. K. Kausthubha, Raseena K. T, Astha Singh, Prasanta Kumar Ghosh:
Automatic Visual Augmentation for Concatenation Based Synthesized Articulatory Videos from Real-time MRI Data for Spoken Language Training. INTERSPEECH 2018: 3127-3131 - [c57]C. A. Valliappan, Renuka Mannem, Prasanta Kumar Ghosh:
Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video Using Semantic Segmentation with Fully Convolutional Networks. INTERSPEECH 2018: 3132-3136 - [c56]Pavan Karjol, Prasanta Kumar Ghosh:
Speech Enhancement Using Deep Mixture of Experts Based on Hard Expectation Maximization. INTERSPEECH 2018: 3254-3258 - [c55]C. A. Valliappan, Anurag Das, Prasanta Kumar Ghosh:
Classification of story-telling and poem recitation using head gesture of the talker. SPCOM 2018: 36-40 - [c54]Pavan Karjol, Prasanta Kumar Ghosh:
Broad Phoneme Class Specific Deep Neural Network Based Speech Enhancement. SPCOM 2018: 372-376 - 2017
- [j14]Nithin Rao Koluguri, G. Nisha Meenakshi, Prasanta Kumar Ghosh:
Spectrogram Enhancement Using Multiple Window Savitzky-Golay (MWSG) Filter for Robust Bird Sound Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1183-1192 (2017) - [c53]M. V. Achuth Rao, N. K. Kausthubha, Shivani Yadav, Dipanjan Gope, Uma Maheswari Krishnaswamy, Prasanta Kumar Ghosh:
Automatic prediction of spirometry readings from cough and wheeze for monitoring of asthma severity. EUSIPCO 2017: 41-45 - [c52]Samik Sadhu, Prasanta Kumar Ghosh:
Low resource point process models for keyword spotting using unsupervised online learning. EUSIPCO 2017: 538-542 - [c51]M. V. Achuth Rao, Prasanta Kumar Ghosh:
Pitch prediction from Mel-generalized cepstrum - a computationally efficient pitch modeling approach for speech synthesis. EUSIPCO 2017: 1629-1633 - [c50]Aravind Illa, Nisha Meenakshi, Prasanta Kumar Ghosh:
A comparative study of acoustic-to-articulatory inversion for neutral and whispered speech. ICASSP 2017: 5075-5079 - [c49]Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh:
Automatic detection of syllable stress using sonority based prominence features for pronunciation evaluation. ICASSP 2017: 5845-5849 - [c48]Gaurav Fotedar, Prasanta Kumar Ghosh:
An Information Theoretic Analysis of the Temporal Synchrony Between Head Gestures and Prosodic Patterns in Spontaneous Speech. INTERSPEECH 2017: 157-161 - [c47]G. Nisha Meenakshi, Prasanta Kumar Ghosh:
A Robust Voiced/Unvoiced Phoneme Classification from Whispered Speech Using the 'Color' of Whispered Phonemes and Deep Neural Network. INTERSPEECH 2017: 503-507 - [c46]Girija Ramesan Karthik, Prasanta Kumar Ghosh:
Subband Selection for Binaural Speech Source Localization. INTERSPEECH 2017: 1929-1933 - [c45]Akshay Kalkunte Suresh, Srinivasa Raghavan K. M., Prasanta Kumar Ghosh:
Phoneme State Posteriorgram Features for Speech Based Automatic Classification of Speakers in Cold and Healthy Condition. INTERSPEECH 2017: 3462-3466 - [c44]M. V. Achuth Rao, Shivani Yadav, Prasanta Kumar Ghosh:
A Dual Source-Filter Model of Snore Audio for Snorer Group Classification. INTERSPEECH 2017: 3502-3506 - [c43]Abhishek Narwekar, Prasanta Kumar Ghosh:
PRAV: A Phonetically Rich Audio Visual Corpus. INTERSPEECH 2017: 3747-3751 - [c42]Srinivasa Raghavan K. M., Nisha Meenakshi, Sanjeev Kumar Mittal, Chiranjeevi Yarra, Anupam Mandal, K. R. Prasanna Kumar, Prasanta Kumar Ghosh:
A comparative study on the effect of different codecs on speech recognition accuracy using various acoustic modeling techniques. NCC 2017: 1-6 - [c41]H. S. Mekhala, Yamini Belur Keshavaprasad, J. Ketan, Pramod Kumar Pal, N. Shivashankar, Prasanta Kumar Ghosh:
Classification of healthy subjects and patients with essential vocal tremor using empirical mode decomposition of high resolution pitch contour. NCC 2017: 1-6 - [c40]M. V. Achuth Rao, Prasanta Kumar Ghosh:
Pitch prediction from Mel-frequency cepstral coefficients using sparse spectrum recovery. NCC 2017: 1-6 - [c39]Pradyumna B. Suresha, Supriya Nagesh, Priyadarshini Savan Roshan, Aditya Gaonkar P., G. Nisha Meenakshi, Prasanta Kumar Ghosh:
A high resolution ENF based multi-stage classifier for location forensics of media recordings. NCC 2017: 1-6 - 2016
- [j13]Ming Li, Jangwon Kim, Adam C. Lammert, Prasanta Kumar Ghosh, Vikram Ramanarayanan, Shrikanth S. Narayanan:
Speaker verification based on the fusion of speech acoustics and inverted articulatory signals. Comput. Speech Lang. 36: 196-211 (2016) - [j12]Abhay Prasad, Prasanta Kumar Ghosh:
Information theoretic optimal vocal tract region selection from real time magnetic resonance images for broad phonetic class recognition. Comput. Speech Lang. 39: 108-128 (2016) - [j11]Chiranjeevi Yarra, Om D. Deshmukh, Prasanta Kumar Ghosh:
A mode-shape classification technique for robust speech rate estimation and syllable nuclei detection. Speech Commun. 78: 62-71 (2016) - [j10]A. P. Prathosh, P. Sujith, A. G. Ramakrishnan, Prasanta Kumar Ghosh:
Cumulative Impulse Strength for Epoch Extraction. IEEE Signal Process. Lett. 23(4): 424-428 (2016) - [c38]Amber Afshan, Prasanta Kumar Ghosh:
Better acoustic normalization in subject independent acoustic-to-articulatory inversion: Benefit to recognition. ICASSP 2016: 5395-5399 - [c37]Supriya Nagesh, Chiranjeevi Yarra, Om Deshmukh, Prasanta Kumar Ghosh:
A robust speech rate estimation based on the activation profile from the selected acoustic unit dictionary. ICASSP 2016: 5400-5404 - [c36]Gaurav Fotedar, Aditya Gaonkar P., Saikat Chatterjee, Prasanta Kumar Ghosh:
Automatic Recognition of Social Roles Using Long Term Role Transitions in Small Group Interactions. INTERSPEECH 2016: 2065-2069 - [c35]Nazreen P. M., A. G. Ramakrishnan, Prasanta Kumar Ghosh:
A Class-Specific Speech Enhancement for Phoneme Recognition: A Dictionary Learning Approach. INTERSPEECH 2016: 3728-3732 - 2015
- [j9]Amber Afshan, Prasanta Kumar Ghosh:
Improved subject-independent acoustic-to-articulatory inversion. Speech Commun. 66: 1-16 (2015) - [j8]Nisha Meenakshi, Prasanta Kumar Ghosh:
Robust Whisper Activity Detection Using Long-Term Log Energy Variation of Sub-Band Signal. IEEE Signal Process. Lett. 22(11): 1859-1863 (2015) - [j7]Navaneet K. Lakshminarasimha Murthy, Pavan C. Madhusudana, Pradyumna Suresha, Vijitha Periyasamy, Prasanta Kumar Ghosh:
Multiple Spectral Peak Tracking for Heart Rate Monitoring from Photoplethysmography Signal During Intensive Physical Exercise. IEEE Signal Process. Lett. 22(12): 2391-2395 (2015) - [c34]Adria Casamitjana, Martin Sundin, Prasanta Kumar Ghosh, Saikat Chatterjee:
Bayesian learning for time-varying linear prediction of speech. EUSIPCO 2015: 325-329 - [c33]Abhay Prasad, Vijitha Periyasamy, Prasanta Kumar Ghosh:
Estimation of the invariant and variant characteristics in speech articulation and its application to speaker identification. ICASSP 2015: 4265-4269 - [c32]G. Nisha Meenakshi, Prasanta Kumar Ghosh:
A discriminative analysis within and across voiced and unvoiced consonants in neutral and whispered speech in multiple Indian languages. INTERSPEECH 2015: 781-785 - [c31]Abhay Prasad, Prasanta Kumar Ghosh:
Automatic classification of eating conditions from speech using acoustic feature selection and a set of hierarchical support vector machine classifiers. INTERSPEECH 2015: 884-888 - [c30]Satyabrata Parida, Ashok Kumar Pattem, Prasanta Kumar Ghosh:
Estimation of the air-tissue boundaries of the vocal tract in the mid-sagittal plane from electromagnetic articulograph data. INTERSPEECH 2015: 2147-2151 - [c29]P. Sujith, A. P. Prathosh, A. G. Ramakrishnan, Prasanta Kumar Ghosh:
An error correction scheme for GCI detection algorithms using pitch smoothness criterion. INTERSPEECH 2015: 3284-3288 - [c28]G. Nisha Meenakshi, Prasanta Kumar Ghosh:
Automatic gender classification using the mel frequency cepstrum of neutral and whispered speech: A comparative study. NCC 2015: 1-6 - 2014
- [c27]Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Classification of clean and noisy bilingual movie audio for speech-to-speech translation corpora design. ICASSP 2014: 121-125 - [c26]P. Sujith, Prasanta Kumar Ghosh:
Maximum a-posteriori estimation of missing samples with continuity constraint in Electromagnetic Articulography data. ICASSP 2014: 940-944 - [c25]M. N. Abhijith, Prasanta Kumar Ghosh, K. Rajgopal:
Multi-pitch tracking using Gaussian mixture model with time varying parameters and Grating Compression Transform. ICASSP 2014: 1473-1477 - [c24]Prasad Sudhakar, Laurent Jacques, Prasanta Kumar Ghosh:
A sparse smoothing approach for Gaussian Mixture Model based Acoustic-to-Articulatory Inversion. ICASSP 2014: 3032-3036 - [c23]Prasad Sudhakar, Prasanta Kumar Ghosh:
Sparse smoothing of articulatory features from Gaussian mixture model based acoustic-to-articulatory inversion: benefit to speech recognition. INTERSPEECH 2014: 169-173 - [c22]P. Sujith, Prasanta Kumar Ghosh:
Missing samples estimation in electromagnetic articulography data using equality constrained Kalman smoother. INTERSPEECH 2014: 716-720 - [c21]Nisha Meenakshi, Chiranjeevi Yarra, B. K. Yamini, Prasanta Kumar Ghosh:
Comparison of speech quality with and without sensors in electromagnetic articulograph AG 501 recording. INTERSPEECH 2014: 935-939 - [c20]Abhay Prasad, Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection. INTERSPEECH 2014: 1539-1543 - 2013
- [j6]Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
High-quality bilingual subtitle document alignments with application to spontaneous speech translation. Comput. Speech Lang. 27(2): 572-591 (2013) - [c19]Jangwon Kim, Adam C. Lammert, Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Spatial and temporal alignment of multimodal human speech production data: Real time imaging, flesh point tracking and audio. ICASSP 2013: 3637-3641 - [c18]Andreas Tsiartas, Theodora Chaspari, Nassos Katsamanis, Prasanta Kumar Ghosh, Ming Li, Maarten Van Segbroeck, Alexandros Potamianos, Shrikanth S. Narayanan:
Multi-band long-term signal variability features for robust voice activity detection. INTERSPEECH 2013: 718-722 - [c17]Ming Li, Jangwon Kim, Prasanta Kumar Ghosh, Vikram Ramanarayanan, Shrikanth S. Narayanan:
Speaker verification based on fusion of acoustic and articulatory information. INTERSPEECH 2013: 1614-1618 - [c16]Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Information theoretic acoustic feature selection for acoustic-to-articulatory inversion. INTERSPEECH 2013: 3177-3181 - 2012
- [c15]Jangwon Kim, Prasanta Kumar Ghosh, Sungbok Lee, Shrikanth S. Narayanan:
A study of emotional information present in articulatory movements estimated using acoustic-to-articulatory inversion. APSIPA 2012: 1-4 - [c14]Vikram Ramanarayanan, Prasanta Kumar Ghosh, Adam C. Lammert, Shrikanth S. Narayanan:
Exploiting speech production information for automatic speech and speaker modeling and recognition - possibilities and new opportunities. APSIPA 2012: 1-6 - 2011
- [j5]Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter. Speech Commun. 53(1): 98-109 (2011) - [j4]Prasanta Kumar Ghosh, Andreas Tsiartas, Shrikanth S. Narayanan:
Robust Voice Activity Detection Using Long-Term Signal Variability. IEEE Trans. Speech Audio Process. 19(3): 600-613 (2011) - [c13]Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
A subject-independent acoustic-to-articulatory inversion. ICASSP 2011: 4624-4627 - [c12]Bo Xiao, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Overlapped speech detection using long-term spectro-temporal similarity in stereo recording. ICASSP 2011: 5216-5219 - [c11]Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Bilingual audio-subtitle extraction using automatic segmentation of movie audio. ICASSP 2011: 5624-5627 - [c10]Shrikanth S. Narayanan, Erik Bresch, Prasanta Kumar Ghosh, Louis Goldstein, Athanasios Katsamanis, Yoon Kim, Adam C. Lammert, Michael I. Proctor, Vikram Ramanarayanan, Yinghua Zhu:
A Multimodal Real-Time MRI Articulatory Corpus for Speech Research. INTERSPEECH 2011: 837-840 - [c9]Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Analysis of Inter-Articulator Correlation in Acoustic-to-Articulatory Inversion Using Generalized Smoothness Criterion. INTERSPEECH 2011: 2685-2688 - 2010
- [j3]Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Bark Frequency Transform Using an Arbitrary Order Allpass Filter. IEEE Signal Process. Lett. 17(6): 543-546 (2010) - [c8]Prasanta Kumar Ghosh, Andreas Tsiartas, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Robust voice activity detection in stereo recording with crosstalk. INTERSPEECH 2010: 3098-3101
2000 – 2009
- 2009
- [j2]Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Pitch Contour Stylization Using an Optimal Piecewise Polynomial Approximation. IEEE Signal Process. Lett. 16(9): 810-813 (2009) - [c7]Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Robust word boundary detection in spontaneous speech using acoustic and lexical cues. ICASSP 2009: 4785-4788 - [c6]Andreas Tsiartas, Prasanta Kumar Ghosh, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Context-driven automatic bilingual movie subtitle alignment. INTERSPEECH 2009: 444-447 - [c5]Prasanta Kumar Ghosh, Shrikanth S. Narayanan, Pierre L. Divenyi, Louis Goldstein, Elliot Saltzman:
Estimation of articulatory gesture patterns from speech acoustics. INTERSPEECH 2009: 2803-2806 - 2008
- [c4]Sankaranarayanan Ananthakrishnan, Prasanta Kumar Ghosh, Shrikanth S. Narayanan:
Automatic classification of question turns in spontaneous speech using lexical and prosodic evidence. ICASSP 2008: 5005-5008 - 2007
- [c3]Prasanta Kumar Ghosh:
Speech Segmentation using Extrema-Based Signal Track Length Measure. ICASSP (4) 2007: 1065-1068 - [c2]Prasanta Kumar Ghosh, Antonio Ortega, Shrikanth S. Narayanan:
Pitch period estimation using multipulse model and wavelet transform. INTERSPEECH 2007: 2761-2764 - 2006
- [j1]Prasanta Kumar Ghosh, T. V. Sreenivas:
Time-varying filter interpretation of Fourier transform and its variants. Signal Process. 86(11): 3258-3263 (2006) - [c1]Prasanta Kumar Ghosh, Thippur V. Sreenivas:
Dynamic Programming Based Optimum Non-Uniform Samples For Speech Reconstruction and Coding. ICASSP (1) 2006: 1221-1224
last updated on 2024-10-07 21:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license