default search action
Xuedong Huang 0001
Person information
- affiliation: Microsoft Research, Redmond, WA, USA
Other persons with the same name
- Xuedong Huang 0002 — Civil Aviation University of China, College of Air Traffic Management, Tianjin, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c71]Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Xuemei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. NAACL-HLT (Findings) 2024: 1615-1627 - 2023
- [c70]Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code: An Integrative and Composable Multimodal Learning Framework. AAAI 2023: 10880-10890 - [c69]Pengcheng He, Baolin Peng, Song Wang, Yang Liu, Ruochen Xu, Hany Hassan, Yu Shi, Chenguang Zhu, Wayne Xiong, Michael Zeng, Jianfeng Gao, Xuedong Huang:
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization. ACL (1) 2023: 5095-5112 - [c68]Chenyang Le, Yao Qian, Long Zhou, Shujie Liu, Yanmin Qian, Michael Zeng, Xuedong Huang:
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation. NeurIPS 2023 - [i25]Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Mei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data. CoRR abs/2305.12311 (2023) - [i24]Yuwei Fang, Mahmoud Khademi, Chenguang Zhu, Ziyi Yang, Reid Pryzant, Yichong Xu, Yao Qian, Takuya Yoshioka, Lu Yuan, Michael Zeng, Xuedong Huang:
i-Code Studio: A Configurable and Composable Framework for Integrative AI. CoRR abs/2305.13738 (2023) - [i23]Chenyang Le, Yao Qian, Long Zhou, Shujie Liu, Michael Zeng, Xuedong Huang:
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation. CoRR abs/2305.14838 (2023) - 2022
- [c67]Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang:
One Model to Enhance Them All: Array Geometry Agnostic Multi-Channel Personalized Speech Enhancement. ICASSP 2022: 271-275 - [c66]Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Xiaofei Wang, Zhuo Chen, Xuedong Huang:
Personalized speech enhancement: new models and Comprehensive evaluation. ICASSP 2022: 356-360 - [c65]Yichong Xu, Chenguang Zhu, Shuohang Wang, Siqi Sun, Hao Cheng, Xiaodong Liu, Jianfeng Gao, Pengcheng He, Michael Zeng, Xuedong Huang:
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention. IJCAI 2022: 2762-2768 - [i22]Ziyi Yang, Yuwei Fang, Chenguang Zhu, Reid Pryzant, Dongdong Chen, Yu Shi, Yichong Xu, Yao Qian, Mei Gao, Yi-Ling Chen, Liyang Lu, Yujia Xie, Robert Gmyr, Noel Codella, Naoyuki Kanda, Bin Xiao, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang:
i-Code: An Integrative and Composable Multimodal Learning Framework. CoRR abs/2205.01818 (2022) - [i21]Pengcheng He, Baolin Peng, Liyang Lu, Song Wang, Jie Mei, Yang Liu, Ruochen Xu, Hany Hassan Awadalla, Yu Shi, Chenguang Zhu, Wayne Xiong, Michael Zeng, Jianfeng Gao, Xuedong Huang:
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization. CoRR abs/2208.09770 (2022) - 2021
- [c64]Yichong Xu, Chenguang Zhu, Ruochen Xu, Yang Liu, Michael Zeng, Xuedong Huang:
Fusing Context Into Knowledge Graph for Commonsense Question Answering. ACL/IJCNLP (Findings) 2021: 1201-1207 - [c63]Chengyi Wang, Yu Wu, Yao Qian, Ken'ichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang:
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data. ICML 2021: 10937-10947 - [c62]Chenguang Zhu, William Hinthorn, Ruochen Xu, Qingkai Zeng, Michael Zeng, Xuedong Huang, Meng Jiang:
Enhancing Factual Consistency of Abstractive Summarization. NAACL-HLT 2021: 718-733 - [c61]Chenguang Zhu, Ziyi Yang, Robert Gmyr, Michael Zeng, Xuedong Huang:
Leveraging Lead Bias for Zero-shot Abstractive News Summarization. SIGIR 2021: 1462-1471 - [i20]Chengyi Wang, Yu Wu, Yao Qian, Ken'ichi Kumatani, Shujie Liu, Furu Wei, Michael Zeng, Xuedong Huang:
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data. CoRR abs/2101.07597 (2021) - [i19]Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Xiaofei Wang, Zhuo Chen, Xuedong Huang:
Personalized Speech Enhancement: New Models and Comprehensive Evaluation. CoRR abs/2110.09625 (2021) - [i18]Hassan Taherian, Sefik Emre Eskimez, Takuya Yoshioka, Huaming Wang, Zhuo Chen, Xuedong Huang:
One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement. CoRR abs/2110.10330 (2021) - [i17]Lu Yuan, Dongdong Chen, Yi-Ling Chen, Noel Codella, Xiyang Dai, Jianfeng Gao, Houdong Hu, Xuedong Huang, Boxin Li, Chunyuan Li, Ce Liu, Mengchen Liu, Zicheng Liu, Yumao Lu, Yu Shi, Lijuan Wang, Jianfeng Wang, Bin Xiao, Zhen Xiao, Jianwei Yang, Michael Zeng, Luowei Zhou, Pengchuan Zhang:
Florence: A New Foundation Model for Computer Vision. CoRR abs/2111.11432 (2021) - [i16]Yichong Xu, Chenguang Zhu, Shuohang Wang, Siqi Sun, Hao Cheng, Xiaodong Liu, Jianfeng Gao, Pengcheng He, Michael Zeng, Xuedong Huang:
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention. CoRR abs/2112.03254 (2021) - 2020
- [c60]Chenguang Zhu, Ruochen Xu, Michael Zeng, Xuedong Huang:
A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining. EMNLP (Findings) 2020: 194-203 - [c59]Ziyi Yang, Chenguang Zhu, Robert Gmyr, Michael Zeng, Xuedong Huang, Eric Darve:
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising. EMNLP (Findings) 2020: 1865-1874 - [c58]Ruochen Xu, Chenguang Zhu, Yu Shi, Michael Zeng, Xuedong Huang:
Mixed-Lingual Pre-training for Cross-lingual Summarization. AACL/IJCNLP 2020: 536-541 - [i15]Ziyi Yang, Chenguang Zhu, Robert Gmyr, Michael Zeng, Xuedong Huang, Eric Darve:
TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising. CoRR abs/2001.00725 (2020) - [i14]Chenguang Zhu, William Hinthorn, Ruochen Xu, Qingkai Zeng, Michael Zeng, Xuedong Huang, Meng Jiang:
Boosting Factual Correctness of Abstractive Summarization with Knowledge Graph. CoRR abs/2003.08612 (2020) - [i13]Chenguang Zhu, Ruochen Xu, Michael Zeng, Xuedong Huang:
End-to-End Abstractive Summarization for Meetings. CoRR abs/2004.02016 (2020) - [i12]Beliz Gunel, Chenguang Zhu, Michael Zeng, Xuedong Huang:
Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization. CoRR abs/2006.15435 (2020) - [i11]Ruochen Xu, Chenguang Zhu, Yu Shi, Michael Zeng, Xuedong Huang:
Mixed-Lingual Pre-training for Cross-lingual Summarization. CoRR abs/2010.08892 (2020) - [i10]Yichong Xu, Chenguang Zhu, Ruochen Xu, Yang Liu, Michael Zeng, Xuedong Huang:
Fusing Context Into Knowledge Graph for Commonsense Reasoning. CoRR abs/2012.04808 (2020)
2010 – 2019
- 2019
- [c57]Takuya Yoshioka, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Igor Abramovski, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang:
Advances in Online Audio-Visual Meeting Transcription. ASRU 2019: 276-283 - [c56]Chenguang Zhu, Michael Zeng, Xuedong Huang:
Multi-task Learning for Natural Language Generation in Task-Oriented Dialogue. EMNLP/IJCNLP (1) 2019: 1261-1266 - [c55]Takuya Yoshioka, Dimitrios Dimitriadis, Andreas Stolcke, William Hinthorn, Zhuo Chen, Michael Zeng, Xuedong Huang:
Meeting Transcription Using Asynchronous Distant Microphones. INTERSPEECH 2019: 2968-2972 - [c54]Chenguang Zhu, Michael Zeng, Xuedong Huang:
SIM: A Slot-Independent Neural Model for Dialogue State Tracking. SIGdial 2019: 40-45 - [i9]Takuya Yoshioka, Zhuo Chen, Dimitrios Dimitriadis, William Hinthorn, Xuedong Huang, Andreas Stolcke, Michael Zeng:
Meeting Transcription Using Virtual Microphone Arrays. CoRR abs/1905.02545 (2019) - [i8]Chenguang Zhu, Michael Zeng, Xuedong Huang:
SIM: A Slot-Independent Neural Model for Dialogue State Tracking. CoRR abs/1909.11833 (2019) - [i7]Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao, Tianyan Zhou:
Advances in Online Audio-Visual Meeting Transcription. CoRR abs/1912.04979 (2019) - [i6]Chenguang Zhu, Ziyi Yang, Robert Gmyr, Michael Zeng, Xuedong Huang:
Make Lead Bias in Your Favor: A Simple and Effective Method for News Summarization. CoRR abs/1912.11602 (2019) - 2018
- [c53]Xuedong Huang:
Big Data for Speech and Language Processing. IEEE BigData 2018: 2 - [c52]Wayne Xiong, Lingfeng Wu, Fil Alleva, Jasha Droppo, Xuedong Huang, Andreas Stolcke:
The Microsoft 2017 Conversational Speech Recognition System. ICASSP 2018: 5934-5938 - [i5]Hany Hassan, Anthony Aue, Chang Chen, Vishal Chowdhary, Jonathan Clark, Christian Federmann, Xuedong Huang, Marcin Junczys-Dowmunt, William Lewis, Mu Li, Shujie Liu, Tie-Yan Liu, Renqian Luo, Arul Menezes, Tao Qin, Frank Seide, Xu Tan, Fei Tian, Lijun Wu, Shuangzhi Wu, Yingce Xia, Dongdong Zhang, Zhirui Zhang, Ming Zhou:
Achieving Human Parity on Automatic Chinese to English News Translation. CoRR abs/1803.05567 (2018) - [i4]Chenguang Zhu, Michael Zeng, Xuedong Huang:
SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering. CoRR abs/1812.03593 (2018) - 2017
- [j13]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Michael L. Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
Toward Human Parity in Conversational Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(12): 2410-2423 (2017) - [c51]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
The microsoft 2016 conversational speech recognition system. ICASSP 2017: 5255-5259 - [i3]Wayne Xiong, Lingfeng Wu, Fil Alleva, Jasha Droppo, Xuedong Huang, Andreas Stolcke:
The Microsoft 2017 Conversational Speech Recognition System. CoRR abs/1708.06073 (2017) - 2016
- [i2]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
The Microsoft 2016 Conversational Speech Recognition System. CoRR abs/1609.03528 (2016) - [i1]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
Achieving Human Parity in Conversational Speech Recognition. CoRR abs/1610.05256 (2016) - 2015
- [c50]Zhenghao Wang, Shengquan Yan, Huaming Wang, Xuedong Huang:
Large-Scale Question Answering with Joint Embedding and Proof Tree Decoding. CIKM 2015: 1783-1786 - 2014
- [j12]Xuedong Huang, James Baker, Raj Reddy:
A historical perspective of speech recognition. Commun. ACM 57(1): 94-103 (2014) - [j11]Zheng Chen, Jian-Tao Sun, Xuedong Huang:
Web Information at Your Fingertips: Paper as an Interaction Metaphor. Computer 47(3): 62-66 (2014) - 2010
- [r1]Xuedong Huang, Li Deng:
An Overview of Modern Speech Recognition. Handbook of Natural Language Processing 2010: 339-366
2000 – 2009
- 2008
- [c49]Liu Wenyin, Qing Li, Xuedong Huang:
International workshop on question answering on the web (QAWeb2008). WWW 2008: 1275-1276 - 2004
- [j10]Li Deng, Xuedong Huang:
Challenges in adopting speech recognition. Commun. ACM 47(1): 69-75 (2004) - [j9]Li Deng, Ye-Yi Wang, Kuansan Wang, Alex Acero, Hsiao-Wuen Hon, Jasha Droppo, Constantinos Boulis, Milind Mahajan, Xuedong Huang:
Speech and Language Processing for Multimodal Human-Computer Interaction. J. VLSI Signal Process. 36(2-3): 161-187 (2004) - [c48]Zhengyou Zhang, Zicheng Liu, Mike Sinclair, Alex Acero, Li Deng, Jasha Droppo, Xuedong Huang, Yanli Zheng:
Multi-sensory microphones for robust speech detection, enhancement and recognition. ICASSP (3) 2004: 781-784 - [c47]Xuedong Huang:
Enabling natural computing. ISCSLP 2004 - [c46]Zicheng Liu, Zhengyou Zhang, Alejandro Acero, Jasha Droppo, Xuedong Huang:
Direct filtering for air- and bone-conductive microphones. MMSP 2004: 363-366 - 2002
- [j8]Li Deng, Kuansan Wang, Alex Acero, Hsiao-Wuen Hon, Jasha Droppo, Constantinos Boulis, Ye-Yi Wang, Derek Jacoby, Milind Mahajan, Ciprian Chelba, Xuedong Huang:
Distributed speech processing in miPad's multimodal user interface. IEEE Trans. Speech Audio Process. 10(8): 605-619 (2002) - [c45]Li Deng, Alex Acero, Ye-Yi Wang, Kuansan Wang, Hsiao-Wuen Hon, Jasha Droppo, Milind Mahajan, Xuedong Huang:
A speech-centric perspective for human-computer interface. IEEE Workshop on Multimedia Signal Processing 2002: 263-267 - 2001
- [c44]Xuedong Huang, Alex Acero, Ciprian Chelba, Li Deng, Jasha Droppo, Doug Duchene, Joshua Goodman, Hsiao-Wuen Hon, Derek Jacoby, Li Jiang, Ricky Loynd, Milind Mahajan, Peter Mau, Scott Meredith, Salman Mughal, Salvado Neto, Mike Plumpe, Kuansan Steury, Gina Venolia, Kuansan Wang, Ye-Yi Wang:
MiPad: a multimodal interaction prototype. ICASSP 2001: 9-12 - [c43]Li Deng, Alex Acero, Li Jiang, Jasha Droppo, Xuedong Huang:
High-performance robust speech recognition using stereo training data. ICASSP 2001: 301-304 - 2000
- [c42]Ye-Yi Wang, Milind Mahajan, Xuedong Huang:
A unified context-free grammar and n-gram model for spoken language processing. ICASSP 2000: 1639-1642 - [c41]Xuedong Huang, Alex Acero, Ciprian Chelba, Li Deng, Doug Duchene, Joshua Goodman, Hsiao-Wuen Hon, Derek Jacoby, Li Jiang, Ricky Loynd, Milind Mahajan, Peter Mau, Scott Meredith, Salman Mughal, Salvado Neto, Mike Plumpe, Kuansan Wang, Ye-Yi Wang:
Mipad: a next generation PDA prototype. INTERSPEECH 2000: 33-36 - [c40]Li Jiang, Xuedong Huang:
Subword-dependent speaker clustering for improved speech recognition. INTERSPEECH 2000: 137-140 - [c39]Li Deng, Alex Acero, Mike Plumpe, Xuedong Huang:
Large-vocabulary speech recognition under adverse acoustic environments. INTERSPEECH 2000: 806-809
1990 – 1999
- 1999
- [c38]Milind Mahajan, Doug Beeferman, Xuedong Huang:
Improved topic-dependent language modeling using information retrieval techniques. ICASSP 1999: 541-544 - [c37]Li Jiang, Xuedong Huang:
Unified decoding and feature representation for improved speech recognition. EUROSPEECH 1999 - [c36]Matthew Richardson, Mei-Yuh Hwang, Alex Acero, Xuedong Huang:
Improvements on speech recognition for fast talkers. EUROSPEECH 1999: 411-414 - 1998
- [j7]Fil Alleva, Xuedong Huang, Mei-Yuh Hwang, Li Jiang:
Can continuous speech recognizers handle isolated speech? Speech Commun. 26(3): 183-189 (1998) - [c35]Hsiao-Wuen Hon, Alex Acero, Xuedong Huang, Jingsong Liu, Mike Plumpe:
Automatic generation of synthesis units for trainable text-to-speech systems. ICASSP 1998: 293-296 - [c34]Mei-Yuh Hwang, Xuedong Huang:
Dynamically configurable acoustic models for speech recognition. ICASSP 1998: 669-672 - [c33]Gregory Aist, Peggy Chan, Xuedong Huang, Li Jiang, Rebecca Kennedy, DeWitt Latimer IV, Jack Mostow, Calvin Yeung:
How effective is unsupervised data collection for children's speech recognition? ICSLP 1998 - [c32]Li Jiang, Xuedong Huang:
Vocabulary-independent word confidence measure using subword features. ICSLP 1998 - [c31]Mike Plumpe, Alex Acero, Hsiao-Wuen Hon, Xuedong Huang:
HMM-based smoothing for concatenative speech synthesis. ICSLP 1998 - 1997
- [c30]Xuedong Huang, Alex Acero, Hsiao-Wuen Hon, Yun-Cheng Ju, Jingsong Liu, Scott Meredith, Mike Plumpe:
Recent improvements on Microsoft's trainable text-to-speech system-Whistler. ICASSP 1997: 959-962 - [c29]Li Jiang, Hsiao-Wuen Hon, Xuedong Huang:
Improvements on a trainable letter-to-sound converter. EUROSPEECH 1997: 605-608 - [c28]Fil Alleva, Xuedong Huang, Mei-Yuh Hwang, Li Jiang:
Can continuous speech recognizers handle isolated speech? EUROSPEECH 1997: 911-914 - 1996
- [j6]Mei-Yuh Hwang, Xuedong Huang, Fileno A. Alleva:
Predicting unseen triphones with senones. IEEE Trans. Speech Audio Process. 4(6): 412-419 (1996) - [c27]Fil Alleva, Xuedong Huang, Mei-Yuh Hwang:
Improvements on the pronunciation prefix tree search organization. ICASSP 1996: 133-136 - [c26]Alex Acero, Xuedong Huang:
Speaker and gender normalization for continuous-density hidden Markov models. ICASSP 1996: 342-345 - [c25]Xuedong Huang, Mei-Yuh Hwang, Li Jiang, Milind Mahajan:
Deleted interpolation and density sharing for continuous hidden Markov models. ICASSP 1996: 885-888 - [c24]Xuedong Huang, Alex Acero, J. Adcock, Hsiao-Wuen Hon, John Goldsmith, Jingsong Liu, Mike Plumpe:
Whistler: a trainable text-to-speech system. ICSLP 1996: 2387-2390 - 1995
- [c23]Xuedong Huang, Alex Acero, Fil Alleva, Mei-Yuh Hwang, Li Jiang, Milind Mahajan:
Microsoft Windows highly intelligent speech recognizer: Whisper. ICASSP 1995: 93-96 - 1994
- [c22]Mei-Yuh Hwang, Ronald Rosenfeld, Eric H. Thayer, Mosur Ravishankar, Lin Lawrence Chase, Robert Weide, Xuedong Huang, Fil Alleva:
Improving speech recognition performance via phone-dependent VQ codebooks and adaptive language models in SPHINX-II. ICASSP (1) 1994: 549-552 - [c21]Xuedong Huang:
Session 2: Language Modeling. HLT 1994 - 1993
- [j5]Xuedong Huang, Fileno A. Alleva, Hsiao-Wuen Hon, Mei-Yuh Hwang, Kai-Fu Lee, Ronald Rosenfeld:
The SPHINX-II speech recognition system: an overview. Comput. Speech Lang. 7(2): 137-148 (1993) - [j4]Xuedong Huang, Hsiao-Wuen Hon, Mei-Yuh Hwang, Kai-Fu Lee:
A comparative study of discrete, semicontinuous, and continuous hidden Markov models. Comput. Speech Lang. 7(4): 359-368 (1993) - [j3]Xuedong Huang, Kai-Fu Lee:
On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition. IEEE Trans. Speech Audio Process. 1(2): 150-157 (1993) - [j2]Mei-Yuh Hwang, Xuedong Huang:
Shared-distribution hidden Markov models for speech recognition. IEEE Trans. Speech Audio Process. 1(4): 414-420 (1993) - [c20]Fil Alleva, Xuedong Huang, Mei-Yuh Hwang:
An improved search algorithm using incremental knowledge for continuous speech recognition. ICASSP (2) 1993: 307-310 - [c19]Mei-Yuh Hwang, Xuedong Huang, Fil Alleva:
Predicting unseen triphones with senones. ICASSP (2) 1993: 311-314 - [c18]Xuedong Huang, M. Belin, Fil Alleva, Mei-Yuh Hwang:
Unified stochastic engine (USE) for speech recognition. ICASSP (2) 1993: 636-639 - [c17]Mei-Yuh Hwang, Fil Alleva, Xuedong Huang:
Senones, multi-pass search, and unified stochastic modeling in sphinx-II. EUROSPEECH 1993: 2143-2146 - [c16]Xuedong Huang, Fileno A. Alleva, Mei-Yuh Hwang, Ronald Rosenfeld:
An Overview of the SPHINX-II Speech Recognition System. HLT 1993 - [c15]Fu-Hua Liu, Richard M. Stern, Xuedong Huang, Alejandro Acero:
Efficient Cepstral Normalization For Robust Speech Recognition. HLT 1993 - 1992
- [c14]Ronald Rosenfeld, Xuedong Huang, Merrick L. Furst:
Exploiting correlations among competing models with application to large vocabulary speech recognition. ICASSP 1992: 5-8 - [c13]Mei-Yuh Hwang, Xuedong Huang:
Subphonetic modeling with Markov states-Senone. ICASSP 1992: 33-36 - [c12]Xuedong Huang:
Speaker normalization for speech recognition. ICASSP 1992: 465-468 - [c11]Fil Alleva, Hsiao-Wuen Hon, Xuedong Huang, Mei-Yuh Hwang, Ronald Rosenfeld, Robert Weide:
Applying SPHINX-II to the DARPA Wall Street Journal CSR Task. HLT 1992 - [c10]Xuedong Huang:
Minimizing Speaker Variation Effects for Speaker-Independent Speech Recognition. HLT 1992 - [c9]Mei-Yuh Hwang, Xuedong Huang:
Subphonetic Modeling for Speech Recognition. HLT 1992 - [c8]Ronald Rosenfeld, Xuedong Huang:
Improvements in Stochastic Language Modeling. HLT 1992 - [c7]Wayne H. Ward, Sunil lssar, Xuedong Huang, Hsiao-Wuen Hon, Mei-Yuh Hwang, Sheryl Young, Michael Matessa, Fu-Hua Liu, Richard M. Stern:
Speech Understanding in Open Tasks. HLT 1992 - 1991
- [c6]Xuedong Huang, Kai-Fu Lee, Hsiao-Wuen Hon, Mei-Yuh Hwang:
Improved acoustic modeling with the SPHINX speech recognition system. ICASSP 1991: 345-348 - [c5]Mei-Yuh Hwang, Xuedong Huang:
Acoustic distribution clustering in phonetic hidden Markov models. EUROSPEECH 1991: 785-788 - [c4]Xuedong Huang:
A Study on Speaker-Adaptive Speech Recognition. HLT 1991 - 1990
- [j1]Kai-Fu Lee, Hsiao-Wuen Hon, Mei-Yuh Hwang, Xuedong Huang:
Speech recognition using hidden Markov models: A CMU perspective. Speech Commun. 9(5-6): 497-508 (1990) - [c3]Xuedong Huang, Kai-Fu Lee, Hsiao-Wuen Hon:
On semi-continuous hidden Markov modeling. ICASSP 1990: 689-692 - [c2]Xuedong Huang, Fil Alleva, Satoru Hayamizu, Hsiao-Wuen Hon, Mei-Yuh Hwang, Kai-Fu Lee:
Improved Hidden Markov Modeling for Speaker-Independent Continuous Speech Recognition. HLT 1990
1980 – 1989
- 1989
- [c1]Xuedong Huang, Hsiao-Wuen Hon, Kai-Fu Lee:
Large-vocabulary speaker-independent continuous speech recognition with semi-continuous hidden Markov models. EUROSPEECH 1989: 1163-1166
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-19 23:45 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint