default search action

combined dblp search
author search
venue search
publication search

ask others

Xiaohui Zhang 0007

> Home > Persons

Person information

affiliation: Meta AI, New York, NY, USA
affiliation (PhD): Johns Hopkins University, Center for Language and Speech Processing, Baltimore, MD, USA

Other persons with the same name

see FAQ

Xiaohui Zhang — disambiguation page
Xiaohui Zhang 0001 — Nanyang Technological University, School of Electrical and Electronic Engineering, Singapore
Xiaohui Zhang 0002 — Henan University of Technology, College of Electrical Engineering, Zhengzhou, China
Xiaohui Zhang 0003 — Kunming University of Science and Technology, Faculty of Metallurgical and Energy Engineering, Kunming, China
Xiaohui Zhang 0004 — University of Liverpool, UK
Xiaohui Zhang 0005 — Sheffield Hallam University, UK
Xiaohui Zhang 0006 — University of Notre Dame, MINE Lab, Notre Dame, IN, USA (and 2 more)
Xiaohui Zhang 0008 — Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China (and 1 more)
Xiaohui Zhang 0009 — Qinghai Normal University, School of Computer, Xining, China (and 1 more)
Xiaohui Zhang 0010 — Heidelberg University, Institut für Technische Informatik (ZITI), Heidelberg, Germany

Xiaohui Zhang 0011 — Wuhan University, School of Power and Mechanical Engineering, Wuhan, China
Xiaohui Zhang 0012 — Bayer, Chesterfield, MO, USA (and 2 more)
Xiaohui Zhang 0013 — Naval University of Engineering, Department of Weapon Engineering, Wuhan, China
Xiaohui Zhang 0014 — Xi'an University of Technology, School of Computer Science and Engineering, Xi'an, China
Xiaohui Zhang 0015 — Beihang University, School of Engineering Medicine, Beijing Advanced Innovation Centre for Biomedical Engineering, Beijing, China (and 2 more)
Xiaohui Zhang 0016 — Huazhong University of Science and Technology, School of Computer Science and Technology, Key Laboratory of Cluster and Grid Computing, Key Laboratory of Services Computing Technology and System, Wuhan, China
Xiaohui Zhang 0017 — Inspur (Beijing) Electronic Information Industry Co., Ltd, Beijing, China (and 1 more)
Xiaohui Zhang 0018 — Peking University, People's Hospital, Institute of Hematology, Beijing, China
Xiaohui Zhang 0019 — Kunming University of Science and Technology, Faculty of Information Engineering and Automation, Kunming, China
Xiaohui Zhang 0020 — Northwest Normal University, College of Computer Science and Engineering, Lanzhou, China
Xiaohui Zhang 0021 — Henan University of Science and Technology, School of Information Engineering, Luoyang, China (and 1 more)
Xiaohui Zhang 0022 — Henan Xinda Wangyu Technology Co., Ltd., Zhengzhou, China
Xiaohui Zhang 0023 — Southeast University, National Prestress Engineering Research Center, Key Laboratory of C & PC Structures of Ministry of Education, Nanjing, China
Xiaohui Zhang 0024 — Dalian University of Technology, International School of Information Science & Engineering, Dalian, China
Xiaohui Zhang 0025 — Inner Mongolia University, College of Electronic Information Engineering, Hohhot, China

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j1]
- view
  - electronic edition @ jmlr.org (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/jmlr/PratapTSTBKENVF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/PratapTSTBKENVF24
Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1, 000+ Languages. J. Mach. Learn. Res. 25: 97:1-97:52 (2024)
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Huang0NSHHMPW0P24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Huang0NSHHMPW0P24
Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur:
Less Peaky and More Accurate CTC Forced Alignment by Label Priors. ICASSP 2024: 11831-11835
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02560
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02560
Ruizhe Huang, Xiaohui Zhang, Zhaoheng Ni, Li Sun, Moto Hira, Jeff Hwang, Vimal Manohar, Vineel Pratap, Matthew Wiesner, Shinji Watanabe, Daniel Povey, Sanjeev Khudanpur:
Less Peaky and More Accurate CTC Forced Alignment by Label Priors. CoRR abs/2406.02560 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-03752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-03752
Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:
Efficient Streaming LLM for Speech Recognition. CoRR abs/2410.03752 (2024)
2023
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/YanS0IPDPFBHZNH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/YanS0IPDPFBHZNH23
Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, Siddharth Dalmia, Peter Polak, Patrick Fernandes, Dan Berrebbi, Tomoki Hayashi, Xiaohui Zhang, Zhaoheng Ni, Moto Hira, Soumi Maiti, Juan Pino, Shinji Watanabe:
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit. ACL (demo) 2023: 400-411
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HwangHCZNSMHPZKYZLKRSWST23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HwangHCZNSMHPZKYZLKRSWST23
Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao:
TorchAudio 2.1: Advancing Speech Recognition, Self-Supervised Learning, and Audio Processing Components for Pytorch. ASRU 2023: 1-9
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KumarTNMZHX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KumarTNMZHX23
Anurag Kumar, Ke Tan, Zhaoheng Ni, Pranay Manocha, Xiaohui Zhang, Ethan Henderson, Buye Xu:
Torchaudio-Squim: Reference-Less Speech Quality and Intelligibility Measures in Torchaudio. ICASSP 2023: 1-5
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RajJMWMZK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RajJMWMZK23
Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. ICASSP 2023: 1-5
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13516
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13516
Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1, 000+ Languages. CoRR abs/2305.13516 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-17864
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-17864
Jeff Hwang, Moto Hira, Caroline Chen, Xiaohui Zhang, Zhaoheng Ni, Guangzhi Sun, Pingchuan Ma, Ruizhe Huang, Vineel Pratap, Yuekai Zhang, Anurag Kumar, Chin-Yun Yu, Chuang Zhu, Chunxi Liu, Jacob Kahn, Mirco Ravanelli, Peng Sun, Shinji Watanabe, Yangyang Shi, Yumeng Tao, Robin Scheibler, Samuele Cornell, Sean Kim, Stavros Petridis:
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch. CoRR abs/2310.17864 (2023)
2022
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuPSCXZCAHS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuPSCXZCAHS22
Chunxi Liu, Michael Picheny, Leda Sari, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf:
Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions. ICASSP 2022: 6162-6166
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangSWLCZVKC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangSWLCZVKC22
Haichuan Yang, Yuan Shangguan, Dilin Wang, Meng Li, Pierce Chuang, Xiaohui Zhang, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra:
Omni-Sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR Via Supernet. ICASSP 2022: 8197-8201
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiWWXMZLLSNKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiWWXMZLLSNKS22
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution. ICASSP 2022: 8277-8281
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-11588
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11588
Desh Raj, Junteng Jia, Jay Mahadeokar, Chunyang Wu, Niko Moritz, Xiaohui Zhang, Ozlem Kalinli:
Anchored Speech Recognition with Neural Transducers. CoRR abs/2210.11588 (2022)
2021
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhangMZZSSCPSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhangMZZSSCPSS21
Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer:
On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models. ASRU 2021: 1026-1033
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/Zhang0LSCPLYPSZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/Zhang0LSCPLYPSZ21
Xiaohui Zhang, Frank Zhang, Chunxi Liu, Kjell Schubert, Julian Chan, Pradyot Prakash, Jun Liu, Ching-Feng Yeh, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig:
Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR. SLT 2021: 46-51
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-04154
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-04154
Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer:
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models. CoRR abs/2107.04154 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05241
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05241
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution. CoRR abs/2110.05241 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-08352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-08352
Haichuan Yang, Yuan Shangguan, Dilin Wang, Meng Li, Pierce Chuang, Xiaohui Zhang, Ganesh Venkatesh, Ozlem Kalinli, Vikas Chandra:
Omni-sparsity DNN: Fast Sparsity Optimization for On-Device Streaming E2E ASR via Supernet. CoRR abs/2110.08352 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-09983
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-09983
Chunxi Liu, Michael Picheny, Leda Sari, Pooja Chitkara, Alex Xiao, Xiaohui Zhang, Mark Chou, Andres Alvarado, Caner Hazirbas, Yatharth Saraf:
Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions. CoRR abs/2111.09983 (2021)
2020
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangPK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangPK20
Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
OOV Recovery with Efficient 2nd Pass Decoding and Open-vocabulary Word-level RNNLM Rescoring for Hybrid ASR. ICASSP 2020: 6334-6338
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangMLLXMHTZZFZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangMLLXMHTZZFZ20
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-Based Acoustic Modeling for Hybrid Speech Recognition. ICASSP 2020: 6874-6878
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TjandraLZZWS0Z20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TjandraLZZWS0Z20
Andros Tjandra, Chunxi Liu, Frank Zhang, Xiaohui Zhang, Yongqiang Wang, Gabriel Synnaeve, Satoshi Nakamura, Geoffrey Zweig:
DEJA-VU: Double Feature Presentation and Iterated Loss in Deep Transformer Networks. ICASSP 2020: 6899-6903
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangWZLSZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangWZLSZ20
Frank Zhang, Yongqiang Wang, Xiaohui Zhang, Chunxi Liu, Yatharth Saraf, Geoffrey Zweig:
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces. INTERSPEECH 2020: 976-980
[c11]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/sltu/LiuZZSSZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sltu/LiuZZSSZ20
Chunxi Liu, Qiaochu Zhang, Xiaohui Zhang, Kritika Singh, Yatharth Saraf, Geoffrey Zweig:
Multilingual Graphemic Hybrid ASR with Massive Data Augmentation. SLTU-CCURL@LREC 2020: 46-52
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-09150
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09150
Frank Zhang, Yongqiang Wang, Xiaohui Zhang, Chunxi Liu, Yatharth Saraf, Geoffrey Zweig:
Fast, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces. CoRR abs/2005.09150 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-04785
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-04785
Xiaohui Zhang, Frank Zhang, Chunxi Liu, Kjell Schubert, Julian Chan, Pradyot Prakash, Jun Liu, Ching-Feng Yeh, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig:
Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR. CoRR abs/2011.04785 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LeZZFZS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LeZZFZS19
Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer:
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition. ASRU 2019: 457-464
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-06522
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-06522
Chunxi Liu, Qiaochu Zhang, Xiaohui Zhang, Kritika Singh, Yatharth Saraf, Geoffrey Zweig:
Multilingual ASR with Massive Data Augmentation. CoRR abs/1909.06522 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-01493
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-01493
Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer:
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition. CoRR abs/1910.01493 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-09799
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-09799
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-based Acoustic Modeling for Hybrid Speech Recognition. CoRR abs/1910.09799 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-10324
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10324
Andros Tjandra, Chunxi Liu, Frank Zhang, Xiaohui Zhang, Yongqiang Wang, Gabriel Synnaeve, Satoshi Nakamura, Geoffrey Zweig:
Deja-vu: Double Feature Presentation in Deep Transformer Networks. CoRR abs/1910.10324 (2019)
2017
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangPXZPK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangPXZPK17
Yiming Wang, Vijayaditya Peddinti, Hainan Xu, Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
Backstitch: Counteracting Finite-Sample Bias via Negative Steps. INTERSPEECH 2017: 1631-1635
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangMPK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangMPK17
Xiaohui Zhang, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Acoustic Data-Driven Lexicon Learning Based on a Greedy Pronunciation Selection Framework. INTERSPEECH 2017: 2541-2545
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TrmalWPZGWMXPK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TrmalWPZGWMXPK17
Jan Trmal, Matthew Wiesner, Vijayaditya Peddinti, Xiaohui Zhang, Pegah Ghahremani, Yiming Wang, Vimal Manohar, Hainan Xu, Daniel Povey, Sanjeev Khudanpur:
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. INTERSPEECH 2017: 3597-3601
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/ZhangMPK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangMPK17
Xiaohui Zhang, Vimal Manohar, Daniel Povey, Sanjeev Khudanpur:
Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework. CoRR abs/1706.03747 (2017)
2015
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangPK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangPK15
Xiaohui Zhang, Daniel Povey, Sanjeev Khudanpur:
A diversity-penalizing ensemble training method for deep learning. INTERSPEECH 2015: 3590-3594
[c5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/PoveyZK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/PoveyZK14
Daniel Povey, Xiaohui Zhang, Sanjeev Khudanpur:
Parallel training of Deep Neural Networks with Natural Gradient and Parameter Averaging. ICLR (Workshop) 2015
2014
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangTPK14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangTPK14
Xiaohui Zhang, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
Improving deep neural network acoustic models using generalized maxout networks. ICASSP 2014: 215-219
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/Garcia-RomeroZM14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/Garcia-RomeroZM14
Daniel Garcia-Romero, Xiaohui Zhang, Alan McCree, Daniel Povey:
Improving speaker recognition performance in the domain adaptation challenge using deep neural networks. SLT 2014: 378-383
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/TrmalCPKGZMLJKY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/TrmalCPKGZMLJKY14
Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani, Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David Yarowsky, Florian Metze:
A keyword search system using open source software. SLT 2014: 530-535
2011
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/wcsp/ZhangSLW11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wcsp/ZhangSLW11
Xiaohui Zhang, Guangmin Sun, Haomiao Liu, Qite Wang:
Flaw classification in ultrasonic guided waves signal using Wavelet Transform and PNN classifier. WCSP 2011: 1-5

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.