Thomas Fang Zheng
Person information
- affiliation: Tsinghua University, Center for Speech and Language Technologies, BNRist, China
- affiliation (PhD 1997): Tsinghua University, Department of Computer Science and Technology, Beijing, China
Other persons with the same name
- Fang Zheng 0002 — Xi'an Jiaotong University, School of Electronics and Information Engineering, China
- Fang Zheng 0003 — IBM T. J. Watson Research Center, USA (and 1 more)
- Fang Zheng 0004 — Lanzhou University, School of Information Science and Engineering, Gansu Provincial Key Laboratory of Wearable Computing, China
- Fang Zheng 0005 — Ernst & Young, Financial Services Risk Management, New York, NY, USA (and 1 more)
- Fang Zheng 0006 — Xiamen University, School of Information Science and Technology, China
- Fang Zheng 0007 — Jiangnan Institute of Computing Technology, State Key Laboratory of Mathematical Engineering and Advanced Computing, Wuxi, China
- Fang Zheng 0008 — University of Kentucky, College of Pharmacy, Lexington, KY, USA
- Fang Zheng 0009 — Xi'an Jiaotong University, Institute of Artificial Intelligence and Robotics, School of Software Engineering, China
- Fang Zheng 0010 — Huazhong Agricultural University, College of Informatics, Wuhan, China
- Fang Zheng 0011 — Fujian Vocational College of Agriculture, Fuzhou, China
- Fang Zheng 0012 — Washington State University Tri-Cities, School of Electrical Engineering and Computer Science, Richland, WA, USA
- Fang Zheng 0013 — Nanjing University, Department of Mathematics, China
- Fang Zheng 0014 — South-Central University for Nationalities, College of Computer Science, Wuhan, China (and 1 more)
- Fang Zheng 0015 — National Research Center of Parallel Computer Engineering and Technology, Beijing, China
- Fang Zheng 0016 — Civil Aviation University of China, School of Electronics and Automatic, Tianjin, China
- Fang Zheng 0017 — Alibaba Group
- Fang Zheng 0018 — Shanxi University of Finance and Economics, School of Information Management, Taiyuan, China
- Fang Zheng 0019 — Wuhan No.1 Hospital, China
- Fang Zheng 0020 — Hong Kong Polytechnic University, Department of Electrical and Electronic Engineering, Hong Kong
2020 – today
- 2024
- [c136]Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng:
Enhancing Quantised End-to-End ASR Models Via Personalisation. ICASSP 2024: 12426-12430 - [i29]Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng:
SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR. CoRR abs/2406.19706 (2024) - [i28]Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng:
Speaker Adaptation for Quantised End-to-End ASR Models. CoRR abs/2408.03979 (2024) - [i27]Xujiang Xing, Mingxing Xu, Thomas Fang Zheng:
A Joint Noise Disentanglement and Adversarial Training Framework for Robust Speaker Verification. CoRR abs/2408.11562 (2024) - [i26]Yiyang Zhao, Shuai Wang, Guangzhi Sun, Zehua Chen, Chao Zhang, Mingxing Xu, Thomas Fang Zheng:
Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models. CoRR abs/2408.15585 (2024) - 2023
- [j29]Xiaolong Wu, Chang Feng, Mingxing Xu, Thomas Fang Zheng, Askar Hamdulla:
DialoguePCN: Perception and Cognition Network for Emotion Recognition in Conversations. IEEE Access 11: 141251-141260 (2023) - [j28]Haoran Sun, Dong Wang, Lantian Li, Chen Chen, Thomas Fang Zheng:
Random Cycle Loss and Its Application to Voice Conversion. IEEE Trans. Pattern Anal. Mach. Intell. 45(8): 10331-10345 (2023) - [c135]Chen Chen, Dong Wang, Thomas Fang Zheng:
CN-CVS: A Mandarin Audio-Visual Dataset for Large Vocabulary Continuous Visual to Speech Synthesis. ICASSP 2023: 1-5 - [i25]Qiuming Zhao, Guangzhi Sun, Chao Zhang, Mingxing Xu, Thomas Fang Zheng:
Enhancing Quantised End-to-End ASR Models via Personalisation. CoRR abs/2309.09136 (2023) - 2022
- [j27]Lantian Li, Ruiqi Liu, Jiawen Kang, Yue Fan, Hao Cui, Yunqi Cai, Ravichander Vipperla, Thomas Fang Zheng, Dong Wang:
CN-Celeb: Multi-genre speaker recognition. Speech Commun. 137: 77-91 (2022) - [c134]Wei Liu, Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas Fang Zheng:
A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing. Odyssey 2022: 120-125 - [e4]Weihong Deng, Jianjiang Feng, Di Huang, Meina Kan, Zhenan Sun, Fang Zheng, Wenfeng Wang, Zhaofeng He:
Biometric Recognition - 16th Chinese Conference, CCBR 2022, Beijing, China, November 11-13, 2022, Proceedings. Lecture Notes in Computer Science 13628, Springer 2022, ISBN 978-3-031-20232-2 [contents] - [e3]Thomas Fang Zheng:
Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June - 1 July 2022, Beijing, China. ISCA 2022 [contents] - 2021
- [j26]Linlin Zheng, Jiakang Li, Meng Sun, Xiongwei Zhang, Thomas Fang Zheng:
When Automatic Voice Disguise Meets Automatic Speaker Verification. IEEE Trans. Inf. Forensics Secur. 16: 824-837 (2021) - [c133]Haoran Sun, Lantian Li, Thomas Fang Zheng, Dong Wang:
How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition. APSIPA ASC 2021: 780-785 - [c132]Weiyi Zhang, Shuning Zhao, Le Liu, Jianmin Li, Xingliang Cheng, Thomas Fang Zheng, Xiaolin Hu:
Attack on Practical Speaker Verification System Using Universal Adversarial Perturbations. ICASSP 2021: 2575-2579 - [c131]Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang:
Squeezing Value of Cross-Domain Labels: A Decoupled Scoring Approach for Speaker Verification. ICASSP 2021: 5829-5833 - [c130]Xingliang Cheng, Mingxing Xu, Thomas Fang Zheng:
Cross-Database Replay Detection in Terminal-Dependent Speaker Verification. Interspeech 2021: 4274-4278 - [i24]Weiyi Zhang, Shuning Zhao, Le Liu, Jianmin Li, Xingliang Cheng, Thomas Fang Zheng, Xiaolin Hu:
Attack on practical speaker verification system using universal adversarial perturbations. CoRR abs/2105.09022 (2021) - [i23]Wei Liu, Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas Fang Zheng:
A Multi-Resolution Front-End for End-to-End Speech Anti-Spoofing. CoRR abs/2110.05087 (2021) - [i22]Haoran Sun, Lantian Li, Thomas Fang Zheng, Dong Wang:
How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition. CoRR abs/2111.12324 (2021) - 2020
- [c129]Sitong Cheng, Zhixin Liu, Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng:
ASR-Free Pronunciation Assessment. INTERSPEECH 2020: 3047-3051 - [c128]Lantian Li, Dong Wang, Thomas Fang Zheng:
Neural Discriminant Analysis for Deep Speaker Embedding. INTERSPEECH 2020: 3251-3255 - [c127]Jiawen Kang, Ruiqi Liu, Lantian Li, Yunqi Cai, Dong Wang, Thomas Fang Zheng:
Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning. INTERSPEECH 2020: 3825-3829 - [e2]Helen Meng, Bo Xu, Thomas Fang Zheng:
21st Annual Conference of the International Speech Communication Association, Interspeech 2020, Virtual Event, Shanghai, China, October 25-29, 2020. ISCA 2020 [contents] - [i21]Linlin Zheng, Jiakang Li, Meng Sun, Xiongwei Zhang, Thomas Fang Zheng:
When Automatic Voice Disguise Meets Automatic Speaker Verification. CoRR abs/2009.06863 (2020) - [i20]Haoran Sun, Lantian Li, Yunqi Cai, Yang Zhang, Thomas Fang Zheng, Dong Wang:
Deep generative factorization for speech signal. CoRR abs/2010.14242 (2020) - [i19]Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang:
Squeezing value of cross-domain labels: a decoupled scoring approach for speaker verification. CoRR abs/2010.14243 (2020) - [i18]Lantian Li, Ruiqi Liu, Jiawen Kang, Yue Fan, Hao Cui, Yunqi Cai, Ravichander Vipperla, Thomas Fang Zheng, Dong Wang:
CN-Celeb: multi-genre speaker recognition. CoRR abs/2012.12468 (2020)
2010 – 2019
- 2019
- [j25]Xingyu Zhang, Xia Zou, Meng Sun, Thomas Fang Zheng, Chong Jia, Yimin Wang:
Noise Robust Speaker Recognition Based on Adaptive Frame Weighting in GMM for i-Vector Extraction. IEEE Access 7: 27874-27882 (2019) - [c126]Xingliang Cheng, Mingxing Xu, Thomas Fang Zheng:
Replay detection using CQT-based modified group delay feature and ResNeWt network in ASVspoof 2019. APSIPA 2019: 540-545 - 2018
- [c125]Xingliang Cheng, Xiaotong Zhang, Mingxing Xu, Thomas Fang Zheng:
MMANN: Multimodal Multilevel Attention Neural Network for Horror Clip Detection. APSIPA 2018: 329-334 - [c124]Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng:
Multilingual Stemming and Term extraction for Uyghur, Kazak and Kirghiz. APSIPA 2018: 587-590 - [c123]Yang Wang, Dong Wang, Thomas Fang Zheng:
RACORN-K: Risk-Aversion Pattern Matching-based Portfolio Selection. APSIPA 2018: 1816-1820 - [c122]Lantian Li, Dong Wang, Yixiang Chen, Ying Shi, Zhiyuan Tang, Thomas Fang Zheng:
Deep Factorization for Speech Signal. ICASSP 2018: 5094-5098 - [c121]Lantian Li, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng:
Full-Info Training for Deep Speaker Feature Learning. ICASSP 2018: 5369-5373 - [c120]Xiaotong Zhang, Xingliang Cheng, Mingxing Xu, Thomas Fang Zheng:
Imbalance Learning-based Framework for Fear Recognition in the MediaEval Emotional Impact of Movies Task. INTERSPEECH 2018: 3678-3682 - [i17]Lantian Li, Dong Wang, Yixiang Chen, Ying Shi, Zhiyuan Tang, Thomas Fang Zheng:
Deep factorization for speech signal. CoRR abs/1803.00886 (2018) - 2017
- [j24]Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman:
Distributed representation learning for knowledge graphs with entity descriptions. Pattern Recognit. Lett. 93: 31-37 (2017) - [c119]Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng:
Deep speaker verification: Do we need end to end? APSIPA 2017: 177-181 - [c118]Guanyu Li, Hongzhi Yu, Thomas Fang Zheng, Jinghao Yan, Shipeng Xu:
Free linguistic and speech resources for Tibetan. APSIPA 2017: 733-736 - [c117]Mijit Ablimit, Sardar Parhat, Askar Hamdulla, Thomas Fang Zheng:
A multilingual language processing tool for Uyghur, Kazak and Kirghiz. APSIPA 2017: 737-740 - [c116]Shipeng Xu, Hongzhi Yu, Thomas Fang Zheng, Guanyu Li, Gegeentana:
Language resource construction for Mongolian. APSIPA 2017: 741-744 - [c115]Ying Shi, Askar Hamdullah, Zhiyuan Tang, Dong Wang, Thomas Fang Zheng:
A free Kazakh speech database and a speech recognition baseline. APSIPA 2017: 745-748 - [c114]Lantian Li, Dong Wang, Askar Rozi, Thomas Fang Zheng:
Cross-lingual speaker verification with deep feature learning. APSIPA 2017: 1040-1044 - [c113]Aodong Li, Shiyue Zhang, Dong Wang, Thomas Fang Zheng:
Enhanced neural machine translation by learning from draft. APSIPA 2017: 1583-1587 - [c112]Longbiao Wang, Seiichi Nakagawa, Jianwu Dang, Jianguo Wei, Tongtong Shen, Lantian Li, Thomas Fang Zheng:
Pseudo-pitch-synchronized phase information extraction and its application for robust speaker recognition. GCCE 2017: 1-5 - [c111]Renyu Wang, Mingliang Gu, Lantian Li, Mingxing Xu, Thomas Fang Zheng:
Speaker segmentation using deep speaker vectors for fast speaker change scenarios. ICASSP 2017: 5420-5424 - [c110]Lantian Li, Yixiang Chen, Dong Wang, Thomas Fang Zheng:
A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification. INTERSPEECH 2017: 92-96 - [c109]Dong Wang, Thomas Fang Zheng, Zhiyuan Tang, Ying Shi, Lantian Li, Shiyue Zhang, Hongzhi Yu, Guanyu Li, Shipeng Xu, Askar Hamdulla, Mijit Ablimit, Gulnigar Mahmut:
M2ASR: Ambitions and first year progress. O-COCOSDA 2017: 1-6 - [i16]Lantian Li, Yixiang Chen, Dong Wang, Thomas Fang Zheng:
A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification. CoRR abs/1706.02101 (2017) - [i15]Dong Wang, Lantian Li, Zhiyuan Tang, Thomas Fang Zheng:
Deep Speaker Verification: Do We Need End to End? CoRR abs/1706.07859 (2017) - [i14]Lantian Li, Dong Wang, Askar Rozi, Thomas Fang Zheng:
Cross-lingual Speaker Verification with Deep Feature Learning. CoRR abs/1706.07861 (2017) - [i13]Aodong Li, Shiyue Zhang, Dong Wang, Thomas Fang Zheng:
Enhanced Neural Machine Translation by Learning from Draft. CoRR abs/1710.01789 (2017) - 2016
- [j23]Miao Fan, Qiang Zhou, Andrew Abel, Thomas Fang Zheng, Ralph Grishman:
Probabilistic Belief Embedding for Large-Scale Knowledge Population. Cogn. Comput. 8(6): 1087-1102 (2016) - [j22]Linlin Wang, Jun Wang, Lantian Li, Thomas Fang Zheng, Frank K. Soong:
Improving speaker verification performance against long-term speaker variability. Speech Commun. 79: 14-29 (2016) - [j21]Meng Sun, Xiongwei Zhang, Hugo Van hamme, Thomas Fang Zheng:
Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 93-104 (2016) - [j20]Lantian Li, Dong Wang, Chenhao Zhang, Thomas Fang Zheng:
Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes. IEEE ACM Trans. Audio Speech Lang. Process. 24(6): 1129-1139 (2016) - [c108]Lantian Li, Renyu Wang, Gang Wang, Caixia Wang, Thomas Fang Zheng:
Decision making based on cohort scores for speaker verification. APSIPA 2016: 1-4 - [c107]Lantian Li, Dong Wang, Xiaodong Zhang, Thomas Fang Zheng, Panshi Jin:
System combination for short utterance speaker recognition. APSIPA 2016: 1-5 - [c106]Ruru Li, Dali Yang, Xinxing Li, Renyu Wang, Mingxing Xu, Thomas Fang Zheng:
Relative entropy normalized Gaussian supervector for speech emotion recognition using kernel extreme learning machine. APSIPA 2016: 1-5 - [c105]Askar Rozi, Lantian Li, Dong Wang, Thomas Fang Zheng:
Feature transformation for speaker verification under speaking rate mismatch condition. APSIPA 2016: 1-4 - [c104]Lantian Li, Dong Wang, Chao Xing, Thomas Fang Zheng:
Max-margin metric learning for speaker recognition. ISCSLP 2016: 1-4 - [c103]Lantian Li, Chao Xing, Dong Wang, Kaimin Yu, Thomas Fang Zheng:
Binary speaker embedding. ISCSLP 2016: 1-4 - [c102]Miao Fan, Qiang Zhou, Thomas Fang Zheng:
Learning Embedding Representations for Knowledge Inference on Imperfect and Incomplete Repositories. WI 2016: 42-48 - [i12]Lantian Li, Dong Wang, Thomas Fang Zheng:
System Combination for Short Utterance Speaker Recognition. CoRR abs/1603.09460 (2016) - [i11]Lantian Li, Renyu Wang, Gang Wang, Caixia Wang, Thomas Fang Zheng:
Decision Making Based on Cohort Scores for Speaker Verification. CoRR abs/1609.08419 (2016) - 2015
- [j19]Shi Yin, Chao Liu, Zhiyong Zhang, Yiye Lin, Dong Wang, Javier Tejedor, Thomas Fang Zheng, Yinguo Li:
Noisy training for deep neural networks in speech recognition. EURASIP J. Audio Speech Music. Process. 2015: 2 (2015) - [j18]Guoyu Tang, Yunqing Xia, Erik Cambria, Peng Jin, Thomas Fang Zheng:
Document Representation with Statistical Word Senses in Cross-Lingual Document Clustering. Int. J. Pattern Recognit. Artif. Intell. 29(2): 1559003:1-1559003:26 (2015) - [j17]Guoyu Tang, Yunqing Xia, Jun Sun, Min Zhang, Thomas Fang Zheng:
Statistical word sense aware topic models. Soft Comput. 19(1): 13-27 (2015) - [j16]Fanhu Bie, Dong Wang, Jun Wang, Thomas Fang Zheng:
Detection and reconstruction of clipped speech for speaker recognition. Speech Commun. 72: 218-231 (2015) - [c101]Dong Wang, Thomas Fang Zheng:
Transfer learning for speech and language processing. APSIPA 2015: 1225-1237 - [c100]Lantian Li, Thomas Fang Zheng:
Gender-dependent feature extraction for speaker recognition. ChinaSIP 2015: 509-513 - [c99]Rozi Askar, Dong Wang, Fanhu Bie, Jun Wang, Thomas Fang Zheng:
Cross-lingual speaker verification based on linear transform. ChinaSIP 2015: 519-523 - [c98]Askar Rozi, Dong Wang, Zhiyong Zhang, Thomas Fang Zheng:
An open/free database and Benchmark for Uyghur speaker recognition. O-COCOSDA/CASLRE 2015: 81-85 - [c97]Miao Fan, Qiang Zhou, Thomas Fang Zheng:
Distant Supervision for Entity Linking. PACLIC 2015 - [c96]Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman:
Large Margin Nearest Neighbor Embedding for Knowledge Representation. WI-IAT (1) 2015: 53-59 - [i10]Miao Fan, Qiang Zhou, Thomas Fang Zheng:
Learning Embedding Representations for Knowledge Inference on Imperfect and Incomplete Repositories. CoRR abs/1503.08155 (2015) - [i9]Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman:
Large Margin Nearest Neighbor Embedding for Knowledge Representation. CoRR abs/1504.01684 (2015) - [i8]Miao Fan, Qiang Zhou, Andrew Abel, Thomas Fang Zheng, Ralph Grishman:
Probabilistic Belief Embedding for Knowledge Base Completion. CoRR abs/1505.02433 (2015) - [i7]Miao Fan, Qiang Zhou, Thomas Fang Zheng:
Distant Supervision for Entity Linking. CoRR abs/1505.03823 (2015) - [i6]Lantian Li, Dong Wang, Zhiyong Zhang, Thomas Fang Zheng:
Deep Speaker Vectors for Semi Text-independent Speaker Verification. CoRR abs/1505.06427 (2015) - [i5]Miao Fan, Qiang Zhou, Thomas Fang Zheng, Ralph Grishman:
Parallel Knowledge Embedding with MapReduce on a Multi-core Processor. CoRR abs/1509.01183 (2015) - [i4]Lantian Li, Dong Wang, Chao Xing, Kaimin Yu, Thomas Fang Zheng:
Binary Speaker Embedding. CoRR abs/1510.05937 (2015) - [i3]Lantian Li, Dong Wang, Chao Xing, Thomas Fang Zheng:
Max-margin Metric Learning for Speaker Recognition. CoRR abs/1510.05940 (2015) - [i2]Dong Wang, Thomas Fang Zheng:
Transfer Learning for Speech and Language Processing. CoRR abs/1511.06066 (2015) - 2014
- [c95]Miao Fan, Deli Zhao, Qiang Zhou, Zhiyuan Liu, Thomas Fang Zheng, Edward Y. Chang:
Distant Supervision for Relation Extraction with Matrix Completion. ACL (1) 2014: 839-849 - [c94]Jun Wang, Dong Wang, Ziwei Zhu, Thomas Fang Zheng, Frank K. Soong:
Discriminative scoring for speaker recognition based on I-vectors. APSIPA 2014: 1-5 - [c93]Thomas Fang Zheng, Qin Jin, Lantian Li, Jun Wang, Fanhu Bie:
An overview of robustness related issues in speaker recognition. APSIPA 2014: 1-10 - [c92]Fanhu Bie, Jun Wang, Dong Wang, Thomas Fang Zheng:
Block-wise training for i-vector. ChinaSIP 2014: 11-15 - [c91]Guoyu Tang, Yunqing Xia, Jun Sun, Min Zhang, Thomas Fang Zheng:
Topic Models Incorporating Statistical Word Senses. CICLing (1) 2014: 151-162 - [c90]Miao Fan, Qiang Zhou, Thomas Fang Zheng:
Mining the Personal Interests of Microbloggers via Exploiting Wikipedia Knowledge. CICLing (2) 2014: 188-200 - [c89]Yunqing Xia, Guoyu Tang, Huan Zhao, Erik Cambria, Thomas Fang Zheng:
Using Word Sense as a Latent Variable in LDA Can Improve Topic Modeling. ICAART (1) 2014: 532-537 - [c88]Jun Wang, Lantian Li, Dong Wang, Thomas Fang Zheng:
Research on generalization property of time-varying Fbank-weighted MFCC for i-vector based speaker verification. ISCSLP 2014: 423 - [c87]Fanhu Bie, Dong Wang, Thomas Fang Zheng:
Research on truncated speech in speaker verification. ISCSLP 2014: 425 - [c86]Guoyu Tang, Yunqing Xia, Weizhi Wang, Raymond Lau, Fang Zheng:
Clustering tweets using Wikipedia concepts. LREC 2014: 2262-2267 - [c85]Miao Fan, Qiang Zhou, Emily Chang, Thomas Fang Zheng:
Transition-based Knowledge Graph Embedding with Relational Mapping Properties. PACLIC 2014: 328-337 - [e1]Minghui Dong, Jianhua Tao, Haizhou Li, Thomas Fang Zheng, Yanfeng Lu:
The 9th International Symposium on Chinese Spoken Language Processing, Singapore, September 12-14, 2014. IEEE 2014, ISBN 978-1-4799-4220-6 [contents] - [i1]Miao Fan, Deli Zhao, Qiang Zhou, Zhiyuan Liu, Thomas Fang Zheng, Edward Y. Chang:
Errata: Distant Supervision for Relation Extraction with Matrix Completion. CoRR abs/1411.4455 (2014) - 2013
- [j15]Dong Wang, Ravichander Vipperla, Nicholas W. D. Evans, Thomas Fang Zheng:
Online Non-Negative Convolutive Pattern Learning for Speech Signals. IEEE Trans. Signal Process. 61(1): 44-56 (2013) - [c84]Fanhu Bie, Dong Wang, Thomas Fang Zheng, Javier Tejedor, Ruxin Chen:
Emotional adaptive training for speaker verification. APSIPA 2013: 1-4 - [c83]Fanhu Bie, Dong Wang, Thomas Fang Zheng, Ruxin Chen:
Emotional speaker verification with linear adaptation. ChinaSIP 2013: 91-94 - [c82]Chenhao Zhang, Thomas Fang Zheng:
A Fishervoice based feature fusion method for short utterance speaker recognition. ChinaSIP 2013: 165-169 - [c81]Jun Wang, Dong Wang, Xiaojun Wu, Thomas Fang Zheng:
Sequential UBM adaptation for speaker verification. ChinaSIP 2013: 356-359 - [c80]Jun Wang, Dong Wang, Xiaojun Wu, Thomas Fang Zheng, Javier Tejedor:
Sequential model adaptation for speaker verification. INTERSPEECH 2013: 2460-2464 - [c79]Yunqing Xia, Xiaoshi Zhong, Guoyu Tang, Junjun Wang, Qiang Zhou, Thomas Fang Zheng, Qinan Hu, Sen Na, Yaohai Huang:
Ranking Search Intents Underlying a Query. NLDB 2013: 266-271 - [c78]Junjun Wang, Guoyu Tang, Yunqing Xia, Qiang Zhou, Thomas Fang Zheng, Qinan Hu, Sen Na, Yaohai Huang:
Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task. NTCIR 2013 - 2012
- [j14]Nakhat Fatima, Xiaojun Wu, Thomas Fang Zheng:
Speech unit category based short utterance speaker recognition. Comput. Sci. Inf. Syst. 9(4): 1407-1430 (2012) - [c77]Linlin Wang, Xiaojun Wu, Thomas Fang Zheng, Chenhao Zhang:
An investigation into better frequency warping for time-varying speaker recognition. APSIPA 2012: 1-4 - [c76]Chenhao Zhang, Xiaojun Wu, Thomas Fang Zheng, Linlin Wang, Cong Yin:
A K-phoneme-class based multi-model method for short utterance speaker recognition. APSIPA 2012: 1-4 - [c75]Chenhao Zhang, Thomas Fang Zheng, Ruxin Chen:
Text-Dependent Speaker Recognition with long-term features based on functional data analysis. ISCSLP 2012: 340-344 - [c74]Miao Fan, Qiang Zhou, Thomas Fang Zheng:
Content-Based Semantic Tag Ranking for Recommendation. Web Intelligence 2012: 292-296 - 2011
- [c73]Chao Zhang, Yi Liu, Yunqing Xia, Thomas Fang Zheng, Jesper Ø. Olsen, Jilei Tian:
Reliable accent specific unit generation with dynamic Gaussian mixture selection for multi-accent speech recognition. ICME 2011: 1-6 - [c72]Guoyu Tang, Yunqing Xia, Min Zhang, Haizhou Li, Fang Zheng:
CLGVSM: Adapting Generalized Vector Space Model to Cross-lingual Document Clustering. IJCNLP 2011: 580-588 - 2010
- [j13]Gang Wang, Thomas Fang Zheng:
Using MMSE to improve session variability estimation. Int. J. Biom. 2(4): 350-357 (2010) - [c71]Gang Wang, Xiaojun Wu, Thomas Fang Zheng:
Using phoneme recognition and text-dependent speaker verification to improve speaker segmentation for Chinese speech. INTERSPEECH 2010: 1457-1460 - [c70]Jue Hou, Yi Liu, Thomas Fang Zheng, Jesper Ø. Olsen, Jilei Tian:
Using cepstral and prosodic features for Chinese accent identification. ISCSLP 2010: 177-181
2000 – 2009
- 2009
- [c69]Wenxiao Cao, Danning Jiang, Jue Hou, Yong Qin, Thomas Fang Zheng, Yi Liu:
A phrase-level piecewise linear scaling algorithm for melody match in Query-by-Humming systems. ICME 2009: 942-945 - [c68]Jue Hou, Danning Jiang, Wenxiao Cao, Yong Qin, Thomas Fang Zheng, Yi Liu:
Effectiveness of n-gram fast match for query-by-humming systems. ICME 2009: 1310-1313 - 2008
- [j12]Linquan Liu, Thomas Fang Zheng, Wenhu Wu:
State-dependent phoneme-based model merging for dialectal Chinese speech recognition. Speech Commun. 50(7): 605-615 (2008) - [c67]Jingfan Wang, Yunqing Xia, Thomas Fang Zheng, Xiaojun Wu:
Job Information Retrieval Based on Document Similarity. AIRS 2008: 165-175 - [c66]Wenxiao Cao, Yi Liu, Thomas Fang Zheng:
Local Mismatch Phone for Confidence Measure in Standard and Accented Chinese Speech Recognition. ISCSLP 2008: 209-212 - 2007
- [j11]Wei Wu, Thomas Fang Zheng, Mingxing Xu, Frank K. Soong:
A Cohort-Based Speaker Model Synthesis for Mismatched Channels in Speaker Verification. IEEE Trans. Speech Audio Process. 15(6): 1893-1903 (2007) - [c65]Yi Liu, Fang Zheng, Lei He, Yunqing Xia:
State-dependent mixture tying with variable codebook size for accented speech recognition. ASRU 2007: 300-305 - [c64]Jing Deng, Thomas Fang Zheng, Wenhu Wu:
Session Variability Subspace Projection Based Model Compensation for Speaker Verification. ICASSP (4) 2007: 57-60 - [c63]Huanjun Bao, Ming-Xing Xu, Thomas Fang Zheng:
Emotion attribute projection for speaker recognition on emotional speech. INTERSPEECH 2007: 758-761 - [c62]Linquan Liu, Thomas Fang Zheng, Makoto Akabane, Ruxin Chen, Wenhu Wu:
Using a small development set to build a robust dialectal Chinese speech recognizer. INTERSPEECH 2007: 1729-1732 - 2006
- [j10]Zhenyu Xiong, Thomas Fang Zheng, Zhanjiang Song, Frank K. Soong, Wenhu Wu:
A tree-based kernel selection approach to efficient Gaussian mixture model-universal background model based speaker identification. Speech Commun. 48(10): 1273-1282 (2006) - [c61]Wei Wu, Thomas Fang Zheng, Mingxing Xu:
Cohort-Based Speaker Model Synthesis for Channel Robust Speaker Recognition. ICASSP (1) 2006: 893-896 - [c60]Linquan Liu, Thomas Fang Zheng, Wenhu Wu:
Automatic initial/final generation for dialectal Chinese speech recognition. INTERSPEECH 2006 - [c59]Wei Wu, Thomas Fang Zheng, Ming-Xing Xu, Huanjun Bao:
Study on speaker verification on emotional speech. INTERSPEECH 2006 - [c58]Jian Liu, Thomas Fang Zheng, Wenhu Wu:
Pitch Mean Based Frequency Warping. ISCSLP (Selected Papers) 2006: 87-94 - [c57]Jing Deng, Thomas Fang Zheng, Wenhu Wu:
UBM Based Speaker Segmentation and Clustering for 2-Speaker Detection. ISCSLP (Selected Papers) 2006: 116-125 - [c56]Linquan Liu, Thomas Fang Zheng, Wenhu Wu:
State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition. ISCSLP (Selected Papers) 2006: 282-293 - [c55]Linquan Liu, Thomas Fang Zheng, Wenhu Wu:
English Alphabet Recognition Based on Chinese Acoustic Modeling. ISCSLP 2006 - [c54]Thomas Fang Zheng, Zhanjiang Song, Lihong Zhang, Michael Brasser, Wei Wu, Jing Deng:
CCC Speaker Recognition Evaluation 2006: Overview, Methods, Data, Results and Perspective. ISCSLP (Selected Papers) 2006: 485-493 - 2005
- [c53]Zhenyu Xiong, Thomas Fang Zheng, Zhanjiang Song, Wenhu Wu:
Combining Selection Tree with Observation Reordering Pruning for Efficient Speaker Identification Using GMM-UBM. ICASSP (1) 2005: 625-628 - [c52]Jian Liu, Thomas Fang Zheng, Jing Deng, Wenhu Wu:
Real-time pitch tracking based on combined SMDSF. INTERSPEECH 2005: 301-304 - [c51]Xiaojun Wu, Thomas Fang Zheng, Michael Brasser, Zhanjiang Song:
Rapidly developing spoken Chinese dialogue systems with the d-ear SDS SDK. INTERSPEECH 2005: 829-832 - [c50]Jing Deng, Thomas Fang Zheng, Zhanjiang Song, Jian Liu:
Modeling high-level information by using Gaussian mixture correlation for GMM-UBM based speaker recognition. INTERSPEECH 2005: 2033-2036 - [c49]Jing Deng, Thomas Fang Zheng, Jian Liu, Wenhu Wu:
The predictive differential amplitude spectrum for robust speaker recognition in stationary noises. INTERSPEECH 2005: 3105-3108 - [c48]Defeng Chen, Thomas Fang Zheng, Jian Liu, Jing Deng, Wenhu Wu, Zhanjiang Song, Xunyi Zhou:
The dynamically-adjustable histogram pruning method for embedded voice dialing. SIP 2005: 45-50 - 2004
- [j9]Thomas Fang Zheng:
Making Full Use of Chinese Speech Corpora. J. Chin. Lang. Comput. 14(4) (2004) - [c47]Zhenyu Xiong, Thomas Fang Zheng, Wenhu Wu:
Weighting observation vectors for robust speech recognition in noisy environments. INTERSPEECH 2004: 2069-2072 - [c46]Thomas Fang Zheng, Jing Li, Zhanjiang Song, Mingxing Xu:
A two-step keyword spotting method based on context-dependent a posteriori probability. ISCSLP 2004: 281-284 - 2003
- [j8]Genqing Wu, Fang Zheng:
A Method to Build a Super Small but Practically Accurate Language Model for Handheld Devices. J. Comput. Sci. Technol. 18(6): 747-755 (2003) - [c45]Hui Sun, Guoliang Zhang, Fang Zheng, Mingxing Xu:
Using word confidence measure for OOV words detection in a spontaneous spoken dialog system. INTERSPEECH 2003: 2713-2716 - 2002
- [j7]Fan Wang, Fang Zheng, Wenhu Wu:
Speech Detection in Non-Stationary Noise Based on the 1/f Process. J. Comput. Sci. Technol. 17(1): 83-89 (2002) - [j6]Fang Zheng, Zhanjiang Song, Pascale Fung, William J. Byrne:
Mandarin Pronunciation Modeling Based on CASS Corpus. J. Comput. Sci. Technol. 17(3): 249-263 (2002) - [c44]Genqing Wu, Fang Zheng, Wenhu Wu, Mingxing Xu, Ling Jin:
Improved katz smoothing for language modeling in speech recogniton. INTERSPEECH 2002: 925-928 - [c43]Fang Zheng, Zhanjiang Song, Pascale Fung, William Byrne:
Reducing pronunciation lexicon confusion and using more data without phonetic transcription for pronunciation modeling. INTERSPEECH 2002: 2461-2464 - [c42]Genqing Wu, Fang Zheng, Wenhu Wu:
A compression method used in language modeling for handheld devices. ISCSLP 2002 - 2001
- [j5]Thomas Fang Zheng, Guoliang Zhang, Zhanjiang Song:
Comparison of Different Implementations of MFCC. J. Comput. Sci. Technol. 16(6): 582-589 (2001) - [c41]William Byrne, Veera Venkataramani, Terri Kamm, Thomas Fang Zheng, Zhanjiang Song, Pascale Fung, Yi Liu, Umar Ruhi:
Automatic generation of pronunciation lexicons for Mandarin spontaneous speech. ICASSP 2001: 569-572 - [c40]Xiaojun Wu, Fang Zheng, Mingxing Xu:
Topic Forest: a plan-based dialog management structure. ICASSP 2001: 617-620 - [c39]Fang Zheng, Zhanjiang Song, Pascale Fung, William Byrne:
Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modeling. INTERSPEECH 2001: 57-60 - [c38]Jiyong Zhang, Fang Zheng, Jing Li, Chunhua Luo, Guoliang Zhang:
Improved context-dependent acoustic modeling for continuous Chinese speech recognition. INTERSPEECH 2001: 1617-1620 - [c37]Guoliang Zhang, Fang Zheng, Wenhu Wu:
A two-layer lexical tree based beam search in continuous Chinese speech recognition. INTERSPEECH 2001: 1801-1804 - [c36]Fan Wang, Fang Zheng, Wenhu Wu:
An MCE based classification tree using hierarchical feature-weighting in speech recognition. INTERSPEECH 2001: 1947-1950 - [c35]Genqing Wu, Fang Zheng, Ling Jin, Wenhu Wu:
An online incremental language model adaptation method. INTERSPEECH 2001: 2139-2142 - [c34]Pengju Yan, Fang Zheng, Mingxing Xu:
Robust parsing in spoken dialogue systems. INTERSPEECH 2001: 2149-2152 - [c33]Yinfei Huang, Fang Zheng, Yi Su, Fang Li, Wenhu Wu:
A theme structure method for the ellipsis resolution. INTERSPEECH 2001: 2153-2156 - [c32]Yi Su, Fang Zheng, Yinfei Huang:
Design of a semantic parser with support to ellipsis resolution in a Chinese spoken language dialogue system. INTERSPEECH 2001: 2161-2164 - 2000
- [j4]Fang Zheng, Jian Wu, Zhanjiang Song:
Improving the Syllable-Synchronous Network Search Algorithm for Word Decoding in Continuous Chinese Speech Recognition. J. Comput. Sci. Technol. 15(5): 461-471 (2000)
- [c31]Zhanjiang Song, Fang Zheng, Wenhu Wu:
Statistical knowledge based frame synchronous search strategies in continuous speech recognition. ICASSP 2000: 1583-1586
- [c30]Fang Zheng, Jian Wu, Wenhu Wu:
Input Chinese sentences using digits. INTERSPEECH 2000: 127-130
- [c29]Chunhua Luo, Fang Zheng, Mingxing Xu:
An equivalent-class based MMI learning method for MGCPM. INTERSPEECH 2000: 141-144
- [c28]Jian Wu, Fang Zheng:
On enhancing katz-smoothing based back-off language model. INTERSPEECH 2000: 198-201
- [c27]Jian Wu, Fang Zheng:
Reducing time-synchronous beam search effort using stage based look-ahead and language model rank based pruning. INTERSPEECH 2000: 262-265
- [c26]Jiyong Zhang, Fang Zheng, Mingxing Xu, Ditang Fang:
Semi-continuous segmental probability modeling for continuous speech recognition. INTERSPEECH 2000: 278-281
- [c25]Fang Zheng, Guoliang Zhang:
Integrating the energy information into MFCC. INTERSPEECH 2000: 389-392
- [c24]Aijun Li, Fang Zheng, William Byrne, Pascale Fung, Terri Kamm, Yi Liu, Zhanjiang Song, Umar Ruhi, Veera Venkataramani, Xiaoxia Chen:
CASS: a phonetically transcribed corpus of mandarin spontaneous speech. INTERSPEECH 2000: 485-488
- [c23]Fan Wang, Fang Zheng, Wenhu Wu:
A c/v segmentation method for Mandarin speech based on multiscale fractal dimension. INTERSPEECH 2000: 648-651
- [c22]Aijun Li, Xiaoxia Chen, Guohua Sun, Wu Hua, Zhigang Yin, Yiqing Zu, Fang Zheng, Zhanjiang Song:
The phonetic labeling on read and spontaneous discourse corpora. INTERSPEECH 2000: 724-727
- [c21]Yinfei Huang, Fang Zheng, Mingxing Xu, Pengju Yan, Wenhu Wu:
Language understanding component for Chinese dialogue system. INTERSPEECH 2000: 1053-1056
- [c20]Yinfei Huang, Fang Zheng, Wenhu Wu:
EasyCmd: Navigation by Voice Commands. ISCSLP 2000
- [c19]Ling Jin, Genqing Wu, Fang Zheng, Wenhu Wu:
Improved Strategies For Intelligent Sentence Input Method Engine System. ISCSLP 2000
- [c18]Jing Li, Fang Zheng, Wenhu Wu:
Context-Independent Chinese Initial-Final Acoustic Modeling. ISCSLP 2000
- [c17]Chunhua Luo, Mingxing Xu, Fang Zheng:
Acoustic Level Error Analysis in Continuous Speech Recognition. ISCSLP 2000
- [c16]Fan Wang, Fang Zheng, Wenhu Wu:
A Self adapting Endpoint Detection Algorithm for Speech Recognition in Noisy Environment Based on 1/f Process. ISCSLP 2000
- [c15]Pengju Yan, Fang Zheng, Mingxing Xu, Yinfei Huang:
Word-class Stochastic Model in A Spoken Language Dialogue System. ISCSLP 2000
- [c14]Dali Yang, Mingxing Xu, Wenhu Wu, Fang Zheng:
A Noise Cancellation Method Based on Wavelet Transform. ISCSLP 2000
- [c13]Guoliang Zhang, Fang Zheng, Wenhu Wu:
Tone Recognition of Chinese Continuous Speech. ISCSLP 2000
- [c12]Jiyong Zhang, Fang Zheng, Mingxing Xu, Shuqing Li:
Intra-syllable Dependent Phonetic Modeling For Chinese Speech Recognition. ISCSLP 2000
1990 – 1999
- 1999
- [j3]Fang Zheng, Mingxing Xu, Xiaolong Mou, Jian Wu, Wenhu Wu, Ditang Fang:
HarkMan - A vocabulary-independent keyword spotter for spontaneous Chinese speech. J. Comput. Sci. Technol. 14(1): 18-26 (1999)
- [c11]Qing Guo, Fang Zheng, Jian Wu, Wenhu Wu:
An new method used in HMM for modeling frame correlation. ICASSP 1999: 169-172
- [c10]Fang Zheng:
A syllable-synchronous network search algorithm for word decoding in Chinese speech recognition. ICASSP 1999: 601-604
- [c9]Zhanjiang Song, Fang Zheng, Mingxing Xu, Wenhu Wu:
An effective scoring method for speaking skill evaluation system. EUROSPEECH 1999: 187-190
- [c8]Fang Zheng, Zhanjiang Song, Mingxing Xu, Jian Wu, Yinfei Huang, Wenhu Wu, Cheng Bi:
Easytalk: a large-vocabulary speaker-independent Chinese dictation machine. EUROSPEECH 1999: 819-822
- [c7]Mingxing Xu, Fang Zheng, Wenhu Wu:
A fast and effective state decoding algorithm. EUROSPEECH 1999: 1255-1258
- 1998
- [j2]Fang Zheng, Wenhu Wu, Ditang Fang:
Center-distance continuous probability models and the distance measure. J. Comput. Sci. Technol. 13(5): 426-437 (1998)
- [c6]Qing Guo, Fang Zheng, Jian Wu, Wenhu Wu:
Non-linear probability estimation method used in HMM for modeling frame correlation. ICSLP 1998
- [c5]Fang Zheng, Zhanjiang Song, Ling Li, Wenjian Yu, Fengzhou Zheng, Wenhu Wu:
The distance measure for line spectrum pairs applied to speech recognition. ICSLP 1998
- [c4]Fang Zheng, Xiaolong Mou, Wenhu Wu, Ditang Fang:
On The Embedded Multiple-Model Scoring Scheme For Speech Recognition. ISCSLP 1998
- [c3]Fang Zheng, Mingxing Xu, Xiaolong Mou, Jian Wu, Wenhu Wu, Ditang Fang:
A Vocabulary-Independent Keyword Spotter for Spontaneous Chinese Speech. ISCSLP 1998
- [c2]Jian Wu, Fang Zheng, Wenhu Wu, Ditang Fang:
The Similarity Measure among Acoustic Models and Its Two Applications. ISCSLP 1998
- 1997
- [j1]Fang Zheng, Wenhu Wu, Ditang Fang:
A log-index weighted cepstral distance measure for speech recognition. J. Comput. Sci. Technol. 12(2): 177-184 (1997)
- [c1]Mingxing Xu, Fang Zheng, Wenhu Wu:
Rejection in Speech Recognition Based on CDCPMs. ROCLING 1997: 412-419
last updated on 2024-12-08 01:30 CET by the dblp team
all metadata released as open data under CC0 1.0 license