default search action
Jianwei Yu
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j18]Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Cinematic Demixing Track. Trans. Int. Soc. Music. Inf. Retr. 7(1): 44-62 (2024) - [j17]Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada P. Mohanty, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang, Jiafeng Liu, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Music Demixing Track. Trans. Int. Soc. Music. Inf. Retr. 7(1): 63-84 (2024) - [j16]Xiaoxuan Shen, Jianwei Yu, Ruxia Liang, Qing Li, Shengyingjie Liu, Shangheng Du, Jianwen Sun, Sannyuya Liu:
Autobalanced Multitask Node Embedding Framework for Intelligent Education. IEEE Trans. Neural Networks Learn. Syst. 35(6): 8653-8667 (2024) - [c62]Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shi-Xiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu:
SECap: Speech Emotion Captioning with Large Language Model. AAAI 2024: 19323-19331 - [c61]Yuanyuan Wang, Hangting Chen, Dongchao Yang, Jianwei Yu, Chao Weng, Zhiyong Wu, Helen Meng:
Consistent and Relevant: Rethink the Query Embedding in General Sound Separation. ICASSP 2024: 961-965 - [c60]Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data. ICASSP 2024: 1136-1140 - [c59]Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition. ICASSP 2024: 10901-10905 - [c58]Hangting Chen, Jianwei Yu, Chao Weng:
Complexity Scaling for Speech Denoising. ICASSP 2024: 12276-12280 - [c57]He Zhao, Hangting Chen, Jianwei Yu, Yuehai Wang:
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings. IJCNN 2024: 1-8 - [c56]Chao Deng, Jianwei Yu, Ying Zhang, Shaolei Wang, Qian Huang:
Improved YOLOv5x for Offshore Wind Turbine Blade Defect Detection. PEAI 2024: 83-88 - [i48]He Zhao, Hangting Chen, Jianwei Yu, Yuehai Wang:
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings. CoRR abs/2401.15993 (2024) - [i47]Yi Luo, Jianwei Yu, Hangting Chen, Rongzhi Gu, Chao Weng:
Gull: A Generative Multifunctional Audio Codec. CoRR abs/2404.04947 (2024) - [i46]Yaoxun Xu, Shi-Xiong Zhang, Jianwei Yu, Zhiyong Wu, Dong Yu:
Comparing Discrete and Continuous Space LLMs for Speech Recognition. CoRR abs/2409.00800 (2024) - [i45]Jinchuan Tian, Chunlei Zhang, Jiatong Shi, Hao Zhang, Jianwei Yu, Shinji Watanabe, Dong Yu:
Preference Alignment Improves Language Model-Based TTS. CoRR abs/2409.12403 (2024) - [i44]Yaoxun Xu, Hangting Chen, Jianwei Yu, Wei Tan, Rongzhi Gu, Shun Lei, Zhiwei Lin, Zhiyong Wu:
MuCodec: Ultra Low-Bitrate Music Codec. CoRR abs/2409.13216 (2024) - [i43]Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li:
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction. CoRR abs/2409.15799 (2024) - 2023
- [j15]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Integrating Lattice-Free MMI Into End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 31: 25-38 (2023) - [j14]Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu:
Diffsound: Discrete Diffusion Model for Text-to-Sound Generation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1720-1733 (2023) - [j13]Yi Luo, Jianwei Yu:
Music Source Separation With Band-Split RNN. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1893-1901 (2023) - [c55]Thiranja P. Babarenda Gamage, Ayah Elsayed, Chinchien Lin, Alan Wu, Yuan Feng, Jianwei Yu, Linkun Gao, Savindi Wijenayaka, Martyn P. Nash, Anthony J. Doyle, David P. Nickerson:
Vision for the 12 LABOURS Digital Twin Platform. EMBC 2023: 1-4 - [c54]Jianwei Yu, Hangting Chen, Yi Luo, Rongzhi Gu, Weihua Li, Chao Weng:
TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge. ICASSP 2023: 1-2 - [c53]Jianwei Yu, Yi Luo:
Efficient Monaural Speech Enhancement with Universal Sample Rate Band-Split RNN. ICASSP 2023: 1-5 - [c52]Jinchuan Tian, Brian Yan, Jianwei Yu, Chao Weng, Dong Yu, Shinji Watanabe:
Bayes Risk CTC: Controllable CTC Alignment in Sequence-to-Sequence Tasks. ICLR 2023 - [c51]Mengzhe Geng, Xurong Xie, Rongfeng Su, Jianwei Yu, Zengrui Jin, Tianzi Wang, Shujie Hu, Zi Ye, Helen Meng, Xunying Liu:
On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition. INTERSPEECH 2023: 1753-1757 - [c50]Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Jiajun Deng, Mingyu Cui, Guinan Li, Jianwei Yu, Xurong Xie, Xunying Liu:
Use of Speech Impairment Severity for Dysarthric Speech Recognition. INTERSPEECH 2023: 2328-2332 - [c49]Jianwei Yu, Hangting Chen, Yi Luo, Rongzhi Gu, Chao Weng:
High Fidelity Speech Enhancement with Band-split RNN. INTERSPEECH 2023: 2483-2487 - [c48]Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng:
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression. INTERSPEECH 2023: 2523-2527 - [c47]Yi Luo, Jianwei Yu:
FRA-RIR: Fast Random Approximation of the Image-source Method. INTERSPEECH 2023: 3884-3888 - [c46]Dongchao Yang, Songxiang Liu, Helin Wang, Jianwei Yu, Chao Weng, Yuexian Zou:
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS. INTERSPEECH 2023: 4798-4802 - [c45]Jinchuan Tian, Jianwei Yu, Hangting Chen, Brian Yan, Chao Weng, Dong Yu, Shinji Watanabe:
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction. INTERSPEECH 2023: 4968-4972 - [c44]Jianwei Yu, Ying Zhang, Zihao Chen, Yuhao Li, Chao Deng:
Road Pothole Defect Detection Based on Improved YOLOv8xPothole Road Recognition with New Improved VOLOv8x AlgorithmTo solve the problem of recognizing road potholes, we use the improved VOLOv8x model to achieve high-precision recognition results. IoTAAI 2023: 653-658 - [c43]Yichao Du, Zhengsheng Guo, Jinchuan Tian, Zhirui Zhang, Xing Wang, Jianwei Yu, Zhaopeng Tu, Tong Xu, Enhong Chen:
The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks. IWSLT@ACL 2023: 79-88 - [i42]Mengzhe Geng, Zengrui Jin, Tianzi Wang, Shujie Hu, Jiajun Deng, Mingyu Cui, Guinan Li, Jianwei Yu, Xurong Xie, Xunying Liu:
Use of Speech Impairment Severity for Dysarthric Speech Recognition. CoRR abs/2305.10659 (2023) - [i41]Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada P. Mohanty, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang, Jiafeng Liu, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Music Demixing Track. CoRR abs/2308.06979 (2023) - [i40]Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada P. Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Cinematic Demixing Track. CoRR abs/2308.06981 (2023) - [i39]Jinchuan Tian, Jianwei Yu, Hangting Chen, Brian Yan, Chao Weng, Dong Yu, Shinji Watanabe:
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction. CoRR abs/2308.10107 (2023) - [i38]Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng:
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression. CoRR abs/2308.11053 (2023) - [i37]Hangting Chen, Jianwei Yu, Chao Weng:
Complexity Scaling for Speech Denoising. CoRR abs/2309.07757 (2023) - [i36]Junzhe Liu, Jianwei Yu, Xie Chen:
Improved Factorized Neural Transducer Model For text-only Domain Adaptation. CoRR abs/2309.09524 (2023) - [i35]Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition. CoRR abs/2309.11730 (2023) - [i34]Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data. CoRR abs/2309.13905 (2023) - [i33]Xiang Hao, Jibin Wu, Jianwei Yu, Chenglin Xu, Kay Chen Tan:
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction. CoRR abs/2310.07284 (2023) - [i32]Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shi-Xiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu:
SECap: Speech Emotion Captioning with Large Language Model. CoRR abs/2312.10381 (2023) - [i31]Yuanyuan Wang, Hangting Chen, Dongchao Yang, Jianwei Yu, Chao Weng, Zhiyong Wu, Helen Meng:
Consistent and Relevant: Rethink the Query Embedding in General Sound Separation. CoRR abs/2312.15463 (2023) - 2022
- [j12]Sannyuya Liu, Jianwei Yu, Qing Li, Ruxia Liang, Yunhan Zhang, Xiaoxuan Shen, Jianwen Sun:
Ability boosted knowledge tracing. Inf. Sci. 596: 567-587 (2022) - [j11]Zhipeng Chen, Qingquan Li, Jiayuan Li, Dejin Zhang, Jianwei Yu, Yu Yin, Shiwang Lv, Anbang Liang:
IMU-Aided Registration of MLS Point Clouds Using Inertial Trajectory Error Model and Least Squares Optimization. Remote. Sens. 14(6): 1365 (2022) - [j10]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Improving Mandarin End-to-End Speech Recognition With Word N-Gram Language Model. IEEE Signal Process. Lett. 29: 812-816 (2022) - [j9]Shoukang Hu, Xurong Xie, Mingyu Cui, Jiajun Deng, Shansong Liu, Jianwei Yu, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1093-1107 (2022) - [c42]Guinan Li, Jianwei Yu, Jiajun Deng, Xunying Liu, Helen Meng:
Audio-Visual Multi-Channel Speech Separation, Dereverberation and Recognition. ICASSP 2022: 6042-6046 - [c41]Junhao Xu, Jianwei Yu, Xunying Liu, Helen Meng:
Mixed Precision DNN Quantization for Overlapped Speech Separation and Recognition. ICASSP 2022: 7297-7301 - [c40]Naijun Zheng, Na Li, Jianwei Yu, Chao Weng, Dan Su, Xunying Liu, Helen Meng:
Multi-Channel Speaker Diarization Using Spatial Features for Meetings. ICASSP 2022: 7337-7341 - [c39]Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou:
Consistent Training and Decoding for End-to-End Speech Recognition Using Lattice-Free MMI. ICASSP 2022: 7782-7786 - [c38]Lingyun Feng, Jianwei Yu, Yan Wang, Songxiang Liu, Deng Cai, Haitao Zheng:
ASR-Robust Natural Language Understanding on ASR-GLUE dataset. INTERSPEECH 2022: 1101-1105 - [c37]Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou:
Improving Target Sound Extraction with Timestamp Information. INTERSPEECH 2022: 1526-1530 - [c36]Jinchuan Tian, Jianwei Yu, Chunlei Zhang, Yuexian Zou, Dong Yu:
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR. INTERSPEECH 2022: 3178-3182 - [c35]Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai, Dong Yu:
Automatic Prosody Annotation with Pre-Trained Text-Speech Model. INTERSPEECH 2022: 5513-5517 - [i30]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model. CoRR abs/2201.01995 (2022) - [i29]Shoukang Hu, Xurong Xie, Mingyu Cui, Jiajun Deng, Shansong Liu, Jianwei Yu, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks. CoRR abs/2201.03943 (2022) - [i28]Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng:
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition. CoRR abs/2201.05554 (2022) - [i27]Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng:
Investigation of Data Augmentation Techniques for Disordered Speech Recognition. CoRR abs/2201.05562 (2022) - [i26]Shansong Liu, Mengzhe Geng, Shoukang Hu, Xurong Xie, Mingyu Cui, Jianwei Yu, Xunying Liu, Helen Meng:
Recent Progress in the CUHK Dysarthric Speech Recognition System. CoRR abs/2201.05845 (2022) - [i25]Mengzhe Geng, Xurong Xie, Rongfeng Su, Jianwei Yu, Zi Ye, Xunying Liu, Helen Meng:
On-the-fly Feature Based Speaker Adaptation for Dysarthric and Elderly Speech Recognition. CoRR abs/2203.14593 (2022) - [i24]Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu:
Integrate Lattice-Free MMI into End-to-End Speech Recognition. CoRR abs/2203.15614 (2022) - [i23]Helin Wang, Dongchao Yang, Chao Weng, Jianwei Yu, Yuexian Zou:
Improving Target Sound Extraction with Timestamp Information. CoRR abs/2204.00821 (2022) - [i22]Guinan Li, Jianwei Yu, Jiajun Deng, Xunying Liu, Helen Meng:
Audio-visual multi-channel speech separation, dereverberation and recognition. CoRR abs/2204.01977 (2022) - [i21]Jinchuan Tian, Jianwei Yu, Chunlei Zhang, Chao Weng, Yuexian Zou, Dong Yu:
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR. CoRR abs/2206.02093 (2022) - [i20]Ziqian Dai, Jianwei Yu, Yan Wang, Nuo Chen, Yanyao Bian, Guangzhi Li, Deng Cai, Dong Yu:
Automatic Prosody Annotation with Pre-Trained Text-Speech Model. CoRR abs/2206.07956 (2022) - [i19]Dongchao Yang, Jianwei Yu, Helin Wang, Wen Wang, Chao Weng, Yuexian Zou, Dong Yu:
Diffsound: Discrete Diffusion Model for Text-to-sound Generation. CoRR abs/2207.09983 (2022) - [i18]Yi Luo, Jianwei Yu:
FRA-RIR: Fast Random Approximation of the Image-source Method. CoRR abs/2208.04101 (2022) - [i17]Yi Luo, Jianwei Yu:
Music Source Separation with Band-split RNN. CoRR abs/2209.15174 (2022) - [i16]Jinchuan Tian, Brian Yan, Jianwei Yu, Chao Weng, Dong Yu, Shinji Watanabe:
Bayes risk CTC: Controllable CTC alignment in Sequence-to-Sequence tasks. CoRR abs/2210.07499 (2022) - [i15]Dongchao Yang, Songxiang Liu, Jianwei Yu, Helin Wang, Chao Weng, Yuexian Zou:
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS. CoRR abs/2211.02448 (2022) - 2021
- [j8]Anbang Liang, Qingquan Li, Zhipeng Chen, Dejin Zhang, Jiasong Zhu, Jianwei Yu, Xu Fang:
Spherically Optimized RANSAC Aided by an IMU for Fisheye Image Matching. Remote. Sens. 13(10): 2017 (2021) - [j7]Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1514-1529 (2021) - [j6]Jianwei Yu, Shi-Xiong Zhang, Bo Wu, Shansong Liu, Shoukang Hu, Mengzhe Geng, Xunying Liu, Helen Meng, Dong Yu:
Audio-Visual Multi-Channel Integration and Recognition of Overlapped Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2067-2082 (2021) - [j5]Shansong Liu, Mengzhe Geng, Shoukang Hu, Xurong Xie, Mingyu Cui, Jianwei Yu, Xunying Liu, Helen Meng:
Recent Progress in the CUHK Dysarthric Speech Recognition System. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2267-2281 (2021) - [j4]Junhao Xu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng:
Mixed Precision Low-Bit Quantization of Neural Network Language Models for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3679-3693 (2021) - [c34]Jinchao Li, Jianwei Yu, Zi Ye, Simon Wong, Man-Wai Mak, Brian Mak, Xunying Liu, Helen Meng:
A Comparative Study of Acoustic and Linguistic Features Classification for Alzheimer's Disease Detection. ICASSP 2021: 6423-6427 - [c33]Zi Ye, Shoukang Hu, Jinchao Li, Xurong Xie, Mengzhe Geng, Jianwei Yu, Junhao Xu, Boyang Xue, Shansong Liu, Xunying Liu, Helen Meng:
Development of the Cuhk Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the Dementiabank Corpus. ICASSP 2021: 6433-6437 - [c32]Naijun Zheng, Na Li, Bo Wu, Meng Yu, Jianwei Yu, Chao Weng, Dan Su, Xunying Liu, Helen Meng:
A Joint Training Framework of Multi-Look Separator and Speaker Embedding Extractor for Overlapped Speech. ICASSP 2021: 6698-6702 - [c31]Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Transformer Language Models for Speech Recognition. ICASSP 2021: 7378-7382 - [c30]Junhao Xu, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng:
Mixed Precision Quantization of Transformer Language Models for Speech Recognition. ICASSP 2021: 7383-7387 - [c29]Helin Wang, Bo Wu, Lianwu Chen, Meng Yu, Jianwei Yu, Yong Xu, Shi-Xiong Zhang, Chao Weng, Dan Su, Dong Yu:
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation. Interspeech 2021: 1109-1113 - [c28]Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng:
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition. Interspeech 2021: 4793-4797 - [c27]Zengrui Jin, Mengzhe Geng, Xurong Xie, Jianwei Yu, Shansong Liu, Xunying Liu, Helen Meng:
Adversarial Data Augmentation for Disordered Speech Recognition. Interspeech 2021: 4803-4807 - [c26]Jiajun Deng, Fabian Ritter Gutierrez, Shoukang Hu, Mengzhe Geng, Xurong Xie, Zi Ye, Shansong Liu, Jianwei Yu, Xunying Liu, Helen Meng:
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition. Interspeech 2021: 4818-4822 - [c25]Disong Wang, Jianwei Yu, Xixin Wu, Lifa Sun, Xunying Liu, Helen Meng:
Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization. ISCSLP 2021: 1-5 - [c24]Jia Li, Jiajin Li, Yang Liu, Jianwei Yu, Yueting Li, Hong Cheng:
Deconvolutional Networks on Graph Data. NeurIPS 2021: 21019-21030 - [i14]Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Transformer Language Models for Speech Recognition. CoRR abs/2102.04754 (2021) - [i13]Helin Wang, Bo Wu, Lianwu Chen, Meng Yu, Jianwei Yu, Yong Xu, Shi-Xiong Zhang, Chao Weng, Dan Su, Dong Yu:
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation. CoRR abs/2103.16849 (2021) - [i12]Zengrui Jin, Mengzhe Geng, Xurong Xie, Jianwei Yu, Shansong Liu, Xunying Liu, Helen Meng:
Adversarial Data Augmentation for Disordered Speech Recognition. CoRR abs/2108.00899 (2021) - [i11]Lingyun Feng, Jianwei Yu, Deng Cai, Songxiang Liu, Haitao Zheng, Yan Wang:
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding. CoRR abs/2108.13048 (2021) - [i10]Jia Li, Jiajin Li, Yang Liu, Jianwei Yu, Yueting Li, Hong Cheng:
Deconvolutional Networks on Graph Data. CoRR abs/2110.15528 (2021) - [i9]Junhao Xu, Jianwei Yu, Xunying Liu, Helen Meng:
Mixed Precision DNN Qunatization for Overlapped Speech Separation and Recognition. CoRR abs/2111.14479 (2021) - [i8]Junhao Xu, Xie Chen, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng:
Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers. CoRR abs/2111.14836 (2021) - [i7]Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou:
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI. CoRR abs/2112.02498 (2021) - [i6]Junhao Xu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng:
Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech Recognition. CoRR abs/2112.11438 (2021) - [i5]Junhao Xu, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng:
Mixed Precision of Quantization of Transformer Language Models for Speech Recognition. CoRR abs/2112.11540 (2021) - 2020
- [j3]Xu Fang, Wenhao Guo, Qingquan Li, Jiasong Zhu, Zhipeng Chen, Jianwei Yu, Baoding Zhou, Haokun Yang:
Sewer Pipeline Fault Identification Using Anomaly Detection Algorithms on Video Sequences. IEEE Access 8: 39574-39586 (2020) - [c23]Xu Li, Jinghua Zhong, Xixin Wu, Jianwei Yu, Xunying Liu, Helen Meng:
Adversarial Attacks on GMM I-Vector Based Speaker Verification Systems. ICASSP 2020: 6579-6583 - [c22]Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu:
Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset. ICASSP 2020: 6984-6988 - [c21]Disong Wang, Jianwei Yu, Xixin Wu, Songxiang Liu, Lifa Sun, Xunying Liu, Helen Meng:
End-To-End Voice Conversion Via Cross-Modal Knowledge Distillation for Dysarthric Speech Reconstruction. ICASSP 2020: 7744-7748 - [c20]Junhao Xu, Xie Chen, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng:
Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers. ICASSP 2020: 7939-7943 - [c19]Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng:
Investigation of Data Augmentation Techniques for Disordered Speech Recognition. INTERSPEECH 2020: 696-700 - [c18]Shansong Liu, Xurong Xie, Jianwei Yu, Shoukang Hu, Mengzhe Geng, Rongfeng Su, Shi-Xiong Zhang, Xunying Liu, Helen Meng:
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition. INTERSPEECH 2020: 711-715 - [c17]Jianwei Yu, Bo Wu, Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu, Xunying Liu, Helen Meng:
Audio-Visual Multi-Channel Recognition of Overlapped Speech. INTERSPEECH 2020: 3496-3500 - [c16]Jia Li, Jianwei Yu, Jiajin Li, Honglei Zhang, Kangfei Zhao, Yu Rong, Hong Cheng, Junzhou Huang:
Dirichlet Graph Variational Autoencoder. NeurIPS 2020 - [c15]Xu Li, Jinghua Zhong, Jianwei Yu, Shoukang Hu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification. Odyssey 2020: 365-371 - [i4]Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu:
Audio-visual Recognition of Overlapped speech for the LRS2 dataset. CoRR abs/2001.01656 (2020) - [i3]Xu Li, Jinghua Zhong, Jianwei Yu, Shoukang Hu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification. CoRR abs/2004.04014 (2020) - [i2]Jianwei Yu, Bo Wu, Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu, Xunying Liu, Helen Meng:
Audio-visual Multi-channel Recognition of Overlapped Speech. CoRR abs/2005.08571 (2020)
2010 – 2019
- 2019
- [j2]Minglei Guan, Yaxin Cheng, Qingquan Li, Chisheng Wang, Xu Fang, Jianwei Yu:
An Effective Method for Submarine Buried Pipeline Detection via Multi-Sensor Data Fusion. IEEE Access 7: 125300-125309 (2019) - [c14]Shoukang Hu, Max W. Y. Lam, Xurong Xie, Shansong Liu, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition. ICASSP 2019: 6555-6559 - [c13]Xixin Wu, Songxiang Liu, Yuewen Cao, Xu Li, Jianwei Yu, Dongyang Dai, Xi Ma, Shoukang Hu, Zhiyong Wu, Xunying Liu, Helen Meng:
Speech Emotion Recognition Using Capsule Networks. ICASSP 2019: 6695-6699 - [c12]Yuewen Cao, Xixin Wu, Songxiang Liu, Jianwei Yu, Xu Li, Zhiyong Wu, Xunying Liu, Helen Meng:
End-to-end Code-switched TTS with Mix of Monolingual Recordings. ICASSP 2019: 6935-6939 - [c11]Max W. Y. Lam, Xie Chen, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng:
Gaussian Process Lstm Recurrent Neural Network Language Models for Speech Recognition. ICASSP 2019: 7235-7239 - [c10]Jianwei Yu, Max W. Y. Lam, Xie Chen, Shoukang Hu, Songxiang Liu, Xixin Wu, Xunying Liu, Helen Meng:
Recurrent Neural Network Language Model Training Using Natural Gradient. ICASSP 2019: 7260-7264 - [c9]Shoukang Hu, Xurong Xie, Shansong Liu, Max W. Y. Lam, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition. INTERSPEECH 2019: 2793-2797 - [c8]Jianwei Yu, Max W. Y. Lam, Shoukang Hu, Xixin Wu, Xu Li, Yuewen Cao, Xunying Liu, Helen Meng:
Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models. INTERSPEECH 2019: 3510-3514 - [c7]Shoukang Hu, Shansong Liu, Heng Fai Chang, Mengzhe Geng, Jiani Chen, Lau Wing Chung, To Ka Hei, Jianwei Yu, Ka Ho Wong, Xunying Liu, Helen Meng:
The CUHK Dysarthric Speech Recognition Systems for English and Cantonese. INTERSPEECH 2019: 3669-3670 - [c6]Shansong Liu, Shoukang Hu, Yi Wang, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng:
Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition. INTERSPEECH 2019: 4120-4124 - [i1]Xu Li, Jinghua Zhong, Xixin Wu, Jianwei Yu, Xunying Liu, Helen Meng:
Adversarial Attacks on GMM i-vector based Speaker Verification Systems. CoRR abs/1911.03078 (2019) - 2018
- [c5]Xunying Liu, Shansong Liu, Jinze Sha, Jianwei Yu, Zhiyuan Xu, Xie Chen, Helen Meng:
Limited-Memory BFGS Optimization of Recurrent Neural Network Language Models for Speech Recognition. ICASSP 2018: 6114-6118 - [c4]Max W. Y. Lam, Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng:
Gaussian Process Neural Networks for Speech Recognition. INTERSPEECH 2018: 1778-1782 - [c3]Jianwei Yu, Xurong Xie, Shansong Liu, Shoukang Hu, Max W. Y. Lam, Xixin Wu, Ka Ho Wong, Xunying Liu, Helen Meng:
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. INTERSPEECH 2018: 2938-2942 - 2015
- [j1]Qingzhou Mao, Liang Zhang, Qingquan Li, Qingwu Hu, Jianwei Yu, Shaojun Feng, Washington Yotto Ochieng, Hanlu Gong:
A Least Squares Collocation Method for Accuracy Improvement of Mobile LiDAR Systems. Remote. Sens. 7(6): 7402-7424 (2015)
2000 – 2009
- 2009
- [c2]Ranzhe Jing, Jianwei Yu:
A Case Study on Government Procurement Processes Identifying. WKDD 2009: 610-613 - 2008
- [c1]Ranzhe Jing, Jianwei Yu, Zuo Jiang:
Exploring Influencing Factors in E-Commerce Transaction Behaviors. ISECS 2008: 603-607
Coauthor Index
aka: Helen Meng
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-10 21:46 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint