default search action
Yuchen Hu
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j11]Shuitao Guo, Changqing Zhu, Na Ren, Yuchen Hu:
Vector geographic data commutative encryption and watermarking algorithm based on prediction differences. Expert Syst. Appl. 261: 125477 (2025) - 2024
- [j10]Yujiao Lyu, Pengxin Wang, Xueyuan Bai, Xuecao Li, Xin Ye, Yuchen Hu, Jie Zhang:
Machine learning techniques and interpretability for maize yield estimation using Time-Series images of MODIS and Multi-Source data. Comput. Electron. Agric. 222: 109063 (2024) - [j9]Yuchen Hu, Xingxiang Jiang, Changqing Zhu, Na Ren, Shuitao Guo, Jia Duan, Luanyun Hu:
A dual watermarking algorithm for trajectory data based on robust watermarking and fragile watermarking. Comput. Geosci. 191: 105655 (2024) - [j8]Na Ren, Yuchen Hu, Changqing Zhu, Shuitao Guo, Xianshu Zhu:
Moment invariants based zero watermarking algorithm for trajectory data. J. Inf. Secur. Appl. 86: 103867 (2024) - [j7]Yuchen Hu, Zhenxue Chen, Chengyun Liu, Tian Liang, Dan Lu:
SAFLFusionGait: Gait recognition network with separate attention and different granularity feature learnability fusion. J. Vis. Commun. Image Represent. 104: 104284 (2024) - [j6]Tian Liang, Zhenxue Chen, Chengyun Liu, Jiyang Chen, Yuchen Hu, Q. M. Jonathan Wu:
AdaptiveGait: adaptive feature fusion network for gait recognition. Multim. Tools Appl. 83(35): 83357-83376 (2024) - [j5]Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng:
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1145-1156 (2024) - [c30]Qiushi Zhu, Jie Zhang, Yu Gu, Yuchen Hu, Lirong Dai:
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation. AAAI 2024: 19768-19776 - [c29]Chen Chen, Ruizhe Li, Yuchen Hu, Yuanyuan Chen, Chengwei Qin, Qiang Zhang:
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System. ACL (Findings) 2024: 48-61 - [c28]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, EngSiong Chng:
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators. ACL (1) 2024: 74-90 - [c27]Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, EngSiong Chng, Ruizhe Li:
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models. ACL (Findings) 2024: 666-679 - [c26]Yuchen Hu, Ke Xu, Jialin Sun, Xinwei Fang, Weiwei Shan, Xi Wang, Zhe Jiang:
Make Each Iteration Count. ACM TUR-C 2024 - [c25]Zizheng Zhang, Chen Chen, Hsin-Hung Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng:
Noise-Aware Speech Separation with Contrastive Learning. ICASSP 2024: 1381-1385 - [c24]Heqing Zou, Meng Shen, Yuchen Hu, Chen Chen, Eng Siong Chng, Deepu Rajan:
Cross-Modality and Within-Modality Regularization for Audio-Visual Deepfake Detection. ICASSP 2024: 4900-4904 - [c23]Xiao-Ying Zhao, Qiushi Zhu, Yuchen Hu:
An Experimental Comparison of Noise-Robust Text-To-Speech Synthesis Systems Based On Self-Supervised Representation. ICASSP 2024: 11441-11445 - [c22]Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Engsiong Chng, Chao-Han Huck Yang:
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition. ICLR 2024 - [c21]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, Engsiong Chng:
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition. ICLR 2024 - [c20]Yuchen Hu, Changqing Zhu, Na Ren, Jinjie Gu:
Trajectory Data Semi-fragile Watermarking Algorithm Considering Spatiotemporal Features. SpatialDI 2024: 319-332 - [i41]Qiushi Zhu, Jie Zhang, Yu Gu, Yuchen Hu, Lirong Dai:
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation. CoRR abs/2401.03468 (2024) - [i40]Heqing Zou, Meng Shen, Yuchen Hu, Chen Chen, Eng Siong Chng, Deepu Rajan:
Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection. CoRR abs/2401.05746 (2024) - [i39]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, Eng Siong Chng:
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition. CoRR abs/2401.10446 (2024) - [i38]Chen Chen, Ruizhe Li, Yuchen Hu, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng, Chao-Han Huck Yang:
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition. CoRR abs/2402.05457 (2024) - [i37]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Eng Siong Chng:
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators. CoRR abs/2402.06894 (2024) - [i36]Chengwei Qin, Wenhan Xia, Tan Wang, Fangkai Jiao, Yuchen Hu, Bosheng Ding, Ruirui Chen, Shafiq Joty:
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning? CoRR abs/2404.12728 (2024) - [i35]Ke Xu, Jialin Sun, Yuchen Hu, Xinwei Fang, Weiwei Shan, Xi Wang, Zhe Jiang:
MEIC: Re-thinking RTL Debug Automation using LLMs. CoRR abs/2405.06840 (2024) - [i34]Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng, Ruizhe Li:
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models. CoRR abs/2405.10025 (2024) - [i33]Chen Chen, Ruizhe Li, Yuchen Hu, Yuanyuan Chen, Chengwei Qin, Qiang Zhang:
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System. CoRR abs/2405.10992 (2024) - [i32]Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Chengwei Qin, Pin-Yu Chen, Eng Siong Chng, Chao Zhang:
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models. CoRR abs/2405.14161 (2024) - [i31]Chen Chen, Yuchen Hu, Wen Wu, Helin Wang, Eng Siong Chng, Chao Zhang:
Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback. CoRR abs/2406.00654 (2024) - [i30]Ruohan Zhan, Shichao Han, Yuchen Hu, Zhenling Jiang:
Estimating Treatment Effects under Recommender Interference: A Structured Neural Networks Approach. CoRR abs/2406.14380 (2024) - [i29]Yuchen Hu, Chen Chen, Siyin Wang, Eng Siong Chng, Chao Zhang:
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization. CoRR abs/2407.02243 (2024) - [i28]Yuchen Dong, XiaoXiang Fang, Yuchen Hu, Renshuang Jiang, Zhe Jiang:
MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models. CoRR abs/2408.03841 (2024) - [i27]Helin Wang, Meng Yu, Jiarui Hai, Chen Chen, Yuchen Hu, Rilin Chen, Najim Dehak, Dong Yu:
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis. CoRR abs/2409.07556 (2024) - [i26]Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Sabato Marco Siniscalchi, Eng Siong Chng, Peter Bell, Catherine Lai, Shinji Watanabe, Andreas Stolcke:
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition. CoRR abs/2409.09785 (2024) - [i25]Yuchen Hu, Yu Gu, Chenxing Li, Rilin Chen, Dong Yu:
Video-to-Audio Generation with Fine-grained Temporal Semantics. CoRR abs/2409.14709 (2024) - 2023
- [j4]Ye Yuan, Kuankuan Xin, Jian Liu, Peng Zhao, Man Pok Lu, Yuner Yan, Yuchen Hu, Hong Huo, Zhaoyu Li, Tao Fang:
A GNN-based model for capturing spatio-temporal changes in locomotion behaviors of aging C. elegans. Comput. Biol. Medicine 155: 106694 (2023) - [j3]Na Ren, Shuitao Guo, Changqing Zhu, Yuchen Hu:
A zero-watermarking scheme based on spatial topological relations for vector dataset. Expert Syst. Appl. 226: 120217 (2023) - [j2]Xuefeng Zhang, Xiaobing Dai, Xuemin Zhang, Yuchen Hu, Yingdong Kang, Guang Jin:
Improved Generalized IHS Based on Total Variation for Pansharpening. Remote. Sens. 15(11): 2945 (2023) - [c19]Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning. AAAI 2023: 12607-12615 - [c18]Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. ACL (Findings) 2023: 659-672 - [c17]Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng:
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. ACL (1) 2023: 11610-11625 - [c16]Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiu-Shi Zhu, Eng Siong Chng:
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. ACL (1) 2023: 15213-15232 - [c15]Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng:
Metric-Oriented Speech Enhancement Using Diffusion Probabilistic Model. ICASSP 2023: 1-5 - [c14]Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng:
Unsupervised Noise Adaptation Using Data Simulation. ICASSP 2023: 1-5 - [c13]Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition. ICASSP 2023: 1-5 - [c12]Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng:
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation. ICASSP 2023: 1-5 - [c11]Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. IJCAI 2023: 5076-5084 - [c10]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition. INTERSPEECH 2023: 2918-2922 - [c9]Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Modeling Approach to Efficient Speech Separation. INTERSPEECH 2023: 3784-3788 - [c8]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Chng Eng Siong:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. NeurIPS 2023 - [i24]Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng:
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation. CoRR abs/2302.11131 (2023) - [i23]Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition. CoRR abs/2302.11362 (2023) - [i22]Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng:
Unsupervised Noise adaptation using Data Simulation. CoRR abs/2302.11981 (2023) - [i21]Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng:
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2302.11989 (2023) - [i20]Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng:
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. CoRR abs/2304.04974 (2023) - [i19]Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. CoRR abs/2305.09212 (2023) - [i18]Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. CoRR abs/2305.09299 (2023) - [i17]Zizheng Zhang, Chen Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng:
Noise-aware Speech Separation with Contrastive Learning. CoRR abs/2305.10761 (2023) - [i16]Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Model Approach to Efficient Speech Separation. CoRR abs/2305.16932 (2023) - [i15]Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng:
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. CoRR abs/2306.10563 (2023) - [i14]Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng:
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. CoRR abs/2306.10567 (2023) - [i13]Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Noise-aware Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2307.08029 (2023) - [i12]Qiushi Zhu, Yu Gu, Chao Weng, Yuchen Hu, Lirong Dai, Jie Zhang:
Rep2wav: Noise Robust text-to-speech Using self-supervised representations. CoRR abs/2308.14553 (2023) - [i11]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng:
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models. CoRR abs/2309.15701 (2023) - [i10]Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng:
Generative error correction for code-switching speech recognition using large language models. CoRR abs/2310.13013 (2023) - 2022
- [c7]Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-Critical Sequence Training for Automatic Speech Recognition. ICASSP 2022: 3688-3692 - [c6]Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-Robust Speech Recognition With 10 Minutes Unparalleled In-Domain Data. ICASSP 2022: 4298-4302 - [c5]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. ICASSP 2022: 6292-6296 - [c4]Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning. INTERSPEECH 2022: 2773-2777 - [i9]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition. CoRR abs/2203.14838 (2022) - [i8]Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data. CoRR abs/2203.15321 (2022) - [i7]Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning. CoRR abs/2203.15526 (2022) - [i6]Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-critical Sequence Training for Automatic Speech Recognition. CoRR abs/2204.06260 (2022) - [i5]Leilei Cao, Zhuang Li, Bo Yan, Feng Zhang, Fengliang Qi, Yuchen Hu, Hongbin Wang:
The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge-Track 3: Referring Video Object Segmentation. CoRR abs/2206.12035 (2022) - [i4]Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning. CoRR abs/2212.05301 (2022) - 2021
- [c3]Dan Liu, Mengge Du, Xiaoxi Li, Yuchen Hu, Lirong Dai:
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021. IWSLT 2021: 30-38 - [i3]Dan Liu, Mengge Du, Xiaoxi Li, Yuchen Hu, Lirong Dai:
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021. CoRR abs/2107.00279 (2021) - [i2]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. CoRR abs/2110.05267 (2021) - [i1]Yuchen Hu, Stefan Wager:
Off-Policy Evaluation in Partially Observed Markov Decision Processes. CoRR abs/2110.12343 (2021)
2010 – 2019
- 2019
- [c2]Haiyan Qiang, Wanli Li, Guanyuan Li, Yuchen Hu, Yougang Sun:
Study on Prediction of Power Allocation for the Double-Wheel Trench Cutter Control System Based on Extreme Learning Machine Method. ICARM 2019: 516-520 - 2017
- [j1]Yao Li, Guozhu Jia, Yang Cheng, Yuchen Hu:
Additive manufacturing technology in spare parts supply chain: a comparative study. Int. J. Prod. Res. 55(5): 1498-1515 (2017) - 2016
- [c1]Zhuting Yao, Yuchen Hu:
Gearbox fault diagnosis based on LMD and cyclostationary demodulation. URAI 2016: 984-989
Coauthor Index
aka: Qiu-Shi Zhu
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-07 20:35 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint