default search action
Lu Lu 0015
Person information
- affiliation: Bytedance Inc., ByteDance AI Lab, Speech and Audio Team
Other persons with the same name
- Lu Lu — disambiguation page
- Lu Lu 0001 — Chinese Academy of Sciences, Technology and Engineering Center for Space Utilization, Beijing, China (and 2 more)
- Lu Lu 0002 — Georgia Institute of Technology, School of Electrical and Computer Engineering, Atlanta, GA, USA (and 2 more)
- Lu Lu 0003 — New Jersey Institute of Technology, Department of Mechanical and Industrial Engineering, Newark, NJ, USA (and 3 more)
- Lu Lu 0004 — University of Tennessee, Health Science Center, Memphis, TN, USA (and 1 more)
- Lu Lu 0005 — Sichuan University, College of Electronics and Information Engineering, Chengdu, China (and 1 more)
- Lu Lu 0006 — Huazhong University of Science and Technology, Services Computing Technology and System Lab / Big Data Technology and System Lab, China
- Lu Lu 0007 — University of South Florida, Department of Mathematics and Statistics, Tampa, FL, USA (and 1 more)
- Lu Lu 0008 — China University of Mining and Technology, School of Environment Science and Spatial Informatics, Xuzhou, China
- Lu Lu 0009 — LSI Corporation, Milpitas, CA, USA (and 1 more)
- Lu Lu 0010 — Brown University, Division of Applied Mathematics, Providence, RI, USA
- Lu Lu 0011 — South China University of Technology, School of Computer Science and Engineering, Guangzhou, China (and 1 more)
- Lu Lu 0012 — Central South University, School of Mathematics and Statistics, Changsha, Hunan, China (and 1 more)
- Lu Lu 0013 — Nanyang Technological University, School of Electrical and Electronic Engineering, Singapore
- Lu Lu 0014 — Harbin Institute of Technology, School of Economic and Management, China
- Lu Lu 0016 — China Mobile Research Institute, Department of Basic Network Technology, Beijing, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j4]Yu Liu, Zirui Zhuang, Qi Qi, Jingyu Wang, Dezhi Chen, Lu Lu, Hongwei Yang, Jianxin Liao, Zhu Han:
Slice Sandwich: Jagged Slicing Multi-Tier Dynamic Resources for Diversified V2X Services. IEEE Trans. Mob. Comput. 23(5): 4285-4302 (2024) - [c14]Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma:
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR. ICASSP 2024: 9986-9990 - [c13]Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
Extending Large Language Models for Speech and Audio Captioning. ICASSP 2024: 11236-11240 - [c12]Wenyi Yu, Changli Tang, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
Connecting Speech Encoder and Large Language Model for ASR. ICASSP 2024: 12637-12641 - [c11]Qianqian Dong, Zhiying Huang, Qi Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang:
PolyVoice: Language Models for Speech to Speech Translation. ICLR 2024 - [c10]Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
SALMONN: Towards Generic Hearing Abilities for Large Language Models. ICLR 2024 - [c9]Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell:
Challenges in Training PINNs: A Loss Landscape Perspective. ICML 2024 - [c8]Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang:
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models. ICML 2024 - [i20]Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell:
Challenges in Training PINNs: A Loss Landscape Perspective. CoRR abs/2402.01868 (2024) - [i19]Hang Zhao, Yifei Xin, Zhesong Yu, Bilei Zhu, Lu Lu, Zejun Ma:
MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning. CoRR abs/2402.07485 (2024) - [i18]Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma:
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR. CoRR abs/2403.02010 (2024) - [i17]Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Jun Zhang, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang:
Can Large Language Models Understand Spatial Audio? CoRR abs/2406.07914 (2024) - [i16]Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words. CoRR abs/2406.13340 (2024) - [i15]Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang:
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models. CoRR abs/2406.15704 (2024) - [i14]Van Tung Pham, Yist Y. Lin, Tao Han, Wei Li, Jun Zhang, Lu Lu, Yuxuan Wang:
A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR. CoRR abs/2406.17272 (2024) - [i13]Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou:
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition. CoRR abs/2407.04675 (2024) - [i12]Minglun Han, Ye Bai, Chen Shen, Youjia Huang, Mingkun Huang, Zehua Lin, Linhao Dong, Lu Lu, Yuxuan Wang:
NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training. CoRR abs/2409.08680 (2024) - [i11]Siyin Wang, Wenyi Yu, Yudong Yang, Changli Tang, Yixuan Li, Jimin Zhuang, Xianzhao Chen, Xiaohai Tian, Jun Zhang, Guangzhi Sun, Lu Lu, Chao Zhang:
Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation. CoRR abs/2409.16644 (2024) - 2023
- [j3]Jingyu Wang, Lei Zhang, Yiran Yang, Zirui Zhuang, Qi Qi, Haifeng Sun, Lu Lu, Junlan Feng, Jianxin Liao:
Network Meets ChatGPT: Intent Autonomous Management, Control and Operation. J. Commun. Inf. Networks 8(3): 239-255 (2023) - [j2]Bo He, Jingyu Wang, Qi Qi, Haifeng Sun, Jianxin Liao, Lu Lu, Zhu Han:
Learning-Based Real-Time Transmission Control for Multi-Path TCP Networks. IEEE Trans. Cogn. Commun. Netw. 9(5): 1353-1369 (2023) - [j1]Rongxin Han, Dezhi Chen, Song Guo, Jingyu Wang, Qi Qi, Lu Lu, Jianxin Liao:
Multi-SP Network Slicing Parallel Relieving Edge Network Conflict. IEEE Trans. Parallel Distributed Syst. 34(11): 2860-2875 (2023) - [c7]Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma:
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training. ACL (Findings) 2023: 8894-8907 - [c6]Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma:
Improving Large-Scale Deep Biasing With Phoneme Features and Text-Only Data in Streaming Transducer. ASRU 2023: 1-8 - [c5]Xinghua Qu, Xiang Yin, Pengfei Wei, Lu Lu, Zejun Ma:
AudioQR: Deep Neural Audio Watermarks For QR Code. IJCAI 2023: 6192-6200 - [c4]Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma:
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer. INTERSPEECH 2023: 386-390 - [c3]Yist Y. Lin, Tao Han, Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Tze Yuang Chong, Yi He, Lu Lu, Zejun Ma:
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition. INTERSPEECH 2023: 904-908 - [c2]Zhiyun Fan, Linhao Dong, Chen Shen, Zhenlin Liang, Jun Zhang, Lu Lu, Zejun Ma:
Language-specific Boundary Learning for Improving Mandarin-English Code-switching Speech Recognition. INTERSPEECH 2023: 3322-3326 - [c1]Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma:
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions and Prospects. SIGIR 2023: 2701-2711 - [i10]Xinnian Liang, Bing Wang, Hui Huang, Shuangzhi Wu, Peihao Wu, Lu Lu, Zejun Ma, Zhoujun Li:
Unleashing Infinite-Length Input Capacity for Large-scale Language Models with Self-Controlled Memory System. CoRR abs/2304.13343 (2023) - [i9]Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma:
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training. CoRR abs/2305.17499 (2023) - [i8]Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang:
PolyVoice: Language Models for Speech to Speech Translation. CoRR abs/2306.02982 (2023) - [i7]Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma:
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer. CoRR abs/2306.04076 (2023) - [i6]Zhiyun Fan, Linhao Dong, Chen Shen, Zhenlin Liang, Jun Zhang, Lu Lu, Zejun Ma:
Language-specific Acoustic Boundary Learning for Mandarin-English Code-switching Speech Recognition. CoRR abs/2306.05279 (2023) - [i5]Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma:
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects. CoRR abs/2306.08219 (2023) - [i4]Wenyi Yu, Changli Tang, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
Connecting Speech Encoder and Large Language Model for ASR. CoRR abs/2309.13963 (2023) - [i3]Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models. CoRR abs/2310.05863 (2023) - [i2]Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
SALMONN: Towards Generic Hearing Abilities for Large Language Models. CoRR abs/2310.13289 (2023) - [i1]Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma:
Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer. CoRR abs/2311.08966 (2023)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-26 00:45 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint