default search action

combined dblp search
author search
venue search
publication search

ask others

Lu Lu 0015

> Home > Persons

Person information

affiliation: Bytedance Inc., ByteDance AI Lab, Speech and Audio Team

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tmc/LiuZQWCLYLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmc/LiuZQWCLYLH24
Yu Liu, Zirui Zhuang, Qi Qi, Jingyu Wang, Dezhi Chen, Lu Lu, Hongwei Yang, Jianxin Liao, Zhu Han:
Slice Sandwich: Jagged Slicing Multi-Tier Dynamic Resources for Diversified V2X Services. IEEE Trans. Mob. Comput. 23(5): 4285-4302 (2024)
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FanDZ0M24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FanDZ0M24
Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma:
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR. ICASSP 2024: 9986-9990
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TangYSC0LLMZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TangYSC0LLMZ24
Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
Extending Large Language Models for Speech and Audio Captioning. ICASSP 2024: 11236-11240
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuTSC0L0M024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuTSC0L0M024
Wenyi Yu, Changli Tang, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
Connecting Speech Encoder and Large Language Model for ASR. ICASSP 2024: 12637-12641
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/DongH00KZF0WCYB24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DongH00KZF0WCYB24
Qianqian Dong, Zhiying Huang, Qi Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang:
PolyVoice: Language Models for Speech to Speech Translation. ICLR 2024
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/TangYSC000M024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/TangYSC000M024
Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
SALMONN: Towards Generic Hearing Abilities for Large Language Models. ICLR 2024
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/RathoreLF0U24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/RathoreLF0U24
Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell:
Challenges in Training PINNs: A Loss Landscape Perspective. ICML 2024
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/SunYTC000M0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SunYTC000M0024
Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang:
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models. ICML 2024
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-01868
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-01868
Pratik Rathore, Weimu Lei, Zachary Frangella, Lu Lu, Madeleine Udell:
Challenges in Training PINNs: A Loss Landscape Perspective. CoRR abs/2402.01868 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-07485
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-07485
Hang Zhao, Yifei Xin, Zhesong Yu, Bilei Zhu, Lu Lu, Zejun Ma:
MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning. CoRR abs/2402.07485 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-02010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-02010
Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma:
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR. CoRR abs/2403.02010 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07914
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07914
Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Jun Zhang, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang:
Can Large Language Models Understand Spatial Audio? CoRR abs/2406.07914 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-13340
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-13340
Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words. CoRR abs/2406.13340 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-15704
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-15704
Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang:
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models. CoRR abs/2406.15704 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-17272
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17272
Van Tung Pham, Yist Y. Lin, Tao Han, Wei Li, Jun Zhang, Lu Lu, Yuxuan Wang:
A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR. CoRR abs/2406.17272 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-04675
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04675
Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou:
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition. CoRR abs/2407.04675 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-08680
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-08680
Minglun Han, Ye Bai, Chen Shen, Youjia Huang, Mingkun Huang, Zehua Lin, Linhao Dong, Lu Lu, Yuxuan Wang:
NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training. CoRR abs/2409.08680 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-16644
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-16644
Siyin Wang, Wenyi Yu, Yudong Yang, Changli Tang, Yixuan Li, Jimin Zhuang, Xianzhao Chen, Xiaohai Tian, Jun Zhang, Guangzhi Sun, Lu Lu, Chao Zhang:
Enabling Auditory Large Language Models for Automatic Speech Quality Evaluation. CoRR abs/2409.16644 (2024)
2023
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/jcin/WangZYZQSLFL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jcin/WangZYZQSLFL23
Jingyu Wang, Lei Zhang, Yiran Yang, Zirui Zhuang, Qi Qi, Haifeng Sun, Lu Lu, Junlan Feng, Jianxin Liao:
Network Meets ChatGPT: Intent Autonomous Management, Control and Operation. J. Commun. Inf. Networks 8(3): 239-255 (2023)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tccn/HeWQSLLH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tccn/HeWQSLLH23
Bo He, Jingyu Wang, Qi Qi, Haifeng Sun, Jianxin Liao, Lu Lu, Zhu Han:
Learning-Based Real-Time Transmission Control for Multi-Path TCP Networks. IEEE Trans. Cogn. Commun. Netw. 9(5): 1353-1369 (2023)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tpds/HanCGWQLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tpds/HanCGWQLL23
Rongxin Han, Dezhi Chen, Song Guo, Jingyu Wang, Qi Qi, Lu Lu, Jianxin Liao:
Multi-SP Network Slicing Parallel Relieving Edge Network Conflict. IEEE Trans. Parallel Distributed Syst. 34(11): 2860-2875 (2023)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/DongAWZLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/DongAWZLM23
Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma:
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training. ACL (Findings) 2023: 8894-8907
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/QiuHLZLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/QiuHLZLM23
Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma:
Improving Large-Scale Deep Biasing With Phoneme Features and Text-Only Data in Streaming Transducer. ASRU 2023: 1-8
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/QuYWLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/QuYWLM23
Xinghua Qu, Xiang Yin, Pengfei Wei, Lu Lu, Zejun Ma:
AudioQR: Deep Neural Audio Watermarks For QR Code. IJCAI 2023: 6192-6200
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangL00M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangL00M23
Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma:
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer. INTERSPEECH 2023: 386-390
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LinHXPKCHLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LinHXPKCHLM23
Yist Y. Lin, Tao Han, Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Tze Yuang Chong, Yi He, Lu Lu, Zejun Ma:
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition. INTERSPEECH 2023: 904-908
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FanD0L00M23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FanD0L00M23
Zhiyun Fan, Linhao Dong, Chen Shen, Zhenlin Liang, Jun Zhang, Lu Lu, Zejun Ma:
Language-specific Boundary Learning for Improving Mandarin-English Code-switching Speech Recognition. INTERSPEECH 2023: 3322-3326
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/QuLS0OLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/QuLS0OLM23
Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma:
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions and Prospects. SIGIR 2023: 2701-2711
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-13343
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-13343
Xinnian Liang, Bing Wang, Hui Huang, Shuangzhi Wu, Peihao Wu, Lu Lu, Zejun Ma, Zhoujun Li:
Unleashing Infinite-Length Input Capacity for Large-scale Language Models with Self-Controlled Memory System. CoRR abs/2304.13343 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17499
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17499
Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma:
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training. CoRR abs/2305.17499 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-02982
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02982
Qianqian Dong, Zhiying Huang, Qiao Tian, Chen Xu, Tom Ko, Yunlong Zhao, Siyuan Feng, Tang Li, Kexin Wang, Xuxin Cheng, Fengpeng Yue, Ye Bai, Xi Chen, Lu Lu, Zejun Ma, Yuping Wang, Mingxuan Wang, Yuxuan Wang:
PolyVoice: Language Models for Speech to Speech Translation. CoRR abs/2306.02982 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-04076
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-04076
Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma:
Text-only Domain Adaptation using Unified Speech-Text Representation in Transducer. CoRR abs/2306.04076 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-05279
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-05279
Zhiyun Fan, Linhao Dong, Chen Shen, Zhenlin Liang, Jun Zhang, Lu Lu, Zejun Ma:
Language-specific Acoustic Boundary Learning for Mandarin-English Code-switching Speech Recognition. CoRR abs/2306.05279 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-08219
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-08219
Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma:
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects. CoRR abs/2306.08219 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-13963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13963
Wenyi Yu, Changli Tang, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
Connecting Speech Encoder and Large Language Model for ASR. CoRR abs/2309.13963 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-05863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05863
Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models. CoRR abs/2310.05863 (2023)
[i2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-13289
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-13289
Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang:
SALMONN: Towards Generic Hearing Abilities for Large Language Models. CoRR abs/2310.13289 (2023)
[i1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-08966
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-08966
Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma:
Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer. CoRR abs/2311.08966 (2023)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.