Mike Seltzer

2020 – today

2024
[c13]
persistent URL: https://dblp.org/rec/conf/icassp/ShangguanYLWFWD24
Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-Device ASR Models. ICASSP 2024: 10216-10220
[c12]
persistent URL: https://dblp.org/rec/conf/icassp/GuoMMSWMKFS24
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective Internal Language Model Training and Fusion for Factorized Transducer Model. ICASSP 2024: 12687-12691
[c11]
persistent URL: https://dblp.org/rec/conf/icassp/FathullahWLJSLG24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355
[c10]
persistent URL: https://dblp.org/rec/conf/naacl/FathullahWLLJSM24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-01716
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective internal language model training and fusion for factorized transducer model. CoRR abs/2404.01716 (2024)
2023
[c9]
persistent URL: https://dblp.org/rec/conf/icassp/LiangSSMPZTS23
Dawei Liang, Hang Su, Tarun Singh, Jay Mahadeokar, Shanil Puri, Jiedan Zhu, Edison Thomaz, Mike Seltzer:
Dynamic Speech Endpoint Detection with Regression Targets. ICASSP 2023: 1-5
[c8]
persistent URL: https://dblp.org/rec/conf/interspeech/FathullahWSJXML23
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. INTERSPEECH 2023: 241-245
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-12498
Yassir Fathullah, Chunyang Wu, Yuan Shangguan, Junteng Jia, Wenhan Xiong, Jay Mahadeokar, Chunxi Liu, Yangyang Shi, Ozlem Kalinli, Mike Seltzer, Mark J. F. Gales:
Multi-Head State Space Model for Speech Recognition. CoRR abs/2305.12498 (2023)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2307-11795
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-01947
Yuan Shangguan, Haichuan Yang, Danni Li, Chunyang Wu, Yassir Fathullah, Dilin Wang, Ayushi Dalmia, Raghuraman Krishnamoorthi, Ozlem Kalinli, Junteng Jia, Jay Mahadeokar, Xin Lei, Mike Seltzer, Vikas Chandra:
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models. CoRR abs/2309.01947 (2023)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-06753
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023)
2022
[c7]
persistent URL: https://dblp.org/rec/conf/icassp/ShiWWXMZLLSNKS22
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer based Speech Recognition Using Non-Causal Convolution. ICASSP 2022: 8277-8281
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-14252
Dawei Liang, Hang Su, Tarun Singh, Jay Mahadeokar, Shanil Puri, Jiedan Zhu, Edison Thomaz, Mike Seltzer:
Dynamic Speech Endpoint Detection with Regression Targets. CoRR abs/2210.14252 (2022)
2021
[c6]
persistent URL: https://dblp.org/rec/conf/asru/ZhangMZZSSCPSS21
Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer:
On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models. ASRU 2021: 1026-1033
[c5]
persistent URL: https://dblp.org/rec/conf/icassp/ShiWWYC0LS21
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer:
Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition. ICASSP 2021: 6783-6787
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2107-04154
Xiaohui Zhang, Vimal Manohar, David Zhang, Frank Zhang, Yangyang Shi, Nayan Singhal, Julian Chan, Fuchun Peng, Yatharth Saraf, Mike Seltzer:
On lattice-free boosted MMI training of HMM and CTC-based full-context ASR models. CoRR abs/2107.04154 (2021)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-03174
Dawei Liang, Yangyang Shi, Yun Wang, Nayan Singhal, Alex Xiao, Jonathan Shaw, Edison Thomaz, Ozlem Kalinli, Mike Seltzer:
Transferring Voice Knowledge for Acoustic Event Detection: An Empirical Study. CoRR abs/2110.03174 (2021)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-05241
Yangyang Shi, Chunyang Wu, Dilin Wang, Alex Xiao, Jay Mahadeokar, Xiaohui Zhang, Chunxi Liu, Ke Li, Yuan Shangguan, Varun Nagaraja, Ozlem Kalinli, Mike Seltzer:
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution. CoRR abs/2110.05241 (2021)

2010 – 2019

2017
[c4]
persistent URL: https://dblp.org/rec/conf/icassp/XiongDHSSSYZ17
Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
The Microsoft 2016 conversational speech recognition system. ICASSP 2017: 5255-5259
2016
[i2]
persistent URL: https://dblp.org/rec/journals/corr/XiongDHSSSYZ16
Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
The Microsoft 2016 Conversational Speech Recognition System. CoRR abs/1609.03528 (2016)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/XiongDHSSSYZ16a
Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
Achieving Human Parity in Conversational Speech Recognition. CoRR abs/1610.05256 (2016)
2014
[c3]
persistent URL: https://dblp.org/rec/conf/icassp/Boulanger-LewandowskiDSY14
Nicolas Boulanger-Lewandowski, Jasha Droppo, Mike Seltzer, Dong Yu:
Phone sequence modeling with recurrent neural networks. ICASSP 2014: 5417-5421
[c2]
persistent URL: https://dblp.org/rec/conf/icassp/XueLYSG14
Jian Xue, Jinyu Li, Dong Yu, Mike Seltzer, Yifan Gong:
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network. ICASSP 2014: 6359-6363
2013
[c1]
persistent URL: https://dblp.org/rec/conf/icassp/JansenDGJKCFHMRSCMVBBCDFHLLNPRST13
Aren Jansen, Emmanuel Dupoux, Sharon Goldwater, Mark Johnson, Sanjeev Khudanpur, Kenneth Church, Naomi Feldman, Hynek Hermansky, Florian Metze, Richard C. Rose, Mike Seltzer, Pascal Clark, Ian McGraw, Balakrishnan Varadarajan, Erin Bennett, Benjamin Börschinger, Justin T. Chiu, Ewan Dunbar, Abdellah Fourtassi, David Harwath, Chia-ying Lee, Keith D. Levin, Atta Norouzian, Vijayaditya Peddinti, Rachael Richardson, Thomas Schatz, Samuel Thomas:
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition. ICASSP 2013: 8111-8115
