default search action
Erich Elsen
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2022
- [c24]Nikolay Savinov, Junyoung Chung, Mikolaj Binkowski, Erich Elsen, Aäron van den Oord:
Step-unrolled Denoising Autoencoders for Text Generation. ICLR 2022 - [c23]Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George van den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego de Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan, Jack W. Rae, Erich Elsen, Laurent Sifre:
Improving Language Models by Retrieving from Trillions of Tokens. ICML 2022: 2206-2240 - [c22]Aidan Clark, Diego de Las Casas, Aurelia Guy, Arthur Mensch, Michela Paganini, Jordan Hoffmann, Bogdan Damoc, Blake A. Hechtman, Trevor Cai, Sebastian Borgeaud, George van den Driessche, Eliza Rutherford, Tom Hennigan, Matthew J. Johnson, Albin Cassirer, Chris Jones, Elena Buchatskaya, David Budden, Laurent Sifre, Simon Osindero, Oriol Vinyals, Marc'Aurelio Ranzato, Jack W. Rae, Erich Elsen, Koray Kavukcuoglu, Karen Simonyan:
Unified Scaling Laws for Routed Language Models. ICML 2022: 4057-4086 - [c21]Laura Graesser, Utku Evci, Erich Elsen, Pablo Samuel Castro:
The State of Sparse Training in Deep Reinforcement Learning. ICML 2022: 7766-7792 - [c20]Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katherine Millican, George van den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Oriol Vinyals, Jack W. Rae, Laurent Sifre:
An empirical analysis of compute-optimal large language model training. NeurIPS 2022 - [i28]Aidan Clark, Diego de Las Casas, Aurelia Guy, Arthur Mensch, Michela Paganini, Jordan Hoffmann, Bogdan Damoc, Blake A. Hechtman, Trevor Cai, Sebastian Borgeaud, George van den Driessche, Eliza Rutherford, Tom Hennigan, Matthew J. Johnson, Katie Millican, Albin Cassirer, Chris Jones, Elena Buchatskaya, David Budden, Laurent Sifre, Simon Osindero, Oriol Vinyals, Jack W. Rae, Erich Elsen, Koray Kavukcuoglu, Karen Simonyan:
Unified Scaling Laws for Routed Language Models. CoRR abs/2202.01169 (2022) - [i27]Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, Lisa Anne Hendricks, Johannes Welbl, Aidan Clark, Tom Hennigan, Eric Noland, Katie Millican, George van den Driessche, Bogdan Damoc, Aurelia Guy, Simon Osindero, Karen Simonyan, Erich Elsen, Jack W. Rae, Oriol Vinyals, Laurent Sifre:
Training Compute-Optimal Large Language Models. CoRR abs/2203.15556 (2022) - [i26]Laura Graesser, Utku Evci, Erich Elsen, Pablo Samuel Castro:
The State of Sparse Training in Deep Reinforcement Learning. CoRR abs/2206.10369 (2022) - 2021
- [c19]Jeff Donahue, Sander Dieleman, Mikolaj Binkowski, Erich Elsen, Karen Simonyan:
End-to-end Adversarial Text-to-Speech. ICLR 2021 - [c18]Jacob Menick, Erich Elsen, Utku Evci, Simon Osindero, Karen Simonyan, Alex Graves:
Practical Real Time Recurrent Learning with a Sparse Approximation. ICLR 2021 - [i25]Siddhant M. Jayakumar, Razvan Pascanu, Jack W. Rae, Simon Osindero, Erich Elsen:
Top-KAST: Top-K Always Sparse Training. CoRR abs/2106.03517 (2021) - [i24]Sebastian Borgeaud, Arthur Mensch, Jordan Hoffmann, Trevor Cai, Eliza Rutherford, Katie Millican, George van den Driessche, Jean-Baptiste Lespiau, Bogdan Damoc, Aidan Clark, Diego de Las Casas, Aurelia Guy, Jacob Menick, Roman Ring, Tom Hennigan, Saffron Huang, Loren Maggiore, Chris Jones, Albin Cassirer, Andy Brock, Michela Paganini, Geoffrey Irving, Oriol Vinyals, Simon Osindero, Karen Simonyan, Jack W. Rae, Erich Elsen, Laurent Sifre:
Improving language models by retrieving from trillions of tokens. CoRR abs/2112.04426 (2021) - [i23]Nikolay Savinov, Junyoung Chung, Mikolaj Binkowski, Erich Elsen, Aäron van den Oord:
Step-unrolled Denoising Autoencoders for Text Generation. CoRR abs/2112.06749 (2021) - [i22]Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, H. Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor, Irina Higgins, Antonia Creswell, Nat McAleese, Amy Wu, Erich Elsen, Siddhant M. Jayakumar, Elena Buchatskaya, David Budden, Esme Sutherland, Karen Simonyan, Michela Paganini, Laurent Sifre, Lena Martens, Xiang Lorraine Li, Adhiguna Kuncoro, Aida Nematzadeh, Elena Gribovskaya, Domenic Donato, Angeliki Lazaridou, Arthur Mensch, Jean-Baptiste Lespiau, Maria Tsimpoukelli, Nikolai Grigorev, Doug Fritz, Thibault Sottiaux, Mantas Pajarskas, Toby Pohlen, Zhitao Gong, Daniel Toyama, Cyprien de Masson d'Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew J. Johnson, Blake A. Hechtman, Laura Weidinger, Iason Gabriel, William Isaac, Edward Lockhart, Simon Osindero, Laura Rimell, Chris Dyer, Oriol Vinyals, Kareem Ayoub, Jeff Stanway, Lorrayne Bennett, Demis Hassabis, Koray Kavukcuoglu, Geoffrey Irving:
Scaling Language Models: Methods, Analysis & Insights from Training Gopher. CoRR abs/2112.11446 (2021) - 2020
- [c17]Erich Elsen, Marat Dukhan, Trevor Gale, Karen Simonyan:
Fast Sparse ConvNets. CVPR 2020: 14617-14626 - [c16]Mikolaj Binkowski, Jeff Donahue, Sander Dieleman, Aidan Clark, Erich Elsen, Norman Casagrande, Luis C. Cobo, Karen Simonyan:
High Fidelity Speech Synthesis with Adversarial Networks. ICLR 2020 - [c15]Utku Evci, Trevor Gale, Jacob Menick, Pablo Samuel Castro, Erich Elsen:
Rigging the Lottery: Making All Tickets Winners. ICML 2020: 2943-2952 - [c14]Samuel L. Smith, Erich Elsen, Soham De:
On the Generalization Benefit of Noise in Stochastic Gradient Descent. ICML 2020: 9058-9067 - [c13]Siddhant M. Jayakumar, Razvan Pascanu, Jack W. Rae, Simon Osindero, Erich Elsen:
Top-KAST: Top-K Always Sparse Training. NeurIPS 2020 - [c12]Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen:
Sparse GPU kernels for deep learning. SC 2020: 17 - [i21]Jeff Donahue, Sander Dieleman, Mikolaj Binkowski, Erich Elsen, Karen Simonyan:
End-to-End Adversarial Text-to-Speech. CoRR abs/2006.03575 (2020) - [i20]Jacob Menick, Erich Elsen, Utku Evci, Simon Osindero, Karen Simonyan, Alex Graves:
A Practical Sparse Approximation for Real Time Recurrent Learning. CoRR abs/2006.07232 (2020) - [i19]Jordan Hoffmann, Simon Schmitt, Simon Osindero, Karen Simonyan, Erich Elsen:
AlgebraNets. CoRR abs/2006.07360 (2020) - [i18]Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen:
Sparse GPU Kernels for Deep Learning. CoRR abs/2006.10901 (2020) - [i17]Samuel L. Smith, Erich Elsen, Soham De:
On the Generalization Benefit of Noise in Stochastic Gradient Descent. CoRR abs/2006.15081 (2020)
2010 – 2019
- 2019
- [c11]Curtis Hawthorne, Andriy Stasyuk, Adam Roberts, Ian Simon, Cheng-Zhi Anna Huang, Sander Dieleman, Erich Elsen, Jesse H. Engel, Douglas Eck:
Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset. ICLR 2019 - [i16]Trevor Gale, Erich Elsen, Sara Hooker:
The State of Sparsity in Deep Neural Networks. CoRR abs/1902.09574 (2019) - [i15]Karel Lenc, Erich Elsen, Tom Schaul, Karen Simonyan:
Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods. CoRR abs/1906.03139 (2019) - [i14]Utku Evci, Fabian Pedregosa, Aidan N. Gomez, Erich Elsen:
The Difficulty of Training Sparse Neural Networks. CoRR abs/1906.10732 (2019) - [i13]Mikolaj Binkowski, Jeff Donahue, Sander Dieleman, Aidan Clark, Erich Elsen, Norman Casagrande, Luis C. Cobo, Karen Simonyan:
High Fidelity Speech Synthesis with Adversarial Networks. CoRR abs/1909.11646 (2019) - [i12]Erich Elsen, Marat Dukhan, Trevor Gale, Karen Simonyan:
Fast Sparse ConvNets. CoRR abs/1911.09723 (2019) - [i11]Utku Evci, Trevor Gale, Jacob Menick, Pablo Samuel Castro, Erich Elsen:
Rigging the Lottery: Making All Tickets Winners. CoRR abs/1911.11134 (2019) - 2018
- [c10]Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory F. Diamos, Erich Elsen, David García, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, Hao Wu:
Mixed Precision Training. ICLR (Poster) 2018 - [c9]Nal Kalchbrenner, Erich Elsen, Karen Simonyan, Seb Noury, Norman Casagrande, Edward Lockhart, Florian Stimberg, Aäron van den Oord, Sander Dieleman, Koray Kavukcuoglu:
Efficient Neural Audio Synthesis. ICML 2018: 2415-2424 - [c8]Aäron van den Oord, Yazhe Li, Igor Babuschkin, Karen Simonyan, Oriol Vinyals, Koray Kavukcuoglu, George van den Driessche, Edward Lockhart, Luis C. Cobo, Florian Stimberg, Norman Casagrande, Dominik Grewe, Seb Noury, Sander Dieleman, Erich Elsen, Nal Kalchbrenner, Heiga Zen, Alex Graves, Helen King, Tom Walters, Dan Belov, Demis Hassabis:
Parallel WaveNet: Fast High-Fidelity Speech Synthesis. ICML 2018: 3915-3923 - [c7]Curtis Hawthorne, Erich Elsen, Jialin Song, Adam Roberts, Ian Simon, Colin Raffel, Jesse H. Engel, Sageev Oore, Douglas Eck:
Onsets and Frames: Dual-Objective Piano Transcription. ISMIR 2018: 50-57 - [i10]Nal Kalchbrenner, Erich Elsen, Karen Simonyan, Seb Noury, Norman Casagrande, Edward Lockhart, Florian Stimberg, Aäron van den Oord, Sander Dieleman, Koray Kavukcuoglu:
Efficient Neural Audio Synthesis. CoRR abs/1802.08435 (2018) - [i9]Curtis Hawthorne, Andriy Stasyuk, Adam Roberts, Ian Simon, Cheng-Zhi Anna Huang, Sander Dieleman, Erich Elsen, Jesse H. Engel, Douglas Eck:
Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset. CoRR abs/1810.12247 (2018) - 2017
- [c6]Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Enhao Gong, Shijian Tang, Erich Elsen, Peter Vajda, Manohar Paluri, John Tran, Bryan Catanzaro, William J. Dally:
DSD: Dense-Sparse-Dense Training for Deep Neural Networks. ICLR (Poster) 2017 - [c5]Sharan Narang, Greg Diamos, Shubho Sengupta, Erich Elsen:
Exploring Sparsity in Recurrent Neural Networks. ICLR (Poster) 2017 - [i8]Sharan Narang, Gregory F. Diamos, Shubho Sengupta, Erich Elsen:
Exploring Sparsity in Recurrent Neural Networks. CoRR abs/1704.05119 (2017) - [i7]Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory F. Diamos, Erich Elsen, David García, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, Hao Wu:
Mixed Precision Training. CoRR abs/1710.03740 (2017) - [i6]Curtis Hawthorne, Erich Elsen, Jialin Song, Adam Roberts, Ian Simon, Colin Raffel, Jesse H. Engel, Sageev Oore, Douglas Eck:
Onsets and Frames: Dual-Objective Piano Transcription. CoRR abs/1710.11153 (2017) - [i5]Aäron van den Oord, Yazhe Li, Igor Babuschkin, Karen Simonyan, Oriol Vinyals, Koray Kavukcuoglu, George van den Driessche, Edward Lockhart, Luis C. Cobo, Florian Stimberg, Norman Casagrande, Dominik Grewe, Seb Noury, Sander Dieleman, Erich Elsen, Nal Kalchbrenner, Heiga Zen, Alex Graves, Helen King, Tom Walters, Dan Belov, Demis Hassabis:
Parallel WaveNet: Fast High-Fidelity Speech Synthesis. CoRR abs/1711.10433 (2017) - 2016
- [c4]Dario Amodei, Sundaram Ananthanarayanan, Rishita Anubhai, Jingliang Bai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse H. Engel, Linxi Fan, Christopher Fougner, Awni Y. Hannun, Billy Jun, Tony Han, Patrick LeGresley, Xiangang Li, Libby Lin, Sharan Narang, Andrew Y. Ng, Sherjil Ozair, Ryan Prenger, Sheng Qian, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Chong Wang, Yi Wang, Zhiqian Wang, Bo Xiao, Yan Xie, Dani Yogatama, Jun Zhan, Zhenyao Zhu:
Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin. ICML 2016: 173-182 - [c3]Greg Diamos, Shubho Sengupta, Bryan Catanzaro, Mike Chrzanowski, Adam Coates, Erich Elsen, Jesse H. Engel, Awni Y. Hannun, Sanjeev Satheesh:
Persistent RNNs: Stashing Recurrent Weights On-Chip. ICML 2016: 2024-2033 - [i4]Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Shijian Tang, Erich Elsen, Bryan Catanzaro, John Tran, William J. Dally:
DSD: Regularizing Deep Neural Networks with Dense-Sparse-Dense Training Flow. CoRR abs/1607.04381 (2016) - 2015
- [i3]Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse H. Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Y. Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Y. Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yogatama, Jun Zhan, Zhenyao Zhu:
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. CoRR abs/1512.02595 (2015) - 2014
- [i2]Awni Y. Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y. Ng:
Deep Speech: Scaling up end-to-end speech recognition. CoRR abs/1412.5567 (2014) - 2011
- [c2]Zach DeVito, Niels Joubert, Francisco Palacios, Stephen Oakley, Montserrat Medina, Mike Barrientos, Erich Elsen, Frank Ham, Alex Aiken, Karthik Duraisamy, Eric Darve, Juan J. Alonso, Pat Hanrahan:
Liszt: a domain specific language for building portable mesh-based PDE solvers. SC 2011: 9:1-9:12
2000 – 2009
- 2008
- [j1]Erich Elsen, Patrick LeGresley, Eric Darve:
Large calculation of the flow over a hypersonic vehicle using a GPU. J. Comput. Phys. 227(24): 10148-10161 (2008) - 2007
- [i1]Erich Elsen, Vaidyanathan Vishal, Mike Houston, Vijay S. Pande, Pat Hanrahan, Eric Darve:
N-Body Simulations on GPUs. CoRR abs/0706.3060 (2007) - 2006
- [c1]Erich Elsen, Mike Houston, Vaidyanathan Vishal, Eric Darve, Pat Hanrahan, Vijay S. Pande:
Poster reception - N-Body simulation on GPUs. SC 2006: 188
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-04-25 05:45 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint