default search action
Suchin Gururangan
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c18]Kai Nylund, Suchin Gururangan, Noah A. Smith:
Time is Encoded in the Weights of Finetuned Language Models. ACL (1) 2024: 2571-2587 - [c17]Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge:
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters. ACL (1) 2024: 7393-7420 - [c16]Sewon Min, Suchin Gururangan, Eric Wallace, Weijia Shi, Hannaneh Hajishirzi, Noah A. Smith, Luke Zettlemoyer:
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore. ICLR 2024 - [c15]Mengzhou Xia, Sadhika Malladi, Suchin Gururangan, Sanjeev Arora, Danqi Chen:
LESS: Selecting Influential Data for Targeted Instruction Tuning. ICML 2024 - [c14]Trishita Tiwari, Suchin Gururangan, Chuan Guo, Weizhe Hua, Sanjay Kariyappa, Udit Gupta, Wenjie Xiong, Kiwan Maeng, Hsien-Hsin S. Lee, G. Edward Suh:
Information Flow Control in Machine Learning through Modular Model Architecture. USENIX Security Symposium 2024 - [i25]Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge:
AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters. CoRR abs/2401.06408 (2024) - [i24]Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer:
Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models. CoRR abs/2401.10440 (2024) - [i23]Mengzhou Xia, Sadhika Malladi, Suchin Gururangan, Sanjeev Arora, Danqi Chen:
LESS: Selecting Influential Data for Targeted Instruction Tuning. CoRR abs/2402.04333 (2024) - [i22]Samir Yitzhak Gadre, Georgios Smyrnis, Vaishaal Shankar, Suchin Gururangan, Mitchell Wortsman, Rulin Shao, Jean Mercat, Alex Fang, Jeffrey Li, Sedrick Keh, Rui Xin, Marianna Nezhurina, Igor Vasiljevic, Jenia Jitsev, Alexandros G. Dimakis, Gabriel Ilharco, Shuran Song, Thomas Kollar, Yair Carmon, Achal Dave, Reinhard Heckel, Niklas Muennighoff, Ludwig Schmidt:
Language models scale reliably with over-training and on downstream tasks. CoRR abs/2403.08540 (2024) - [i21]Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Yitzhak Gadre, Hritik Bansal, Etash Kumar Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner, Maciej Kilian, Hanlin Zhang, Rulin Shao, Sarah M. Pratt, Sunny Sanyal, Gabriel Ilharco, Giannis Daras, Kalyani Marathe, Aaron Gokaslan, Jieyu Zhang, Khyathi Raghavi Chandu, Thao Nguyen, Igor Vasiljevic, Sham M. Kakade, Shuran Song, Sujay Sanghavi, Fartash Faghri, Sewoong Oh, Luke Zettlemoyer, Kyle Lo, Alaaeldin El-Nouby, Hadi Pouransari, Alexander Toshev, Stephanie Wang, Dirk Groeneveld, Luca Soldaini, Pang Wei Koh, Jenia Jitsev, Thomas Kollar, Alexandros G. Dimakis, Yair Carmon, Achal Dave, Ludwig Schmidt, Vaishaal Shankar:
DataComp-LM: In search of the next generation of training sets for language models. CoRR abs/2406.11794 (2024) - 2023
- [j1]Mitchell Wortsman, Suchin Gururangan, Shen Li, Ali Farhadi, Ludwig Schmidt, Michael G. Rabbat, Ari S. Morcos:
lo-fi: distributed fine-tuning without communication. Trans. Mach. Learn. Res. 2023 (2023) - [i20]Suchin Gururangan, Margaret Li, Mike Lewis, Weijia Shi, Tim Althoff, Noah A. Smith, Luke Zettlemoyer:
Scaling Expert Language Models with Unsupervised Domain Discovery. CoRR abs/2303.14177 (2023) - [i19]Trishita Tiwari, Suchin Gururangan, Chuan Guo, Weizhe Hua, Sanjay Kariyappa, Udit Gupta, Wenjie Xiong, Kiwan Maeng, Hsien-Hsin S. Lee, G. Edward Suh:
Information Flow Control in Machine Learning through Modular Model Architecture. CoRR abs/2306.03235 (2023) - [i18]Sewon Min, Suchin Gururangan, Eric Wallace, Hannaneh Hajishirzi, Noah A. Smith, Luke Zettlemoyer:
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore. CoRR abs/2308.04430 (2023) - [i17]Kai Nylund, Suchin Gururangan, Noah A. Smith:
Time is Encoded in the Weights of Finetuned Language Models. CoRR abs/2312.13401 (2023) - 2022
- [c13]Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer:
M2D2: A Massively Multi-Domain Language Modeling Dataset. EMNLP 2022: 964-975 - [c12]Suchin Gururangan, Dallas Card, Sarah K. Dreier, Emily K. Gade, Leroy Z. Wang, Zeyu Wang, Luke Zettlemoyer, Noah A. Smith:
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection. EMNLP 2022: 2562-2580 - [c11]Weijia Shi, Julian Michael, Suchin Gururangan, Luke Zettlemoyer:
Nearest Neighbor Zero-Shot Inference. EMNLP 2022: 3254-3265 - [c10]Suchin Gururangan, Mike Lewis, Ari Holtzman, Noah A. Smith, Luke Zettlemoyer:
DEMix Layers: Disentangling Domains for Modular Language Modeling. NAACL-HLT 2022: 5557-5576 - [c9]Kelvin Luu, Daniel Khashabi, Suchin Gururangan, Karishma Mandyam, Noah A. Smith:
Time Waits for No One! Analysis and Challenges of Temporal Misalignment. NAACL-HLT 2022: 5944-5958 - [i16]Suchin Gururangan, Dallas Card, Sarah K. Dreier, Emily K. Gade, Leroy Z. Wang, Zeyu Wang, Luke Zettlemoyer, Noah A. Smith:
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection. CoRR abs/2201.10474 (2022) - [i15]Weijia Shi, Julian Michael, Suchin Gururangan, Luke Zettlemoyer:
Nearest Neighbor Zero-Shot Inference. CoRR abs/2205.13792 (2022) - [i14]Margaret Li, Suchin Gururangan, Tim Dettmers, Mike Lewis, Tim Althoff, Noah A. Smith, Luke Zettlemoyer:
Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models. CoRR abs/2208.03306 (2022) - [i13]Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer:
M2D2: A Massively Multi-domain Language Modeling Dataset. CoRR abs/2210.07370 (2022) - [i12]Mitchell Wortsman, Suchin Gururangan, Shen Li, Ali Farhadi, Ludwig Schmidt, Michael G. Rabbat, Ari S. Morcos:
lo-fi: distributed fine-tuning without communication. CoRR abs/2210.11948 (2022) - [i11]Gabriel Ilharco, Marco Túlio Ribeiro, Mitchell Wortsman, Suchin Gururangan, Ludwig Schmidt, Hannaneh Hajishirzi, Ali Farhadi:
Editing Models with Task Arithmetic. CoRR abs/2212.04089 (2022) - 2021
- [c8]Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan, Noah A. Smith:
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text. ACL/IJCNLP (1) 2021: 7282-7296 - [c7]Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith:
Expected Validation Performance and Estimation of a Random Variable's Maximum. EMNLP (Findings) 2021: 4066-4073 - [c6]Albert Xu, Eshaan Pathak, Eric Wallace, Suchin Gururangan, Maarten Sap, Dan Klein:
Detoxifying Language Models Risks Marginalizing Minority Voices. NAACL-HLT 2021: 2390-2397 - [i10]Albert Xu, Eshaan Pathak, Eric Wallace, Suchin Gururangan, Maarten Sap, Dan Klein:
Detoxifying Language Models Risks Marginalizing Minority Voices. CoRR abs/2104.06390 (2021) - [i9]Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan, Noah A. Smith:
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text. CoRR abs/2107.00061 (2021) - [i8]Suchin Gururangan, Mike Lewis, Ari Holtzman, Noah A. Smith, Luke Zettlemoyer:
DEMix Layers: Disentangling Domains for Modular Language Modeling. CoRR abs/2108.05036 (2021) - [i7]Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith:
Expected Validation Performance and Estimation of a Random Variable's Maximum. CoRR abs/2110.00613 (2021) - [i6]Kelvin Luu, Daniel Khashabi, Suchin Gururangan, Karishma Mandyam, Noah A. Smith:
Time Waits for No One! Analysis and Challenges of Temporal Misalignment. CoRR abs/2111.07408 (2021) - 2020
- [c5]Suchin Gururangan, Ana Marasovic, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey, Noah A. Smith:
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks. ACL 2020: 8342-8360 - [c4]Samuel Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi, Noah A. Smith:
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models. EMNLP (Findings) 2020: 3356-3369 - [i5]Suchin Gururangan, Ana Marasovic, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey, Noah A. Smith:
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks. CoRR abs/2004.10964 (2020) - [i4]Samuel Gehman, Suchin Gururangan, Maarten Sap, Yejin Choi, Noah A. Smith:
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models. CoRR abs/2009.11462 (2020)
2010 – 2019
- 2019
- [c3]Suchin Gururangan, Tam Dang, Dallas Card, Noah A. Smith:
Variational Pretraining for Semi-supervised Text Classification. ACL (1) 2019: 5880-5894 - [c2]Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith:
Show Your Work: Improved Reporting of Experimental Results. EMNLP/IJCNLP (1) 2019: 2185-2194 - [i3]Suchin Gururangan, Tam Dang, Dallas Card, Noah A. Smith:
Variational Pretraining for Semi-supervised Text Classification. CoRR abs/1906.02242 (2019) - [i2]Jesse Dodge, Suchin Gururangan, Dallas Card, Roy Schwartz, Noah A. Smith:
Show Your Work: Improved Reporting of Experimental Results. CoRR abs/1909.03004 (2019) - 2018
- [c1]Suchin Gururangan, Swabha Swayamdipta, Omer Levy, Roy Schwartz, Samuel R. Bowman, Noah A. Smith:
Annotation Artifacts in Natural Language Inference Data. NAACL-HLT (2) 2018: 107-112 - [i1]Suchin Gururangan, Swabha Swayamdipta, Omer Levy, Roy Schwartz, Samuel R. Bowman, Noah A. Smith:
Annotation Artifacts in Natural Language Inference Data. CoRR abs/1803.02324 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:07 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint