Skip to main content

Showing 1–12 of 12 results for author: Fleisig, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16342  [pdf, other

    cs.CL

    ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks

    Authors: Yoo Yeon Sung, Eve Fleisig, Ishani Mondal, Jordan Lee Boyd-Graber

    Abstract: Adversarial benchmarks validate model abilities by providing samples that fool models but not humans. However, despite the proliferation of datasets that claim to be adversarial, there does not exist an established metric to evaluate how adversarial these datasets are. To address this lacuna, we introduce ADVSCORE, a metric which quantifies how adversarial and discriminative an adversarial dataset… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2401.11185

  2. arXiv:2406.08818  [pdf, other

    cs.CL cs.CY

    Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination

    Authors: Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein

    Abstract: We present a large-scale study of linguistic bias exhibited by ChatGPT covering ten dialects of English (Standard American English, Standard British English, and eight widely spoken non-"standard" varieties from around the world). We prompted GPT-3.5 Turbo and GPT-4 with text by native speakers of each variety and analyzed the responses via detailed linguistic feature annotation and native speaker… ▽ More

    Submitted 17 September, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.08726  [pdf, ps, other

    cs.CL

    Standard Language Ideology in AI-Generated Language

    Authors: Genevieve Smith, Eve Fleisig, Madeline Bossi, Ishita Rustagi, Xavier Yin

    Abstract: In this position paper, we explore standard language ideology in language generated by large language models (LLMs). First, we outline how standard language ideology is reflected and reinforced in LLMs. We then present a taxonomy of open problems regarding standard language ideology in AI-generated language with implications for minoritized language communities. We introduce the concept of standar… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  4. arXiv:2405.05860  [pdf, other

    cs.LG cs.CL cs.CY

    The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels

    Authors: Eve Fleisig, Su Lin Blodgett, Dan Klein, Zeerak Talat

    Abstract: Longstanding data labeling practices in machine learning involve collecting and aggregating labels from multiple annotators. But what should we do when annotators disagree? Though annotator disagreement has long been seen as a problem to minimize, new perspectivist approaches challenge this assumption by treating disagreement as a valuable source of information. In this position paper, we examine… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  5. arXiv:2404.13038  [pdf, ps, other

    cs.AI cs.CY

    Mapping Social Choice Theory to RLHF

    Authors: Jessica Dai, Eve Fleisig

    Abstract: Recent work on the limitations of using reinforcement learning from human feedback (RLHF) to incorporate human preferences into model behavior often raises social choice theory as a reference point. Social choice theory's analysis of settings such as voting mechanisms provides technical infrastructure that can inform how to aggregate human preferences amid disagreement. We analyze the problem sett… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  6. arXiv:2311.05020  [pdf, other

    cs.CL

    First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models

    Authors: Naomi Saphra, Eve Fleisig, Kyunghyun Cho, Adam Lopez

    Abstract: Many NLP researchers are experiencing an existential crisis triggered by the astonishing success of ChatGPT and other systems based on large language models (LLMs). After such a disruptive change to our understanding of the field, what is left to do? Taking a historical lens, we look for guidance from the first era of LLMs, which began in 2005 with large $n$-gram models for machine translation (MT… ▽ More

    Submitted 25 March, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  7. arXiv:2311.02802  [pdf, other

    cs.CL cs.AI

    Incorporating Worker Perspectives into MTurk Annotation Practices for NLP

    Authors: Olivia Huang, Eve Fleisig, Dan Klein

    Abstract: Current practices regarding data collection for natural language processing on Amazon Mechanical Turk (MTurk) often rely on a combination of studies on data quality and heuristics shared among NLP researchers. However, without considering the perspectives of MTurk workers, these approaches are susceptible to issues regarding workers' rights and poor response quality. We conducted a critical litera… ▽ More

    Submitted 15 November, 2023; v1 submitted 5 November, 2023; originally announced November 2023.

  8. arXiv:2305.15047  [pdf, other

    cs.CL cs.AI

    Ghostbuster: Detecting Text Ghostwritten by Large Language Models

    Authors: Vivek Verma, Eve Fleisig, Nicholas Tomlin, Dan Klein

    Abstract: We introduce Ghostbuster, a state-of-the-art system for detecting AI-generated text. Our method works by passing documents through a series of weaker language models, running a structured search over possible combinations of their features, and then training a classifier on the selected features to predict whether documents are AI-generated. Crucially, Ghostbuster does not require access to token… ▽ More

    Submitted 5 April, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NAACL 2024

  9. arXiv:2305.14735  [pdf, other

    cs.CL cs.AI cs.LG

    Centering the Margins: Outlier-Based Identification of Harmed Populations in Toxicity Detection

    Authors: Vyoma Raman, Eve Fleisig, Dan Klein

    Abstract: The impact of AI models on marginalized communities has traditionally been measured by identifying performance differences between specified demographic subgroups. Though this approach aims to center vulnerable groups, it risks obscuring patterns of harm faced by intersectional subgroups or shared across multiple groups. To address this, we draw on theories of marginalization from disability studi… ▽ More

    Submitted 1 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  10. arXiv:2305.06626  [pdf, other

    cs.CL cs.AI

    When the Majority is Wrong: Modeling Annotator Disagreement for Subjective Tasks

    Authors: Eve Fleisig, Rediet Abebe, Dan Klein

    Abstract: Though majority vote among annotators is typically used for ground truth labels in natural language processing, annotator disagreement in tasks such as hate speech detection may reflect differences in opinion across groups, not noise. Thus, a crucial problem in hate speech detection is determining whether a statement is offensive to the demographic group that it targets, when that group may consti… ▽ More

    Submitted 17 March, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

  11. arXiv:2203.10675  [pdf, other

    cs.CL cs.AI

    Mitigating Gender Bias in Machine Translation through Adversarial Learning

    Authors: Eve Fleisig, Christiane Fellbaum

    Abstract: Machine translation and other NLP systems often contain significant biases regarding sensitive attributes, such as gender or race, that worsen system performance and perpetuate harmful stereotypes. Recent preliminary research suggests that adversarial learning can be used as part of a model-agnostic bias mitigation method that requires no data modifications. However, adapting this strategy for mac… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

  12. arXiv:2010.02316  [pdf, other

    cs.CL

    Sentiment Analysis for Reinforcement Learning

    Authors: Ameet Deshpande, Eve Fleisig

    Abstract: While reinforcement learning (RL) has been successful in natural language processing (NLP) domains such as dialogue generation and text-based games, it typically faces the problem of sparse rewards that leads to slow or no convergence. Traditional methods that use text descriptions to extract only a state representation ignore the feedback inherently present in them. In text-based games, for examp… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Work in progress