
Showing 1–17 of 17 results for author: Kasirzadeh, A

Searching in archive cs.
  1. arXiv:2406.11843  [pdf]

    cs.CY cs.AI

    Explanation Hacking: The perils of algorithmic recourse

    Authors: Emily Sullivan, Atoosa Kasirzadeh

    Abstract: We argue that the trend toward providing users with feasible and actionable explanations of AI decisions, known as recourse explanations, comes with ethical downsides. Specifically, we argue that recourse explanations face several conceptual pitfalls and can lead to problematic explanation hacking, which undermines their ethical status. As an alternative, we advocate that explanations of AI decisi…

    Submitted 22 March, 2024; originally announced June 2024.

  2. arXiv:2405.13974  [pdf, other]

    cs.CL cs.AI

    CIVICS: Building a Dataset for Examining Culturally-Informed Values in Large Language Models

    Authors: Giada Pistilli, Alina Leidinger, Yacine Jernite, Atoosa Kasirzadeh, Alexandra Sasha Luccioni, Margaret Mitchell

    Abstract: This paper introduces the "CIVICS: Culturally-Informed & Values-Inclusive Corpus for Societal impacts" dataset, designed to evaluate the social and cultural variation of Large Language Models (LLMs) across multiple languages and value-sensitive topics. We create a hand-crafted, multilingual dataset of value-laden prompts which address specific socially sensitive topics, including LGBTQI rights, so…

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2404.09932  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    Foundational Challenges in Assuring Alignment and Safety of Large Language Models

    Authors: Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, et al. (13 additional authors not shown)

    Abstract: This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories: scientific understanding of LLMs, development and deployment methods, and sociotechnical challenges. Based on the identified challenges, we pose $200+$ concrete research questions.

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2404.00579  [pdf, other]

    cs.IR cs.AI

    A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys)

    Authors: Yashar Deldjoo, Zhankui He, Julian McAuley, Anton Korikov, Scott Sanner, Arnau Ramisa, René Vidal, Maheswaran Sathiamoorthy, Atoosa Kasirzadeh, Silvia Milano

    Abstract: Traditional recommender systems (RS) typically use user-item rating histories as their main data source. However, deep generative models now have the capability to model and sample from complex data distributions, including user-item interactions, text, images, and videos, enabling novel recommendation tasks. This comprehensive, multidisciplinary survey connects key advancements in RS using Genera…

    Submitted 4 July, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: This survey accompanies a tutorial presented at ACM KDD'24

  5. arXiv:2402.06811  [pdf, ps, other]

    cs.AI

    Discipline and Label: A WEIRD Genealogy and Social Theory of Data Annotation

    Authors: Andrew Smart, Ding Wang, Ellis Monk, Mark Díaz, Atoosa Kasirzadeh, Erin Van Liemt, Sonja Schmer-Galunder

    Abstract: Data annotation remains the sine qua non of machine learning and AI. Recent empirical work on data annotation has begun to highlight the importance of rater diversity for fairness, model performance, and new lines of research have begun to examine the working conditions for data annotation workers, the impacts and role of annotator subjectivity on labels, and the potential psychological harms from…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 18 pages

  6. arXiv:2401.07836  [pdf, ps, other]

    cs.CY cs.AI cs.LG

    Two Types of AI Existential Risk: Decisive and Accumulative

    Authors: Atoosa Kasirzadeh

    Abstract: The conventional discourse on existential risks (x-risks) from AI typically focuses on abrupt, dire events caused by advanced AI systems, particularly those that might achieve or surpass human-level intelligence. These events have severe consequences that either lead to human extinction or irreversibly cripple human civilization to a point beyond recovery. This discourse, however, often neglects t…

    Submitted 6 February, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  7. arXiv:2307.05543  [pdf, ps, other]

    cs.CY

    Typology of Risks of Generative Text-to-Image Models

    Authors: Charlotte Bird, Eddie L. Ungless, Atoosa Kasirzadeh

    Abstract: This paper investigates the direct risks and harms associated with modern text-to-image generative models, such as DALL-E and Midjourney, through a comprehensive literature review. While these models offer unprecedented capabilities for generating images, their development and use introduce new types of risk that require careful consideration. Our review reveals significant knowledge gaps concerni…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: Accepted for publication in 2023 AAAI/ACM Conference on AI, Ethics, and Society (AIES 2023)

  8. arXiv:2306.01479  [pdf, ps, other]

    cs.CY

    Reconciling Governmental Use of Online Targeting With Democracy

    Authors: Katja Andric, Atoosa Kasirzadeh

    Abstract: The societal and epistemological implications of online targeted advertising have been scrutinized by AI ethicists, legal scholars, and policymakers alike. However, the government's use of online targeting and its consequential socio-political ramifications remain under-explored from a critical socio-technical standpoint. This paper investigates the socio-political implications of governmental onl…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT 2023)

  9. arXiv:2304.11163  [pdf, other]

    cs.CY cs.CL

    ChatGPT, Large Language Technologies, and the Bumpy Road of Benefiting Humanity

    Authors: Atoosa Kasirzadeh

    Abstract: The allure of emerging AI technologies is undoubtedly thrilling. However, the promise that AI technologies will benefit all of humanity is empty so long as we lack a nuanced understanding of what humanity is supposed to be in the face of widening global inequality and pressing existential threats. Going forward, it is crucial to invest in rigorous and collaborative AI safety and ethics research. W…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: As part of a series on Daily Nous: "Philosophers on next-generation large language models"

  10. arXiv:2209.00731  [pdf, ps, other]

    cs.CY cs.CL

    In conversation with Artificial Intelligence: aligning language models with human values

    Authors: Atoosa Kasirzadeh, Iason Gabriel

    Abstract: Large-scale language technologies are increasingly used in various forms of communication with humans across different contexts. One particular use case for these technologies is conversational agents, which output natural language text in response to prompts and queries. This mode of engagement raises a number of social and ethical questions. For example, what does it mean to align conversational…

    Submitted 21 December, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: Accepted for publication with minor revisions at Philosophy & Technology

  11. Algorithmic Fairness and Structural Injustice: Insights from Feminist Political Philosophy

    Authors: Atoosa Kasirzadeh

    Abstract: Data-driven predictive algorithms are widely used to automate and guide high-stake decision making such as bail and parole recommendation, medical resource distribution, and mortgage allocation. Nevertheless, harmful outcomes biased against vulnerable groups have been reported. The growing research field known as 'algorithmic fairness' aims to mitigate these harmful biases. Its primary methodology…

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: This paper is accepted for publication in the Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society (AIES 22)

  12. arXiv:2112.04359  [pdf, other]

    cs.CL cs.AI cs.CY

    Ethical and social risks of harm from Language Models

    Authors: Laura Weidinger, John Mellor, Maribeth Rauh, Conor Griffin, Jonathan Uesato, Po-Sen Huang, Myra Cheng, Mia Glaese, Borja Balle, Atoosa Kasirzadeh, Zac Kenton, Sasha Brown, Will Hawkins, Tom Stepleton, Courtney Biles, Abeba Birhane, Julia Haas, Laura Rimell, Lisa Anne Hendricks, William Isaac, Sean Legassick, Geoffrey Irving, Iason Gabriel

    Abstract: This paper aims to help structure the risk landscape associated with large-scale Language Models (LMs). In order to foster advances in responsible innovation, an in-depth understanding of the potential risks posed by these models is needed. A wide range of established and anticipated risks are analysed in detail, drawing on multidisciplinary expertise and literature from computer science, linguist…

    Submitted 8 December, 2021; originally announced December 2021.

  13. Fairness and Data Protection Impact Assessments

    Authors: Atoosa Kasirzadeh, Damian Clifford

    Abstract: In this paper, we critically examine the effectiveness of the requirement to conduct a Data Protection Impact Assessment (DPIA) in Article 35 of the General Data Protection Regulation (GDPR) in light of fairness metrics. Through this analysis, we explore the role of the fairness principle as introduced in Article 5(1)(a) and its multifaceted interpretation in the obligation to conduct a DPIA. Our…

    Submitted 13 September, 2021; originally announced September 2021.

    Journal ref: AIES '21: Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society

  14. User Tampering in Reinforcement Learning Recommender Systems

    Authors: Charles Evans, Atoosa Kasirzadeh

    Abstract: In this paper, we introduce new formal methods and provide empirical evidence to highlight a unique safety concern prevalent in reinforcement learning (RL)-based recommendation algorithms -- 'user tampering.' User tampering is a situation where an RL-based recommender system may manipulate a media user's opinions through its suggestions as part of a policy to maximize long-term user engagement. We…

    Submitted 24 July, 2023; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: In proceedings of the 6th AAAI/ACM Conference on Artificial Intelligence, Ethics and Society (AIES '23)

  15. arXiv:2103.00752  [pdf, ps, other]

    cs.CY cs.AI

    Reasons, Values, Stakeholders: A Philosophical Framework for Explainable Artificial Intelligence

    Authors: Atoosa Kasirzadeh

    Abstract: The societal and ethical implications of the use of opaque artificial intelligence systems for consequential decisions, such as welfare allocation and criminal justice, have generated a lively debate among multiple stakeholder groups, including computer scientists, ethicists, social scientists, policy makers, and end users. However, the lack of a common language or a multi-dimensional framework to…

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: This paper is accepted for non-archival publication at the ACM conference on Fairness, Accountability, and Transparency (FAccT) 2021

  16. arXiv:2102.05085  [pdf, ps, other]

    cs.CY

    The Use and Misuse of Counterfactuals in Ethical Machine Learning

    Authors: Atoosa Kasirzadeh, Andrew Smart

    Abstract: The use of counterfactuals for considerations of algorithmic fairness and explainability is gaining prominence within the machine learning community and industry. This paper argues for more caution with the use of counterfactuals when the facts to be considered are social categories such as race or gender. We review a broad body of papers from philosophy and social sciences on social ontology and…

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 9 pages, 1 table, 1 figure

  17. arXiv:1910.13607  [pdf, ps, other]

    cs.AI cs.CY cs.HC cs.LG

    Mathematical decisions and non-causal elements of explainable AI

    Authors: Atoosa Kasirzadeh

    Abstract: The social implications of algorithmic decision-making in sensitive contexts have generated lively debates among multiple stakeholders, such as moral and political philosophers, computer scientists, and the public. Yet, the lack of a common language and a conceptual framework for an appropriate bridging of the moral, technical, and political aspects of the debate prevents the discussion to be as e…

    Submitted 12 December, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: A shorter version of this paper was presented at the NeurIPS 2019, Human-Centric Machine Learning Workshop