Showing 1–7 of 7 results for author: Korinek, A

  1. arXiv:2404.09932  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    Foundational Challenges in Assuring Alignment and Safety of Large Language Models

    Authors: Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, et al. (13 additional authors not shown)

    Abstract: This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories: scientific understanding of LLMs, development and deployment methods, and sociotechnical challenges. Based on the identified challenges, we pose 200+ concrete research questions.

    Submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2403.12107  [pdf, other]

    econ.GN cs.AI

    Scenarios for the Transition to AGI

    Authors: Anton Korinek, Donghyun Suh

    Abstract: We analyze how output and wages behave under different scenarios for technological progress that may culminate in Artificial General Intelligence (AGI), defined as the ability of AI systems to perform all tasks that humans can perform. We assume that human work can be decomposed into atomistic tasks that differ in their complexity. Advances in technology make ever more complex tasks amenable to au…

    Submitted 17 March, 2024; originally announced March 2024.

  3. arXiv:2311.09227  [pdf, other]

    cs.CY cs.AI cs.SE

    Open-Sourcing Highly Capable Foundation Models: An evaluation of risks, benefits, and alternative methods for pursuing open-source objectives

    Authors: Elizabeth Seger, Noemi Dreksler, Richard Moulange, Emily Dardaman, Jonas Schuett, K. Wei, Christoph Winter, Mackenzie Arnold, Seán Ó hÉigeartaigh, Anton Korinek, Markus Anderljung, Ben Bucknall, Alan Chan, Eoghan Stafford, Leonie Koessler, Aviv Ovadya, Ben Garfinkel, Emma Bluemke, Michael Aird, Patrick Levermore, Julian Hazell, Abhishek Gupta

    Abstract: Recent decisions by leading AI labs to either open-source their models or to restrict access to their models has sparked debate about whether, and how, increasingly capable AI models should be shared. Open-sourcing in AI typically refers to making model architecture and weights freely and publicly accessible for anyone to modify, study, build on, and use. This offers advantages such as enabling ex…

    Submitted 29 September, 2023; originally announced November 2023.

    Comments: Official release at https://www.governance.ai/research-paper/open-sourcing-highly-capable-foundation-models

  4. arXiv:2311.01550  [pdf]

    cs.AI econ.GN

    Market Concentration Implications of Foundation Models

    Authors: Jai Vipra, Anton Korinek

    Abstract: We analyze the structure of the market for foundation models, i.e., large AI models such as those that power ChatGPT and that are adaptable to downstream uses, and we examine the implications for competition policy and regulation. We observe that the most capable models will have a tendency towards natural monopoly and may have potentially vast markets. This calls for a two-pronged regulatory resp…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: Working Paper

  5. arXiv:2307.03718  [pdf, other]

    cs.CY cs.AI

    Frontier AI Regulation: Managing Emerging Risks to Public Safety

    Authors: Markus Anderljung, Joslyn Barnhart, Anton Korinek, Jade Leung, Cullen O'Keefe, Jess Whittlestone, Shahar Avin, Miles Brundage, Justin Bullock, Duncan Cass-Beggs, Ben Chang, Tantum Collins, Tim Fist, Gillian Hadfield, Alan Hayes, Lewis Ho, Sara Hooker, Eric Horvitz, Noam Kolt, Jonas Schuett, Yonadav Shavit, Divya Siddarth, Robert Trager, Kevin Wolf

    Abstract: Advanced AI models hold the promise of tremendous benefits for humanity, but society needs to proactively manage the accompanying risks. In this paper, we focus on what we term "frontier AI" models: highly capable foundation models that could possess dangerous capabilities sufficient to pose severe risks to public safety. Frontier AI models pose a distinct regulatory challenge: dangerous capabilit…

    Submitted 7 November, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Update July 11th: - Added missing footnote back in. - Adjusted author order (mistakenly non-alphabetical among the first 6 authors) and adjusted affiliations (Jess Whittlestone's affiliation was mistagged and Gillian Hadfield had SRI added to her affiliations) Updated September 4th: Various typos

  6. arXiv:2205.04279  [pdf]

    cs.CY cs.AI

    Aligned with Whom? Direct and social goals for AI systems

    Authors: Anton Korinek, Avital Balwit

    Abstract: As artificial intelligence (AI) becomes more powerful and widespread, the AI alignment problem - how to ensure that AI systems pursue the goals that we want them to pursue - has garnered growing attention. This article distinguishes two types of alignment problems depending on whose goals we consider, and analyzes the different solutions necessitated by each. The direct alignment problem considers…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: Prepared for the Oxford Handbook of AI Governance (23 pages, 2 figures)

  7. arXiv:2105.08475  [pdf, other]

    cs.AI cs.CY econ.GN

    AI and Shared Prosperity

    Authors: Katya Klinova, Anton Korinek

    Abstract: Future advances in AI that automate away human labor may have stark implications for labor markets and inequality. This paper proposes a framework to analyze the effects of specific types of AI systems on the labor market, based on how much labor demand they will create versus displace, while taking into account that productivity gains also make society wealthier and thereby contribute to addition…

    Submitted 18 May, 2021; originally announced May 2021.

    ACM Class: J.4; K.4.1

    Journal ref: Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society (AIES '21)