Showing 1–9 of 9 results for author: Brown-Cohen, J

  1. arXiv:2407.04622  [pdf, other]

    cs.LG

    On scalable oversight with weak LLMs judging strong LLMs

    Authors: Zachary Kenton, Noah Y. Siegel, János Kramár, Jonah Brown-Cohen, Samuel Albanie, Jannis Bulian, Rishabh Agarwal, David Lindner, Yunhao Tang, Noah D. Goodman, Rohin Shah

    Abstract: Scalable oversight protocols aim to enable humans to accurately supervise superhuman AI. In this paper we study debate, where two AIs compete to convince a judge; consultancy, where a single AI tries to convince a judge that asks questions; and compare to a baseline of direct question-answering, where the judge just answers outright without the AI. We use large language models (LLMs) as both AI a…

    Submitted 12 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: 15 pages (53 including appendices). V2: minor correction to Figure 3; add Figure A.9 comparing open vs assigned consultancy; add a reference

  2. arXiv:2311.14125  [pdf, other]

    cs.AI cs.LG

    Scalable AI Safety via Doubly-Efficient Debate

    Authors: Jonah Brown-Cohen, Geoffrey Irving, Georgios Piliouras

    Abstract: The emergence of pre-trained AI systems with powerful capabilities across a diverse and ever-increasing set of complex domains has raised a critical challenge for AI safety as tasks can become too complicated for humans to judge directly. Irving et al. [2018] proposed a debate method in this direction with the goal of pitting the power of such AI models against each other until the problem of iden…

    Submitted 23 November, 2023; originally announced November 2023.

  3. arXiv:2310.17567  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models

    Authors: Dingli Yu, Simran Kaur, Arushi Gupta, Jonah Brown-Cohen, Anirudh Goyal, Sanjeev Arora

    Abstract: With LLMs shifting their role from statistical modeling of language to serving as general-purpose AI agents, how should LLM evaluations change? Arguably, a key ability of an AI agent is to flexibly combine, as needed, the basic skills it has learned. The capability to combine skills plays an important role in (human) pedagogy and also in a paper on emergence phenomena (Arora & Goyal, 2023). This…

    Submitted 26 October, 2023; originally announced October 2023.

  4. arXiv:2306.05873  [pdf, other]

    cs.LG cs.AI cs.CR stat.ML

    Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions

    Authors: Ezgi Korkmaz, Jonah Brown-Cohen

    Abstract: Learning in MDPs with highly complex state representations is currently possible due to multiple advancements in reinforcement learning algorithm design. However, this incline in complexity, and furthermore the increase in the dimensions of the observation came at the cost of volatility that can be taken advantage of via adversarial attacks (i.e. moving along worst-case directions in the observati…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Published in ICML 2023

  5. arXiv:2112.13832  [pdf, ps, other]

    cs.DS cs.LG

    Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error

    Authors: Jonah Brown-Cohen

    Abstract: The study of statistical estimation without distributional assumptions on data values, but with knowledge of data collection methods was recently introduced by Chen, Valiant and Valiant (NeurIPS 2020). In this framework, the goal is to design estimators that minimize the worst-case expected error. Here the expectation is over a known, randomized data collection process from some population, and th…

    Submitted 27 December, 2021; originally announced December 2021.

  6. arXiv:1911.02911  [pdf, ps, other]

    cs.CC cs.DS

    Extended Formulation Lower Bounds for Refuting Random CSPs

    Authors: Jonah Brown-Cohen, Prasad Raghavendra

    Abstract: Random constraint satisfaction problems (CSPs) such as random $3$-SAT are conjectured to be computationally intractable. The average case hardness of random $3$-SAT and other CSPs has broad and far-reaching implications on problems in approximation, learning theory and cryptography. In this work, we show subexponential lower bounds on the size of linear programming relaxations for refuting rando…

    Submitted 7 November, 2019; originally announced November 2019.

  7. arXiv:1809.06528  [pdf, other]

    cs.GT cs.CR

    Formal Barriers to Longest-Chain Proof-of-Stake Protocols

    Authors: Jonah Brown-Cohen, Arvind Narayanan, Christos-Alexandros Psomas, S. Matthew Weinberg

    Abstract: The security of most existing cryptocurrencies is based on a concept called Proof-of-Work, in which users must solve a computationally hard cryptopuzzle to authorize transactions ('one unit of computation, one vote'). This leads to enormous expenditure on hardware and electricity in order to collect the rewards associated with transaction authorization. Proof-of-Stake is an alternative concept tha…

    Submitted 18 September, 2018; originally announced September 2018.

  8. arXiv:1504.00703  [pdf, other]

    cs.CC

    The matching problem has no small symmetric SDP

    Authors: Gábor Braun, Jonah Brown-Cohen, Arefin Huq, Sebastian Pokutta, Prasad Raghavendra, Aurko Roy, Benjamin Weitz, Daniel Zink

    Abstract: Yannakakis showed that the matching problem does not have a small symmetric linear program. Rothvoß recently proved that any, not necessarily symmetric, linear program also has exponential size. It is natural to ask whether the matching problem can be expressed compactly in a framework such as semidefinite programming (SDP) that is more powerful than linear programming but still allows efficient o…

    Submitted 30 November, 2016; v1 submitted 2 April, 2015; originally announced April 2015.

    Comments: 18 pages

    MSC Class: 68Q17; 68R10

    Journal ref: Proceedings of SODA 2016, 1067-1078

  9. arXiv:1501.01598  [pdf, ps, other]

    cs.CC

    Combinatorial Optimization Algorithms via Polymorphisms

    Authors: Jonah Brown-Cohen, Prasad Raghavendra

    Abstract: An elegant characterization of the complexity of constraint satisfaction problems has emerged in the form of the algebraic dichotomy conjecture of [BKJ00]. Roughly speaking, the characterization asserts that a CSP Λ is tractable if and only if there exist certain non-trivial operations known as polymorphisms to combine solutions to Λ to create new ones. In an entirely separate line of work, th…

    Submitted 7 January, 2015; originally announced January 2015.