Skip to main content

Showing 1–22 of 22 results for author: Pachocki, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.08774  [pdf, other

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  2. arXiv:2203.03466  [pdf, other

    cs.LG cond-mat.dis-nn cs.NE

    Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

    Authors: Greg Yang, Edward J. Hu, Igor Babuschkin, Szymon Sidor, Xiaodong Liu, David Farhi, Nick Ryder, Jakub Pachocki, Weizhu Chen, Jianfeng Gao

    Abstract: Hyperparameter (HP) tuning in deep learning is an expensive process, prohibitively so for neural networks (NNs) with billions of parameters. We show that, in the recently discovered Maximal Update Parametrization (muP), many optimal HPs remain stable even as model size changes. This leads to a new HP tuning paradigm we call muTransfer: parametrize the target model in muP, tune the HP indirectly on… ▽ More

    Submitted 28 March, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: NeurIPS 2021

  3. arXiv:1912.06680  [pdf, other

    cs.LG stat.ML

    Dota 2 with Large Scale Deep Reinforcement Learning

    Authors: OpenAI, :, Christopher Berner, Greg Brockman, Brooke Chan, Vicki Cheung, Przemysław Dębiak, Christy Dennison, David Farhi, Quirin Fischer, Shariq Hashme, Chris Hesse, Rafal Józefowicz, Scott Gray, Catherine Olsson, Jakub Pachocki, Michael Petrov, Henrique P. d. O. Pinto, Jonathan Raiman, Tim Salimans, Jeremy Schlatter, Jonas Schneider, Szymon Sidor, Ilya Sutskever, Jie Tang , et al. (2 additional authors not shown)

    Abstract: On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems. OpenAI Five leveraged existing reinforcement learnin… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

  4. arXiv:1904.08178  [pdf, other

    cs.SI cs.DS

    Novel Dense Subgraph Discovery Primitives: Risk Aversion and Exclusion Queries

    Authors: Charalampos E. Tsourakakis, Tianyi Chen, Naonori Kakimura, Jakub Pachocki

    Abstract: In the densest subgraph problem, given a weighted undirected graph $G(V,E,w)$, with non-negative edge weights, we are asked to find a subset of nodes $S\subseteq V$ that maximizes the degree density $w(S)/|S|$, where $w(S)$ is the sum of the edge weights induced by $S$. This problem is a well studied problem, known as the {\em densest subgraph problem}, and is solvable in polynomial time. But what… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Comments: 24 pages

  5. arXiv:1808.05662  [pdf, ps, other

    cs.DS

    Constant Arboricity Spectral Sparsifiers

    Authors: Timothy Chu, Michael B. Cohen, Jakub W. Pachocki, Richard Peng

    Abstract: We show that every graph is spectrally similar to the union of a constant number of forests. Moreover, we show that Spielman-Srivastava sparsifiers are the union of O(logn) forests. This result can be used to estimate boundaries of small subsets of vertices in nearly optimal query time.

    Submitted 16 August, 2018; originally announced August 2018.

  6. arXiv:1808.00177  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Learning Dexterous In-Hand Manipulation

    Authors: OpenAI, Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba

    Abstract: We use reinforcement learning (RL) to learn dexterous in-hand manipulation policies which can perform vision-based object reorientation on a physical Shadow Dexterous Hand. The training is performed in a simulated environment in which we randomize many of the physical properties of the system like friction coefficients and an object's appearance. Our policies transfer to the physical robot despite… ▽ More

    Submitted 18 January, 2019; v1 submitted 1 August, 2018; originally announced August 2018.

    Comments: Making OpenAI the first author. We wish this paper to be cited as "Learning Dexterous In-Hand Manipulation" by OpenAI et al. We are replicating the approach from the physics community: arXiv:1812.06489

  7. arXiv:1710.03748  [pdf, other

    cs.AI

    Emergent Complexity via Multi-Agent Competition

    Authors: Trapit Bansal, Jakub Pachocki, Szymon Sidor, Ilya Sutskever, Igor Mordatch

    Abstract: Reinforcement learning algorithms can train agents that solve problems in complex, interesting environments. Normally, the complexity of the trained agent is closely related to the complexity of the environment. This suggests that a highly capable agent requires a complex environment for training. In this paper, we point out that a competitive multi-agent environment trained with self-play can pro… ▽ More

    Submitted 14 March, 2018; v1 submitted 10 October, 2017; originally announced October 2017.

    Comments: Published as a conference paper at ICLR 2018

  8. arXiv:1704.00633  [pdf, ps, other

    cs.CC cs.DS

    Optimal lower bounds for universal relation, and for samplers and finding duplicates in streams

    Authors: Michael Kapralov, Jelani Nelson, Jakub Pachocki, Zhengyu Wang, David P. Woodruff, Mobin Yahyazadeh

    Abstract: In the communication problem $\mathbf{UR}$ (universal relation) [KRW95], Alice and Bob respectively receive $x, y \in\{0,1\}^n$ with the promise that $x\neq y$. The last player to receive a message must output an index $i$ such that $x_i\neq y_i$. We prove that the randomized one-way communication complexity of this problem in the public coin model is exactly… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.

    Comments: merge of arXiv:1703.08139 and of work of Kapralov, Woodruff, and Yahyazadeh

  9. arXiv:1703.08139  [pdf, ps, other

    cs.CC cs.DS

    Optimal lower bounds for universal relation, samplers, and finding duplicates

    Authors: Jelani Nelson, Jakub Pachocki, Zhengyu Wang

    Abstract: In the communication problem $\mathbf{UR}$ (universal relation) [KRW95], Alice and Bob respectively receive $x$ and $y$ in $\{0,1\}^n$ with the promise that $x\neq y$. The last player to receive a message must output an index $i$ such that $x_i\neq y_i$. We prove that the randomized one-way communication complexity of this problem in the public coin model is exactly… ▽ More

    Submitted 23 March, 2017; originally announced March 2017.

  10. arXiv:1611.06940  [pdf, ps, other

    cs.DS

    A Framework for Analyzing Resparsification Algorithms

    Authors: Rasmus Kyng, Jakub Pachocki, Richard Peng, Sushant Sachdeva

    Abstract: A spectral sparsifier of a graph $G$ is a sparser graph $H$ that approximately preserves the quadratic form of $G$, i.e. for all vectors $x$, $x^T L_G x \approx x^T L_H x$, where $L_G$ and $L_H$ denote the respective graph Laplacians. Spectral sparsifiers generalize cut sparsifiers, and have found many applications in designing graph algorithms. In recent years, there has been interest in computin… ▽ More

    Submitted 21 November, 2016; originally announced November 2016.

    Comments: This paper supersedes arXiv:1605.08194

  11. arXiv:1611.00721  [pdf, other

    cs.DS

    Approximating Cycles in Directed Graphs: Fast Algorithms for Girth and Roundtrip Spanners

    Authors: Jakub Pachocki, Liam Roditty, Aaron Sidford, Roei Tov, Virginia Vassilevska Williams

    Abstract: The girth of a graph, i.e. the length of its shortest cycle, is a fundamental graph parameter. Unfortunately all known algorithms for computing, even approximately, the girth and girth-related structures in directed weighted $m$-edge and $n$-node graphs require $Ω(\min\{n^ω, mn\})$ time (for $2\leqω<2.373$). In this paper, we drastically improve these runtimes as follows: * Multiplicative Approx… ▽ More

    Submitted 10 August, 2018; v1 submitted 2 November, 2016; originally announced November 2016.

    Comments: this is a revision of the original submission that fixes an error (the original submission claimed that the additive approximation algorithm also works for roundtrip spanners; this is false)

  12. arXiv:1606.06235  [pdf, other

    cs.DS cs.DM cs.SI math.CO

    Scalable motif-aware graph clustering

    Authors: Charalampos Tsourakakis, Jakub Pachocki, Michael Mitzenmacher

    Abstract: We develop new methods based on graph motifs for graph clustering, allowing more efficient detection of communities within networks. We focus on triangles within graphs, but our techniques extend to other clique motifs as well. Our intuition, which has been suggested but not formalized similarly in previous works, is that triangles are a better signature of community than edges. We therefore gener… ▽ More

    Submitted 4 February, 2017; v1 submitted 20 June, 2016; originally announced June 2016.

    Comments: 17 pages, to appear in WWW 2017 Proceedings

  13. arXiv:1606.05225  [pdf, other

    cs.DS math.OC

    Geometric Median in Nearly Linear Time

    Authors: Michael B. Cohen, Yin Tat Lee, Gary Miller, Jakub Pachocki, Aaron Sidford

    Abstract: In this paper we provide faster algorithms for solving the geometric median problem: given $n$ points in $\mathbb{R}^{d}$ compute a point that minimizes the sum of Euclidean distances to the points. This is one of the oldest non-trivial problems in computational geometry yet despite an abundance of research the previous fastest algorithms for computing a $(1+ε)$-approximate geometric median were… ▽ More

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: Symposium on Theory of Computing (STOC) 2016

  14. arXiv:1605.08194  [pdf, ps, other

    cs.DS

    Analysis of Resparsification

    Authors: Jakub Pachocki

    Abstract: We show that schemes for sparsifying matrices based on iteratively resampling rows yield guarantees matching classic 'offline' sparsifiers (see e.g. Spielman and Srivastava [STOC 2008]). In particular, this gives a formal analysis of a scheme very similar to the one proposed by Kelner and Levin [TCS 2013].

    Submitted 26 May, 2016; originally announced May 2016.

    Comments: preliminary draft

  15. arXiv:1604.05448  [pdf, ps, other

    cs.DS

    Online Row Sampling

    Authors: Michael B. Cohen, Cameron Musco, Jakub Pachocki

    Abstract: Finding a small spectral approximation for a tall $n \times d$ matrix $A$ is a fundamental numerical primitive. For a number of reasons, one often seeks an approximation whose rows are sampled from those of $A$. Row sampling improves interpretability, saves space when $A$ is sparse, and preserves row structure, which is especially important, for example, when $A$ represents a graph. However, cor… ▽ More

    Submitted 19 April, 2016; originally announced April 2016.

  16. arXiv:1603.09009  [pdf, ps, other

    cs.DS

    Routing under Balance

    Authors: Alina Ene, Gary Miller, Jakub Pachocki, Aaron Sidford

    Abstract: We introduce the notion of balance for directed graphs: a weighted directed graph is $α$-balanced if for every cut $S \subseteq V$, the total weight of edges going from $S$ to $V\setminus S$ is within factor $α$ of the total weight of edges going from $V\setminus S$ to $S$. Several important families of graphs are nearly balanced, in particular, Eulerian graphs (with $α= 1$) and residual graphs of… ▽ More

    Submitted 29 March, 2016; originally announced March 2016.

    Comments: To appear in STOC 2016

    ACM Class: C.2.2; F.2.0

  17. arXiv:1602.05016  [pdf, ps, other

    cs.DS

    Tight Lower Bounds on Graph Embedding Problems

    Authors: Marek Cygan, Fedor V. Fomin, Alexander Golovnev, Alexander S. Kulikov, Ivan Mihajlin, Jakub Pachocki, Arkadiusz Socała

    Abstract: We prove that unless the Exponential Time Hypothesis (ETH) fails, deciding if there is a homomorphism from graph $G$ to graph $H$ cannot be done in time $|V(H)|^{o(|V(G)|)}$. We also show an exponential-time reduction from Graph Homomorphism to Subgraph Isomorphism. This rules out (subject to ETH) a possibility of $|V(H)|^{o(|V(H)|)}$-time algorithm deciding if graph $G$ is a subgraph of $H$. For… ▽ More

    Submitted 16 February, 2016; originally announced February 2016.

    Comments: 23 pages. arXiv admin note: substantial text overlap with arXiv:1502.05447, arXiv:1507.03738

  18. arXiv:1504.02876  [pdf, ps, other

    cs.DS

    The Hardness of Subgraph Isomorphism

    Authors: Marek Cygan, Jakub Pachocki, Arkadiusz Socała

    Abstract: Subgraph Isomorphism is a very basic graph problem, where given two graphs $G$ and $H$ one is to check whether $G$ is a subgraph of $H$. Despite its simple definition, the Subgraph Isomorphism problem turns out to be very broad, as it generalizes problems such as Clique, $r$-Coloring, Hamiltonicity, Set Packing and Bandwidth. However, for all of the mentioned problems $2^{\mathcal{O}(n)}$ time alg… ▽ More

    Submitted 11 April, 2015; originally announced April 2015.

  19. arXiv:1407.6144  [pdf, ps, other

    cs.DS

    On the String Consensus Problem and the Manhattan Sequence Consensus Problem

    Authors: Tomasz Kociumaka, Jakub W. Pachocki, Jakub Radoszewski, Wojciech Rytter, Tomasz Waleń

    Abstract: In the Manhattan Sequence Consensus problem (MSC problem) we are given $k$ integer sequences, each of length $l$, and we are to find an integer sequence $x$ of length $l$ (called a consensus sequence), such that the maximum Manhattan distance of $x$ from each of the input sequences is minimized. For binary sequences Manhattan distance coincides with Hamming distance, hence in this case the string… ▽ More

    Submitted 23 July, 2014; originally announced July 2014.

    Comments: accepted to SPIRE 2014

  20. arXiv:1401.6236  [pdf, ps, other

    cs.DS math.NA

    Preconditioning in Expectation

    Authors: Michael B. Cohen, Rasmus Kyng, Jakub W. Pachocki, Richard Peng, Anup Rao

    Abstract: We show that preconditioners constructed by random sampling can perform well without meeting the standard requirements of iterative methods. When applied to graph Laplacians, this leads to ultra-sparsifiers that in expectation behave as the nearly-optimal ones given by [Kolla-Makarychev-Saberi-Teng STOC`10]. Combining this with the recursive preconditioning framework by [Spielman-Teng STOC`04] and… ▽ More

    Submitted 23 January, 2014; originally announced January 2014.

  21. arXiv:1401.2454  [pdf, ps, other

    cs.DS

    Stretching Stretch

    Authors: Michael B. Cohen, Gary L. Miller, Jakub W. Pachocki, Richard Peng, Shen Chen Xu

    Abstract: We give a generalized definition of stretch that simplifies the efficient construction of low-stretch embeddings suitable for graph algorithms. The generalization, based on discounting highly stretched edges by taking their $p$-th power for some $0 < p < 1$, is directly related to performances of existing algorithms. This discounting of high-stretch edges allows us to treat many classes of edges w… ▽ More

    Submitted 5 February, 2014; v1 submitted 10 January, 2014; originally announced January 2014.

  22. arXiv:1208.3313  [pdf, ps, other

    cs.DS cs.DM

    A Note on Efficient Computation of All Abelian Periods in a String

    Authors: Maxime Crochemore, Costas Iliopoulos, Tomasz Kociumaka, Marcin Kubica, Jakub Pachocki, Jakub Radoszewski, Wojciech Rytter, Wojciech Tyczyński, Tomasz Waleń

    Abstract: We derive a simple efficient algorithm for Abelian periods knowing all Abelian squares in a string. An efficient algorithm for the latter problem was given by Cummings and Smyth in 1997. By the way we show an alternative algorithm for Abelian squares. We also obtain a linear time algorithm finding all `long' Abelian periods. The aim of the paper is a (new) reduction of the problem of all Abelian p… ▽ More

    Submitted 16 August, 2012; originally announced August 2012.

    ACM Class: F.2.2