Skip to main content

Showing 1–11 of 11 results for author: Huizinga, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.08774  [pdf, other

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  2. arXiv:2206.11795  [pdf, other

    cs.LG cs.AI

    Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

    Authors: Bowen Baker, Ilge Akkaya, Peter Zhokhov, Joost Huizinga, Jie Tang, Adrien Ecoffet, Brandon Houghton, Raul Sampedro, Jeff Clune

    Abstract: Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for training models with broad, general capabilities for text, images, and other modalities. However, for many sequential decision domains such as robotics, video games, and computer use, publicly available data does not contain the labels required to train behavioral priors in the same way. We extend the interne… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  3. arXiv:2106.14876  [pdf, other

    cs.LG stat.ML

    Multi-task curriculum learning in a complex, visual, hard-exploration domain: Minecraft

    Authors: Ingmar Kanitscheider, Joost Huizinga, David Farhi, William Hebgen Guss, Brandon Houghton, Raul Sampedro, Peter Zhokhov, Bowen Baker, Adrien Ecoffet, Jie Tang, Oleg Klimov, Jeff Clune

    Abstract: An important challenge in reinforcement learning is training agents that can solve a wide variety of tasks. If tasks depend on each other (e.g. needing to learn to walk before learning to run), curriculum learning can speed up learning by focusing on the next best task to learn. We explore curriculum learning in a complex, visual domain with many hard exploration challenges: Minecraft. We find tha… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: first submission

  4. First return, then explore

    Authors: Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: The promise of reinforcement learning is to solve complex sequential decision problems autonomously by specifying a high-level reward function only. However, reinforcement learning algorithms struggle when, as is often the case, simple and intuitive rewards provide sparse and deceptive feedback. Avoiding these pitfalls requires thoroughly exploring the environment, but creating algorithms that can… ▽ More

    Submitted 16 September, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: 47 pages, 14 figures, 4 tables; reorganized sections and modified SI text extensively; added reference to the published version, changed title to published title; added reference to published unformatted pdf

    Journal ref: Nature 590, 580-586 (2021)

  5. arXiv:2003.01825  [pdf, other

    cs.NE cs.AI cs.LG

    Scaling MAP-Elites to Deep Neuroevolution

    Authors: Cédric Colas, Joost Huizinga, Vashisht Madhavan, Jeff Clune

    Abstract: Quality-Diversity (QD) algorithms, and MAP-Elites (ME) in particular, have proven very useful for a broad range of applications including enabling real robots to recover quickly from joint damage, solving strongly deceptive maze tasks or evolving robot morphologies to discover new gaits. However, present implementations of MAP-Elites and other QD algorithms seem to be limited to low-dimensional co… ▽ More

    Submitted 5 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: Accepted to GECCO 2020

  6. arXiv:2001.08868  [pdf, other

    cs.CL cs.AI

    Exploration Based Language Learning for Text-Based Games

    Authors: Andrea Madotto, Mahdi Namazifar, Joost Huizinga, Piero Molino, Adrien Ecoffet, Huaixiu Zheng, Alexandros Papangelis, Dian Yu, Chandra Khatri, Gokhan Tur

    Abstract: This work presents an exploration and imitation-learning-based agent capable of state-of-the-art performance in playing text-based computer games. Text-based computer games describe their world to the player through natural language and expect the player to interact with the game using text. These games are of interest as they can be seen as a testbed for language understanding, problem-solving, a… ▽ More

    Submitted 7 June, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: Accepted at IJCAI 2020

  7. Guiding Neuroevolution with Structural Objectives

    Authors: Kai Olav Ellefsen, Joost Huizinga, Jim Torresen

    Abstract: The structure and performance of neural networks are intimately connected, and by use of evolutionary algorithms, neural network structures optimally adapted to a given task can be explored. Guiding such neuroevolution with additional objectives related to network structure has been shown to improve performance in some cases, especially when modular neural networks are beneficial. However, apart f… ▽ More

    Submitted 23 April, 2019; v1 submitted 12 February, 2019; originally announced February 2019.

  8. arXiv:1901.10995  [pdf, other

    cs.LG cs.AI stat.ML

    Go-Explore: a New Approach for Hard-Exploration Problems

    Authors: Adrien Ecoffet, Joost Huizinga, Joel Lehman, Kenneth O. Stanley, Jeff Clune

    Abstract: A grand challenge in reinforcement learning is intelligent exploration, especially when rewards are sparse or deceptive. Two Atari games serve as benchmarks for such hard-exploration domains: Montezuma's Revenge and Pitfall. On both games, current RL algorithms perform poorly, even those with intrinsic motivation, which is the dominant method to improve performance on hard-exploration domains. To… ▽ More

    Submitted 26 February, 2021; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: 37 pages, 14 figures; added references to Goyal et al. and Oh et al., updated reference to Colas et al; updated author emails; point readers to updated paper

  9. arXiv:1807.03392  [pdf, other

    cs.NE

    Evolving Multimodal Robot Behavior via Many Stepping Stones with the Combinatorial Multi-Objective Evolutionary Algorithm

    Authors: Joost Huizinga, Jeff Clune

    Abstract: An important challenge in reinforcement learning, including evolutionary robotics, is to solve multimodal problems, where agents have to act in qualitatively different ways depending on the circumstances. Because multimodal problems are often too difficult to solve directly, it is helpful to take advantage of staging, where a difficult task is divided into simpler subtasks that can serve as steppi… ▽ More

    Submitted 10 December, 2019; v1 submitted 9 July, 2018; originally announced July 2018.

  10. The Emergence of Canalization and Evolvability in an Open-Ended, Interactive Evolutionary System

    Authors: Joost Huizinga, Kenneth O. Stanley, Jeff Clune

    Abstract: Natural evolution has produced a tremendous diversity of functional organisms. Many believe an essential component of this process was the evolution of evolvability, whereby evolution speeds up its ability to innovate by generating a more adaptive pool of offspring. One hypothesized mechanism for evolvability is developmental canalization, wherein certain dimensions of variation become more likely… ▽ More

    Submitted 14 February, 2019; v1 submitted 17 April, 2017; originally announced April 2017.

    Comments: SI can be found at: http://www.evolvingai.org/files/SI_0.zip

    Journal ref: Artificial life, 24(3), pp.157-181 (2018)

  11. The evolutionary origins of hierarchy

    Authors: Henok Mengistu, Joost Huizinga, Jean-Baptiste Mouret, Jeff Clune

    Abstract: Hierarchical organization -- the recursive composition of sub-modules -- is ubiquitous in biological networks, including neural, metabolic, ecological, and genetic regulatory networks, and in human-made systems, such as large organizations and the Internet. To date, most research on hierarchy in networks has been limited to quantifying this property. However, an open, important question in evoluti… ▽ More

    Submitted 23 May, 2015; originally announced May 2015.

    Comments: 32 pages