Skip to main content

Showing 1–6 of 6 results for author: Gomrokchi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.11971  [pdf, other

    cs.LG cs.AI

    AdCraft: An Advanced Reinforcement Learning Benchmark Environment for Search Engine Marketing Optimization

    Authors: Maziar Gomrokchi, Owen Levin, Jeffrey Roach, Jonah White

    Abstract: We introduce AdCraft, a novel benchmark environment for the Reinforcement Learning (RL) community distinguished by its stochastic and non-stationary properties. The environment simulates bidding and budgeting dynamics within Search Engine Marketing (SEM), a digital marketing technique utilizing paid advertising to enhance the visibility of websites on search engine results pages (SERPs). The perfo… ▽ More

    Submitted 14 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  2. arXiv:2109.03975  [pdf, other

    cs.LG cs.CR

    Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning

    Authors: Maziar Gomrokchi, Susan Amin, Hossein Aboutalebi, Alexander Wong, Doina Precup

    Abstract: While significant research advances have been made in the field of deep reinforcement learning, there have been no concrete adversarial attack strategies in literature tailored for studying the vulnerability of deep reinforcement learning algorithms to membership inference attacks. In such attacking systems, the adversary targets the set of collected input data on which the deep reinforcement lear… ▽ More

    Submitted 15 November, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

  3. arXiv:2109.00157  [pdf, other

    cs.LG cs.AI

    A Survey of Exploration Methods in Reinforcement Learning

    Authors: Susan Amin, Maziar Gomrokchi, Harsh Satija, Herke van Hoof, Doina Precup

    Abstract: Exploration is an essential component of reinforcement learning algorithms, where agents need to learn how to predict and control unknown and often stochastic environments. Reinforcement learning agents depend crucially on exploration to obtain informative data for the learning process as the lack of enough information could hinder effective learning. In this article, we provide a survey of modern… ▽ More

    Submitted 2 September, 2021; v1 submitted 31 August, 2021; originally announced September 2021.

  4. arXiv:2012.13658  [pdf, other

    cs.LG

    Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards

    Authors: Susan Amin, Maziar Gomrokchi, Hossein Aboutalebi, Harsh Satija, Doina Precup

    Abstract: A major challenge in reinforcement learning is the design of exploration strategies, especially for environments with sparse reward structures and continuous state and action spaces. Intuitively, if the reinforcement signal is very scarce, the agent should rely on some form of short-term memory in order to cover its environment efficiently. We propose a new exploration method, based on two intuiti… ▽ More

    Submitted 11 June, 2021; v1 submitted 25 December, 2020; originally announced December 2020.

    Comments: To be published in ICML, 2021

  5. arXiv:1708.04133  [pdf, other

    cs.LG

    Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control

    Authors: Riashat Islam, Peter Henderson, Maziar Gomrokchi, Doina Precup

    Abstract: Policy gradient methods in reinforcement learning have become increasingly prevalent for state-of-the-art performance in continuous control tasks. Novel methods typically benchmark against a few key algorithms such as deep deterministic policy gradients and trust region policy optimization. As such, it is important to present and use consistent baselines experiments. However, this can be difficult… ▽ More

    Submitted 10 August, 2017; originally announced August 2017.

    Comments: Accepted to Reproducibility in Machine Learning Workshop, ICML'17

  6. arXiv:1603.02010  [pdf, other

    cs.LG stat.ML

    Differentially Private Policy Evaluation

    Authors: Borja Balle, Maziar Gomrokchi, Doina Precup

    Abstract: We present the first differentially private algorithms for reinforcement learning, which apply to the task of evaluating a fixed policy. We establish two approaches for achieving differential privacy, provide a theoretical analysis of the privacy and utility of the two algorithms, and show promising results on simple empirical examples.

    Submitted 7 March, 2016; originally announced March 2016.