-
Analysis of Stochastic Chemical Reaction Networks with a Hierarchy of Timescales
Authors:
Lucie Laurence,
Philippe Robert
Abstract:
We investigate a class of stochastic chemical reaction networks with $n{\ge}1$ chemical species $S_1$, \ldots, $S_n$, and whose complexes are only of the form $k_iS_i$, $i{=}1$,\ldots, $n$, where $(k_i)$ are integers. The time evolution of these CRNs is driven by the kinetics of the law of mass action. A scaling analysis is done when the rates of external arrivals of chemical species are proportional to a large scaling parameter $N$. A natural hierarchy of fast processes, a subset of the coordinates of $(X_i(t))$, is determined by the values of the mapping $i{\mapsto}k_i$. We show that the scaled vector of coordinates $i$ such that $k_i{=}1$ and the scaled occupation measure of the other coordinates are converging in distribution to a deterministic limit as $N$ gets large. The proof of this result is obtained by establishing a functional equation for the limiting points of the occupation measure, by an induction on the hierarchy of timescales and with relative entropy functions.
Submitted 28 August, 2024;
originally announced August 2024.
-
On integral priors for multiple comparison in Bayesian model selection
Authors:
Diego Salmerón,
Juan Antonio Cano,
Christian P. Robert
Abstract:
Noninformative priors constructed for estimation purposes are usually not appropriate for model selection and testing. The methodology of integral priors was developed to get prior distributions for Bayesian model selection when comparing two models, modifying initial improper reference priors. We propose a generalization of this methodology to more than two models. Our approach adds an artificial copy of each model under comparison by compactifying the parametric space and creating an ergodic Markov chain across all models that returns the integral priors as marginals of the stationary distribution. Besides the guarantee of their existence and the lack of paradoxes attached to estimation reference priors, an additional advantage of this methodology is that the simulation of this Markov chain is straightforward as it only requires simulations of imaginary training samples for all models and from the corresponding posterior distributions. This renders its implementation automatic and generic, both in the nested case and in the nonnested case.
Submitted 20 June, 2024;
originally announced June 2024.
-
Stochastic Chemical Reaction Networks with Discontinuous Limits and AIMD processes
Authors:
Lucie Laurence,
Philippe Robert
Abstract:
In this paper we study a class of stochastic chemical reaction networks (CRNs) for which chemical species are created by a sequence of chain reactions. We prove that under some convenient conditions on the initial state, some of these networks exhibit a discrete-induced transitions (DIT) property: isolated, random, events have a direct impact on the macroscopic state of the process. If this phenomenon has already been noticed in several CRNs, in auto-catalytic networks in the literature of physics in particular, there are up to now few rigorous studies in this domain. A scaling analysis of several cases of such CRNs with several classes of initial states is achieved. The DIT property is investigated for the case of a CRN with four nodes. We show that on the normal timescale and for a subset of (large) initial states and for convenient Skorohod topologies, the scaled process converges in distribution to a Markov process with jumps, an Additive Increase/Multiplicative Decrease (AIMD) process. This asymptotically discontinuous limiting behavior is a consequence of a DIT property due to random, local, blowups of jumps occurring during small time intervals. With an explicit representation of invariant measures of AIMD processes and time-change arguments, we show that, with a speed-up of the timescale, the scaled process is converging in distribution to a continuous deterministic function. The DIT analyzed in this paper is connected to a simple chain reaction between three chemical species and is therefore likely to be a quite generic phenomenon for a large class of CRNs.
Submitted 18 June, 2024;
originally announced June 2024.
-
ROB 204: Introduction to Human-Robot Systems at the University of Michigan, Ann Arbor
Authors:
Leia Stirling,
Joseph Montgomery,
Mark Draelos,
Christoforos Mavrogiannis,
Lionel P. Robert Jr.,
Odest Chadwicke Jenkins
Abstract:
The University of Michigan Robotics program focuses on the study of embodied intelligence that must sense, reason, act, and work with people to improve quality of life and productivity equitably across society. ROB 204, part of the core curriculum towards the undergraduate degree in Robotics, introduces students to topics that enable conceptually designing a robotic system to address users' needs from a sociotechnical context. Students are introduced to human-robot interaction (HRI) concepts and the process for socially-engaged design with a Learn-Reinforce-Integrate approach. In this paper, we discuss the course topics and our teaching methodology, and provide recommendations for delivering this material. Overall, students leave the course with a new understanding and appreciation for how human capabilities can inform requirements for a robotics system, how humans can interact with a robot, and how to assess the usability of robotic systems.
Submitted 23 May, 2024;
originally announced May 2024.
-
A discussion of the paper "Safe testing" by Grünwald, de Heide, and Koolen
Authors:
Joshua Bon,
Christian P Robert
Abstract:
This is a discussion of the paper "Safe testing" by Grünwald, de Heide, and Koolen, read before The Royal Statistical Society at a meeting organized by the Research Section on Wednesday, 24 January, 2024.
Submitted 9 February, 2024;
originally announced February 2024.
-
Shaping Human-AI Collaboration: Varied Scaffolding Levels in Co-writing with Language Models
Authors:
Paramveer S. Dhillon,
Somayeh Molaei,
Jiaqi Li,
Maximilian Golub,
Shaochun Zheng,
Lionel P. Robert
Abstract:
Advances in language modeling have paved the way for novel human-AI co-writing experiences. This paper explores how varying levels of scaffolding from large language models (LLMs) shape the co-writing process. Employing a within-subjects field experiment with a Latin square design, we asked participants (N=131) to respond to argumentative writing prompts under three randomly sequenced conditions: no AI assistance (control), next-sentence suggestions (low scaffolding), and next-paragraph suggestions (high scaffolding). Our findings reveal a U-shaped impact of scaffolding on writing quality and productivity (words/time). While low scaffolding did not significantly improve writing quality or productivity, high scaffolding led to significant improvements, especially benefiting non-regular writers and less tech-savvy users. No significant cognitive burden was observed while using the scaffolded writing tools, but a moderate decrease in text ownership and satisfaction was noted. Our results have broad implications for the design of AI-powered writing tools, including the need for personalized scaffolding mechanisms.
Submitted 18 February, 2024;
originally announced February 2024.
-
Simulating signed mixtures
Authors:
Julien Stoehr,
Christian P. Robert
Abstract:
Simulating mixtures of distributions with signed weights proves a challenge as standard simulation algorithms are inefficient in handling the negative weights. In particular, the natural representation of mixture variates as associated with latent component indicators is no longer available. We propose here an exact accept-reject algorithm in the general case of finite signed mixtures that relies on optimally pairing positive and negative components and designing a stratified sampling scheme on pairs. We analyze the performances of our approach, relative to the inverse cdf approach, since the cdf of the distribution remains available for standard signed mixtures.
Submitted 30 January, 2024;
originally announced January 2024.
-
Asymptotics of approximate Bayesian computation when summary statistics converge at heterogeneous rates
Authors:
Caroline Lawless,
Christian P. Robert,
Judith Rousseau,
Robin J. Ryder
Abstract:
We consider the asymptotic properties of Approximate Bayesian Computation (ABC) for the realistic case of summary statistics with heterogeneous rates of convergence. We allow some statistics to converge faster than the ABC tolerance, other statistics to converge slower, and cover the case where some statistics do not converge at all. We give conditions for the ABC posterior to converge, and provide an explicit representation of the shape of the ABC posterior distribution in our general setting; in particular, we show how the shape of the posterior depends on the number of slow statistics. We then quantify the gain brought by the local linear post-processing step.
Submitted 16 November, 2023;
originally announced November 2023.
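To make the role of the ABC tolerance in the abstract above concrete, here is a minimal sketch of plain rejection ABC with a single summary statistic. It is a generic textbook scheme, not the setting analyzed in the paper; the normal model, the uniform prior on $[-5, 5]$, and all function and parameter names are assumptions made for this example.

```python
import random
import statistics


def abc_rejection(observed_summary, n_obs, n_draws, tolerance, seed=0):
    """Keep prior draws whose simulated summary lands within `tolerance`
    of the observed summary (here, the sample mean)."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_draws):
        theta = rng.uniform(-5.0, 5.0)  # assumed uniform prior
        simulated = [rng.gauss(theta, 1.0) for _ in range(n_obs)]  # assumed N(theta, 1) model
        summary = statistics.fmean(simulated)
        if abs(summary - observed_summary) <= tolerance:
            accepted.append(theta)
    return accepted
```

Shrinking `tolerance` sharpens the approximation but lowers the acceptance rate; how the tolerance compares with the convergence rate of each summary is the question the abstract addresses.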
-
A Stochastic Analysis of Particle Systems with Pairing
Authors:
Vincent Fromion,
Philippe Robert,
Jana Zaherddine
Abstract:
Motivated by a general principle governing regulation mechanisms in biological cells, we investigate a general interaction scheme between different populations of particles and specific particles, referred to as agents. Assuming that each particle follows a random path in the medium, when a particle and an agent meet, they may bind and form a pair which has some specific functional properties. Such a pair is also subject to random events and it splits after some random amount of time. In a stochastic context, using a Markovian model for the vector of the number of paired particles, and by taking the total number of particles as a scaling parameter, we study the asymptotic behavior of the time evolution of the number of paired particles. Two scenarios are investigated: one with a large but fixed number of agents, and the other one, the dynamic case, when agents are created at a bounded rate and may die after some time when they are not paired. A first order limit theorem is established for the time evolution of the system in both cases. The proof of an averaging principle of the dynamic case is one of the main contributions of the paper. Limit theorems for fluctuations are obtained in the case of a fixed number of agents. The impact of dynamical arrivals of agents on the level of pairing of the system is discussed.
Submitted 7 October, 2023;
originally announced October 2023.
-
A Scaling Approach to Stochastic Chemical Reaction Networks
Authors:
Lucie Laurence,
Philippe Robert
Abstract:
We investigate the asymptotic properties of Markov processes associated to stochastic chemical reaction networks (CRNs) driven by the kinetics of the law of mass action. Their transition rates exhibit a polynomial dependence on the state variable, with possible discontinuities of the dynamics along the boundary of the state space. We investigate the scaling properties of these networks when the norm of the initial state is converging to infinity and the reaction rates are fixed. This scaling approach is used to gain insight into the time evolution of these networks when they start from a ``large'' initial state. The main difference with the scalings of the literature is that it changes neither the graph structure of the CRN nor its reaction rates. Several simple and interesting examples of CRNs are investigated with this scaling approach, including the detailed analysis of a CRN with several unusual asymptotic properties in the last section. We also show that a stability criterion due to Filonov for positive recurrence of Markov processes may simplify significantly the stability analysis of these networks.
Submitted 6 September, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
High flux strontium atom source
Authors:
C. -H. Feng,
P. Robert,
P. Bouyer,
B. Canuel,
J. Li,
S. Das,
C. C. Kwong,
D. Wilkowski,
M. Prevedelli,
A. Bertoldi
Abstract:
We present a novel cold strontium atom source designed for quantum sensors. We optimized the deceleration process to capture a large velocity class of atoms emitted from an oven and achieved a compact and low-power setup capable of generating a high atomic flux. Our approach involves velocity-dependent transverse capture of atoms using a two-dimensional magneto-optical trap. To enhance the atomic flux, we employ tailored magnetic fields that minimize radial beam expansion and incorporate a cascaded Zeeman-slowing configuration utilizing two optical frequencies. The performance is comparable to that of conventional Zeeman slower sources, and the scheme is applicable to other atomic species. Our results represent a significant advancement towards the deployment of portable and, possibly, space-based cold atom sensors.
Submitted 18 March, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
Insufficient Gibbs Sampling
Authors:
Antoine Luciano,
Christian P. Robert,
Robin J. Ryder
Abstract:
In some applied scenarios, the availability of complete data is restricted, often due to privacy concerns; only aggregated, robust and inefficient statistics derived from the data are made accessible. These robust statistics are not sufficient, but they demonstrate reduced sensitivity to outliers and offer enhanced data protection due to their higher breakdown point. We consider a parametric framework and propose a method to sample from the posterior distribution of parameters conditioned on various robust and inefficient statistics: specifically, the pairs (median, MAD) or (median, IQR), or a collection of quantiles. Our approach leverages a Gibbs sampler and simulates latent augmented data, which facilitates simulation from the posterior distribution of parameters belonging to specific families of distributions. A by-product of these samples from the joint posterior distribution of parameters and data given the observed statistics is that we can estimate Bayes factors based on observed statistics via bridge sampling. We validate and outline the limitations of the proposed methods through toy examples and an application to real-world income data.
Submitted 22 February, 2024; v1 submitted 27 July, 2023;
originally announced July 2023.
-
A Palm Space Approach to Non-Linear Hawkes Processes
Authors:
Philippe Robert,
Gaëtan Vignoud
Abstract:
A Hawkes process on $\R$ is a point process whose intensity function at time $t$ is a functional of its past activity before time $t$. It is defined by its activation function $Φ$ and its memory function $h$. In this paper, the Hawkes property is expressed as an operator on the sub-space of non-negative sequences associated to distances between its points. By using the classical correspondence between a stationary point process and its Palm measure, we establish a characterization of the corresponding Palm measure as an invariant distribution of a Markovian kernel. We prove that if $Φ$ is continuous and its growth rate is at most linear with a rate below some constant, then there exists a stationary Hawkes point process. The classical Lipschitz condition of the literature for an unbounded function $Φ$ is relaxed. Our proofs rely on a combination of coupling methods, monotonicity properties of linear Hawkes processes and classical results on Palm distributions. An investigation of the Hawkes process starting from the null measure, the empty state, on $\R_-$ plays also an important role. The linear case of Hawkes and Oakes is revisited at this occasion.
If the memory function $h$ is an exponential function, under a weak condition it is shown that there exists a unique stationary Hawkes point process. In this case, its Palm measure is expressed in terms of the invariant distribution of a one-dimensional Harris ergodic Markov chain. When the activation function is a polynomial $Φ$ with degree ${>}1$, there does not exist a stationary Hawkes process and if the Hawkes process starts from the empty state, a scaling result for the accumulation of its points is obtained.
Submitted 4 December, 2023; v1 submitted 22 December, 2022;
originally announced December 2022.
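For the linear case with an exponential memory function mentioned in the abstract above, a Hawkes process started from the empty state can be simulated by the standard thinning method of Ogata. The sketch below is only an illustration and is not taken from the paper; it assumes the activation function $Φ(x) = \mu + x$ and the memory function $h(t) = \alpha e^{-\beta t}$, and the names `mu`, `alpha`, `beta` are hypothetical parameters introduced for this example.

```python
import math
import random


def simulate_hawkes(mu, alpha, beta, horizon, seed=0):
    """Thinning simulation on [0, horizon] of a linear Hawkes process with
    intensity mu + sum_i alpha * exp(-beta * (t - t_i)), started empty."""
    rng = random.Random(seed)
    t = 0.0
    excitation = 0.0  # current value of sum_i alpha * exp(-beta * (t - t_i))
    points = []
    while True:
        # The intensity only decays until the next point, so its current
        # value is a valid upper bound for the thinning step.
        bound = mu + excitation
        wait = rng.expovariate(bound)
        if t + wait > horizon:
            break
        t += wait
        excitation *= math.exp(-beta * wait)
        # Accept the candidate with probability intensity(t) / bound.
        if rng.random() * bound <= mu + excitation:
            points.append(t)
            excitation += alpha
    return points
```

With `alpha / beta < 1` the long-run rate of points is close to `mu / (1 - alpha / beta)`, which gives a quick sanity check on the output.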
-
Sampling using Adaptive Regenerative Processes
Authors:
Hector McKimm,
Andi Q Wang,
Murray Pollock,
Christian P Robert,
Gareth O Roberts
Abstract:
Enriching Brownian motion with regenerations from a fixed regeneration distribution $μ$ at a particular regeneration rate $κ$ results in a Markov process that has a target distribution $π$ as its invariant distribution. For the purpose of Monte Carlo inference, implementing such a scheme requires firstly selection of regeneration distribution $μ$, and secondly computation of a specific constant $C$. Both of these tasks can be very difficult in practice for good performance. We introduce a method for adapting the regeneration distribution, by adding point masses to it. This allows the process to be simulated with as few regenerations as possible and obviates the need to find said constant $C$. Moreover, the choice of fixed $μ$ is replaced with the choice of the initial regeneration distribution, which is considerably less difficult. We establish convergence of this resulting self-reinforcing process and explore its effectiveness at sampling from a number of target distributions. The examples show that adapting the regeneration distribution guards against poor choices of fixed regeneration distribution and can reduce the error of Monte Carlo estimates of expectations of interest, especially when $π$ is skewed.
Submitted 20 February, 2024; v1 submitted 18 October, 2022;
originally announced October 2022.
-
Considerations for Task Allocation in Human-Robot Teams
Authors:
Arsha Ali,
Dawn M. Tilbury,
Lionel P. Robert Jr
Abstract:
In human-robot teams where agents collaborate together, there needs to be a clear allocation of tasks to agents. Task allocation can aid in achieving the presumed benefits of human-robot teams, such as improved team performance. Many task allocation methods have been proposed that include factors such as agent capability, availability, workload, fatigue, and task and domain-specific parameters. In this paper, selected work on task allocation is reviewed. In addition, some areas for continued and further consideration in task allocation are discussed. These areas include level of collaboration, novel tasks, unknown and dynamic agent capabilities, negotiation and fairness, and ethics. Where applicable, we also mention some of our work on task allocation. Through continued efforts and considerations in task allocation, human-robot teaming can be improved.
Submitted 6 October, 2022;
originally announced October 2022.
-
ImmunoLingo: Linguistics-based formalization of the antibody language
Authors:
Mai Ha Vu,
Philippe A. Robert,
Rahmad Akbar,
Bartlomiej Swiatczak,
Geir Kjetil Sandve,
Dag Trygve Truslew Haug,
Victor Greiff
Abstract:
Apparent parallels between natural language and biological sequence have led to a recent surge in the application of deep language models (LMs) to the analysis of antibody and other biological sequences. However, a lack of a rigorous linguistic formalization of biological sequence languages, which would define basic components, such as lexicon (i.e., the discrete units of the language) and grammar (i.e., the rules that link sequence well-formedness, structure, and meaning) has led to largely domain-unspecific applications of LMs, which do not take into account the underlying structure of the biological sequences studied. A linguistic formalization, on the other hand, establishes linguistically-informed and thus domain-adapted components for LM applications. It would facilitate a better understanding of how differences and similarities between natural language and biological sequences influence the quality of LMs, which is crucial for the design of interpretable models with extractable sequence-functions relationship rules, such as the ones underlying the antibody specificity prediction problem. Deciphering the rules of antibody specificity is crucial to accelerating rational and in silico biotherapeutic drug design. Here, we formalize the properties of the antibody language and thereby establish not only a foundation for the application of linguistic tools in adaptive immune receptor analysis but also for the systematic immunolinguistic studies of immune receptor specificity in general.
Submitted 29 November, 2022; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Bi-color atomic beam slower and magnetic field compensation for ultracold gases
Authors:
Jianing Li,
Kelvin Lim,
Swarup Das,
Thomas Zanon-Willette,
Chen-Hao Feng,
Paul Robert,
Andrea Bertoldi,
Philippe Bouyer,
Chang Chi Kwong,
Shau-Yu Lan,
David Wilkowski
Abstract:
Transversely loaded bidimensional-magneto-optical-traps (2D-MOT) have been recently developed as high flux sources for cold strontium atoms to realize a new generation of compact experimental setups. Here, we discuss the implementation of a cross-polarized bi-color slower for a strontium atomic beam improving the 2D-MOT loading, and increasing the number of atoms in a final MOT by eleven times. Our slowing scheme addresses simultaneously two excited Zeeman substates of the 88Sr 1S0->1P1 transition at 461 nm. We also realized a 3-axis active feedback control of the magnetic field down to the microgauss regime. Such a compensation is performed thanks to a network of eight magnetic field probes arranged in a cuboid configuration around the atomic cold sample, and a pair of coils in Helmholtz configuration along each of three Cartesian directions. Our active feedback is capable of efficiently suppressing most of the magnetically-induced position fluctuations of the 689 nm intercombination-line MOT.
Submitted 5 January, 2023; v1 submitted 18 September, 2022;
originally announced September 2022.
-
Stochastic Models of Regulation of Transcription in Biological Cells
Authors:
Vincent Fromion,
Philippe Robert,
Jana Zaherddine
Abstract:
In this paper we study an important global regulation mechanism of transcription of biological cells using specific macro-molecules, 6S RNAs. The functional property of 6S RNAs is to block the transcription of RNAs when the environment of the cell is not favorable. We investigate the efficiency of this mechanism with a scaling analysis of a stochastic model. The evolution equations of our model are driven by the law of mass action and the total number of polymerases is used as a scaling parameter. Two regimes are analyzed: exponential phase when the environment of the cell is favorable to its growth, and the stationary phase when resources are scarce. In both regimes, by defining properly occupation measures of the model, we prove an averaging principle for the associated multi-dimensional Markov process on a convenient timescale, as well as convergence results for fast variables of the system. An analytical expression of the asymptotic fraction of sequestrated polymerases in stationary phase is in particular obtained. The consequences of these results are discussed.
Submitted 19 August, 2022;
originally announced August 2022.
-
Computing Bayes: From Then 'Til Now
Authors:
Gael M. Martin,
David T. Frazier,
Christian P. Robert
Abstract:
This paper takes the reader on a journey through the history of Bayesian computation, from the 18th century to the present day. Beginning with the one-dimensional integral first confronted by Bayes in 1763, we highlight the key contributions of: Laplace, Metropolis (and, importantly, his co-authors!), Hammersley and Handscomb, and Hastings, all of which set the foundations for the computational revolution in the late 20th century -- led, primarily, by Markov chain Monte Carlo (MCMC) algorithms. A very short outline of 21st century computational methods -- including pseudo-marginal MCMC, Hamiltonian Monte Carlo, sequential Monte Carlo, and the various `approximate' methods -- completes the paper.
Submitted 1 August, 2022;
originally announced August 2022.
-
The Importance Markov Chain
Authors:
Charly Andral,
Randal Douc,
Hugo Marival,
Christian P. Robert
Abstract:
The Importance Markov chain is a novel algorithm bridging the gap between rejection sampling and importance sampling, moving from one to the other through a tuning parameter. Based on a modified sample of an instrumental Markov chain targeting an instrumental distribution (typically via a MCMC kernel), the Importance Markov chain produces an extended Markov chain where the marginal distribution of the first component converges to the target distribution. For example, when targeting a multimodal distribution, the instrumental distribution can be chosen as a tempered version of the target which allows the algorithm to explore its modes more efficiently. We obtain a Law of Large Numbers and a Central Limit Theorem as well as geometric ergodicity for this extended kernel under mild assumptions on the instrumental kernel. Computationally, the algorithm is easy to implement and preexisting libraries can be used to sample from the instrumental distribution.
Submitted 26 February, 2024; v1 submitted 17 July, 2022;
originally announced July 2022.
-
Linguistically inspired roadmap for building biologically reliable protein language models
Authors:
Mai Ha Vu,
Rahmad Akbar,
Philippe A. Robert,
Bartlomiej Swiatczak,
Victor Greiff,
Geir Kjetil Sandve,
Dag Trygve Truslew Haug
Abstract:
Deep neural-network-based language models (LMs) are increasingly applied to large-scale protein sequence data to predict protein function. However, being largely black-box models and thus challenging to interpret, current protein LM approaches do not contribute to a fundamental understanding of sequence-function mappings, hindering rule-based biotherapeutic drug development. We argue that guidance drawn from linguistics, a field specialized in analytical rule extraction from natural language data, can aid with building more interpretable protein LMs that are more likely to learn relevant domain-specific rules. Differences between protein sequence data and linguistic sequence data require the integration of more domain-specific knowledge in protein LMs compared to natural language LMs. Here, we provide a linguistics-based roadmap for protein LM pipeline choices with regard to training data, tokenization, token embedding, sequence embedding, and model interpretation. Incorporating linguistic ideas into protein LMs enables the development of next-generation interpretable machine-learning models with the potential of uncovering the biological mechanisms underlying sequence-function relationships.
Submitted 28 April, 2023; v1 submitted 3 July, 2022;
originally announced July 2022.
-
50 shades of Bayesian testing of hypotheses
Authors:
Christian P Robert
Abstract:
Hypothesis testing and model choice are quintessential questions for statistical inference and while the Bayesian paradigm seems ideally suited for answering these questions, it faces difficulties of its own ranging from prior modelling to calibration, to numerical implementation. This c
Submitted 14 June, 2022;
originally announced June 2022.
-
Discovery of three new near-pristine absorption clouds at $z=2.6$-4.4
Authors:
P. Frédéric Robert,
Michael T. Murphy,
John M. O'Meara,
Neil H. M. Crighton,
Michele Fumagalli
Abstract:
We report the discovery of three new "near-pristine" Lyman Limit Systems (LLSs), with metallicities ~1/1000 solar, at redshifts 2.6, 3.8 and 4.0, with a targeted survey at the Keck Observatory. High resolution echelle spectra of eight candidates yielded precise column densities of hydrogen and weak, but clearly detected, metal lines in seven LLSs; we previously reported the one remaining, apparently metal-free LLS, to have metallicity <1/10000 solar. Robust photoionisation modelling provides metallicities [Si/H] = -3.05 to -2.94, with 0.26 dex uncertainties (95% confidence) for three LLSs, and [Si/H] >~ -2.5 for the remaining four. Previous simulations suggest that near-pristine LLSs could be the remnants of PopIII supernovae, so comparing their detailed metal abundances with nucleosynthetic yields from supernovae models is an important goal. Unfortunately, at most two different metals were detected in each new system, despite their neutral hydrogen column densities (10^{19.2-19.4} cm^{-2}) being two orders of magnitude larger than the two previous, serendipitously discovered near-pristine LLSs. Nevertheless, the success of this first targeted survey for near-pristine systems demonstrates the prospect that a much larger, future survey could identify clear observational signatures of PopIII stars. With a well-understood selection function, such a survey would also yield the number density of near-pristine absorbers which, via comparison to future simulations, could reveal the origin(s) of these rare systems.
Submitted 6 June, 2022;
originally announced June 2022.
-
Evidence estimation in finite and infinite mixture models and applications
Authors:
Adrien Hairault,
Christian P. Robert,
Judith Rousseau
Abstract:
Estimating the model evidence - or marginal likelihood of the data - is a notoriously difficult task for finite and infinite mixture models and we reexamine here different Monte Carlo techniques advocated in the recent literature, as well as novel approaches based on Geyer (1994) reverse logistic regression technique, Chib (1995) algorithm, and Sequential Monte Carlo (SMC). Applications are numerous. In particular, testing for the number of components in a finite mixture model or against the fit of a finite mixture model for a given dataset has long been and still is an issue of much interest, albeit yet missing a fully satisfactory resolution. Using a Bayes factor to find the right number of components K in a finite mixture model is known to provide a consistent procedure. We furthermore establish the consistency of the Bayes factor when comparing a parametric family of finite mixtures against the nonparametric 'strongly identifiable' Dirichlet Process Mixture (DPM) model.
Submitted 11 May, 2022;
originally announced May 2022.
-
AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation
Authors:
Asif Khan,
Alexander I. Cowen-Rivers,
Antoine Grosnit,
Derrick-Goh-Xin Deik,
Philippe A. Robert,
Victor Greiff,
Eva Smorodina,
Puneet Rawat,
Kamil Dreczkowski,
Rahmad Akbar,
Rasul Tutunov,
Dany Bou-Ammar,
Jun Wang,
Amos Storkey,
Haitham Bou-Ammar
Abstract:
Antibodies are canonically Y-shaped multimeric proteins capable of highly specific molecular recognition. The CDRH3 region located at the tip of variable chains of an antibody dominates antigen-binding specificity. Therefore, it is a priority to design optimal antigen-specific CDRH3 regions to develop therapeutic antibodies. However, the combinatorial nature of CDRH3 sequence space makes it impossible to search for an optimal binding sequence exhaustively and efficiently using computational approaches. Here, we present \texttt{AntBO}: a combinatorial Bayesian optimisation framework enabling efficient \textit{in silico} design of the CDRH3 region. Ideally, antibodies are expected to have high target specificity and developability. We introduce a CDRH3 trust region that restricts the search to sequences with favourable developability scores to achieve this goal. For benchmarking, \texttt{AntBO} uses the \texttt{Absolut!} software suite as a black-box oracle to score the target specificity and affinity of designed antibodies \textit{in silico} in an unconstrained fashion~\citep{robert2021one}. The experiments performed for $159$ discretised antigens used in \texttt{Absolut!} demonstrate the benefit of \texttt{AntBO} in designing CDRH3 regions with diverse biophysical properties. In under $200$ calls to black-box oracle, \texttt{AntBO} can suggest antibody sequences that outperform the best binding sequence drawn from 6.9 million experimentally obtained CDRH3s and a commonly used genetic algorithm baseline. Additionally, \texttt{AntBO} finds very-high affinity CDRH3 sequences in only 38 protein designs whilst requiring no domain knowledge. We conclude \texttt{AntBO} brings automated antibody design methods closer to what is practically viable for in vitro experimentation.
Submitted 14 October, 2022; v1 submitted 29 January, 2022;
originally announced January 2022.
-
Approximating Bayes in the 21st Century
Authors:
Gael M. Martin,
David T. Frazier,
Christian P. Robert
Abstract:
The 21st century has seen an enormous growth in the development and use of approximate Bayesian methods. Such methods produce computational solutions to certain intractable statistical problems that challenge exact methods like Markov chain Monte Carlo: for instance, models with unavailable likelihoods, high-dimensional models, and models featuring large data sets. These approximate methods are the subject of this review. The aim is to help new researchers in particular -- and more generally those interested in adopting a Bayesian approach to empirical work -- distinguish between different approximate techniques; understand the sense in which they are approximate; appreciate when and why particular methods are useful; and see the ways in which they can be combined.
Submitted 20 December, 2021;
originally announced December 2021.
-
On the Spontaneous Dynamics of Synaptic Weights in Stochastic Models with Pair-Based STDP
Authors:
Philippe Robert,
Gaëtan Vignoud
Abstract:
We investigate spike-timing dependent plasticity (STDP) in the case of a synapse connecting two neural cells. We develop a theoretical analysis of several STDP rules using Markovian theory. In this context there are two different timescales, fast neural activity and slower synaptic weight updates. Exploiting this timescale separation, we derive the long-time limits of a single synaptic weight subject to STDP. We show that the pairing model of presynaptic and postsynaptic spikes controls the synaptic weight dynamics for small external input, on an excitatory synapse. This result implies in particular that mean-field analysis of plasticity may miss some important properties of STDP. Anti-Hebbian STDP seems to favor the emergence of a stable synaptic weight, but only for high external input. In the case of an inhibitory synapse the pairing schemes matter less, and we observe convergence of the synaptic weight to a non-null value only for Hebbian STDP. We extensively study different asymptotic regimes for STDP rules, raising interesting questions for future work on adaptive neural networks and, more generally, on adaptive systems.
Submitted 15 November, 2021;
originally announced November 2021.
-
Living on the Edge: An Unified Approach to Antithetic Sampling
Authors:
Roberto Casarin,
Radu V. Craiu,
Lorenzo Frattarolo,
Christian P. Robert
Abstract:
We identify recurrent ingredients in the antithetic sampling literature leading to a unified sampling framework. We introduce a new class of antithetic schemes that includes the most used antithetic proposals. This perspective enables the derivation of new properties of the sampling schemes: i) optimality in the Kullback-Leibler sense; ii) closed-form multivariate Kendall's $τ$ and Spearman's $ρ$; iii) ranking in concordance order; and iv) a central limit theorem that characterizes the stochastic behavior of Monte Carlo estimators when the sample size tends to infinity. Finally, we provide applications to Monte Carlo integration and Markov Chain Monte Carlo Bayesian estimation.
Submitted 6 December, 2021; v1 submitted 28 October, 2021;
originally announced October 2021.
-
Using Trust for Heterogeneous Human-Robot Team Task Allocation
Authors:
Arsha Ali,
Hebert Azevedo-Sa,
Dawn M. Tilbury,
Lionel P. Robert Jr
Abstract:
Human-robot teams have the ability to perform better across various tasks than human-only and robot-only teams. However, such improvements cannot be realized without proper task allocation. Trust is an important factor in teaming relationships, and can be used in the task allocation strategy. Despite the importance, most existing task allocation strategies do not incorporate trust. This paper reviews select studies on trust and task allocation. We also summarize and discuss how a bi-directional trust model can be used for a task allocation strategy. The bi-directional trust model represents task requirements and agents by their capabilities, and can be used to predict trust for both existing and new tasks. Our task allocation approach uses predicted trust in the agent and expected total reward for task assignment. Finally, we present some directions for future work, including the incorporation of trust from the human and human capacity for task allocation, and a negotiation phase for resolving task disagreements.
Submitted 8 October, 2021;
originally announced October 2021.
-
From the Head or the Heart? An Experimental Design on the Impact of Explanation on Cognitive and Affective Trust
Authors:
Qiaoning Zhang,
X. Jessie Yang,
Lionel P. Robert Jr
Abstract:
Automated vehicles (AVs) are social robots that can potentially benefit our society. According to the existing literature, AV explanations can promote passengers' trust by reducing the uncertainty associated with the AV's reasoning and actions. However, the literature on AV explanations and trust has failed to consider how the type of trust - cognitive versus affective - might alter this relationship. Yet, the existing literature has shown that the implications associated with trust vary widely depending on whether it is cognitive or affective. To address this shortcoming and better understand the impacts of explanations on trust in AVs, we designed a study to investigate the effectiveness of explanations on both cognitive and affective trust. We expect these results to be of great significance in designing AV explanations to promote AV trust.
Submitted 7 October, 2021;
originally announced October 2021.
-
High power continuous laser at 461 nm based on a compact and high-efficiency frequency-doubling linear cavity
Authors:
Chen-Hao Feng,
Sébastien Vidal,
Paul Robert,
Philippe Bouyer,
Bruno Desruelle,
Marco Prevedelli,
Johan Boullet,
Giorgio Santarelli,
Andrea Bertoldi
Abstract:
A Watt-level continuous and single frequency blue laser at 461 nm is obtained by frequency-doubling an amplified diode laser operating at 922 nm via an LBO crystal in a resonant Fabry-Pérot cavity. We achieved a best optical conversion efficiency equal to 87\% with more than 1 W output power in the blue, limited by the available input power. The frequency-converted beam is characterized in terms of long term power stability, residual intensity noise, and geometrical shape. The blue beam has a linewidth of the order of 1 MHz, and we used it to magneto-optically trap $^{88}$Sr atoms on the 5s$^{2}\,^{1}$S$_0$ -- 5s5p$\,^{1}$P$_1$ transition. The low-finesse, linear-cavity doubling system is very robust, maintains the lock for several days, and is compatible with a tenfold increase of the power levels which could be obtained with fully-fibered amplifiers and large mode area fibers.
Submitted 6 September, 2021; v1 submitted 14 June, 2021;
originally announced June 2021.
-
Stochastic Models of Neural Plasticity: A Scaling Approach
Authors:
Philippe Robert,
Gaetan Vignoud
Abstract:
In neuroscience, synaptic plasticity refers to the set of mechanisms driving the dynamics of neuronal connections, called synapses and represented by a scalar value, the synaptic weight. A Spike-Timing Dependent Plasticity (STDP) rule is a biologically-based model representing the time evolution of the synaptic weight as a functional of the past spiking activity of adjacent neurons. A general mathematical framework has been introduced in arXiv:2010.08195.
In this paper we develop and investigate a scaling approach of these models based on several biological assumptions. Experiments show that long-term synaptic plasticity evolves on a much slower timescale than the cellular mechanisms driving the activity of neuronal cells, like their spiking activity or the concentration of various chemical components created/suppressed by this spiking activity. For this reason, a scaled version of the stochastic model of arXiv:2010.08195 is introduced and a limit theorem, an averaging principle, is stated for a large class of plasticity kernels. A companion paper arXiv:2010.08790 is entirely devoted to the tightness properties used to prove these convergence results.
These averaging principles are used to study two important STDP models: pair-based rules and calcium-based rules. Our results are compared with the approximations of neuroscience STDP models. A class of discrete models of STDP rules is also investigated for the analytical tractability of its limiting dynamical system.
Submitted 16 November, 2021; v1 submitted 9 June, 2021;
originally announced June 2021.
-
A Unified Bi-directional Model for Natural and Artificial Trust in Human-Robot Collaboration
Authors:
Hebert Azevedo-Sa,
X. Jessie Yang,
Lionel P. Robert Jr.,
Dawn M. Tilbury
Abstract:
We introduce a novel capabilities-based bi-directional multi-task trust model that can be used for trust prediction from either a human or a robotic trustor agent. Tasks are represented in terms of their capability requirements, while trustee agents are characterized by their individual capabilities. Trustee agents' capabilities are not deterministic; they are represented by belief distributions. For each task to be executed, a higher level of trust is assigned to trustee agents who have demonstrated that their capabilities exceed the task's requirements. We report results of an online experiment with 284 participants, revealing that our model outperforms existing models for multi-task trust prediction from a human trustor. We also present simulations of the model for determining trust from a robotic trustor. Our model is useful for control authority allocation applications that involve human-robot teams.
Submitted 3 June, 2021;
originally announced June 2021.
-
Using Trust in Automation to Enhance Driver-(Semi)Autonomous Vehicle Interaction and Improve Team Performance
Authors:
Hebert Azevedo-Sa,
X. Jessie Yang,
Lionel P. Robert Jr.,
Dawn M. Tilbury
Abstract:
Trust in robots has been gathering attention from multiple directions, as it has special relevance in the theoretical descriptions of human-robot interactions. It is essential for reaching high acceptance and usage rates of robotic technologies in society, as well as for enabling effective human-robot teaming. Researchers have been trying to model the development of trust in robots to improve the overall rapport between humans and robots. Unfortunately, the miscalibration of trust in automation is a common issue that jeopardizes the effectiveness of automation use. It happens when a user's trust levels are not appropriate to the capabilities of the automation being used. Users can be: under-trusting the automation -- when they do not use the functionalities that the machine can perform correctly because of a lack of trust; or over-trusting the automation -- when, due to an excess of trust, they use the machine in situations where its capabilities are not adequate. The main objective of this work is to examine driver's trust development in the ADS. We aim to model how risk factors (e.g.: false alarms and misses from the ADS) and the short-term interactions associated with these risk factors influence the dynamics of drivers' trust in the ADS. The driving context facilitates the instrumentation to measure trusting behaviors, such as drivers' eye movements and usage time of the automated features. Our findings indicate that a reliable characterization of drivers' trusting behaviors and a consequent estimation of trust levels is possible. We expect that these techniques will permit the design of ADSs able to adapt their behaviors to attempt to adjust driver's trust levels. This capability could avoid under- and over-trusting, which could harm their safety or their performance.
Submitted 3 June, 2021;
originally announced June 2021.
-
Rao-Blackwellization in the MCMC era
Authors:
Christian P. Robert,
Gareth O. Roberts
Abstract:
Rao-Blackwellization is a notion often occurring in the MCMC literature, with possibly different meanings and connections with the original Rao--Blackwell theorem (Rao, 1945 and Blackwell, 1947), including a reduction of the variance of the resulting Monte Carlo approximations. This survey reviews some of the meanings of the term.
Submitted 4 January, 2021;
originally announced January 2021.
-
Averaging Principles for Markovian Models of Plasticity
Authors:
Philippe Robert,
Gaetan Vignoud
Abstract:
Mathematical models of biological neural networks are associated to a rich and complex class of stochastic processes. In this paper, we consider a simple {\em plastic} neural network whose {\em connectivity/synaptic strength} $(W(t))$ depends on a set of activity-dependent processes to model {\em synaptic plasticity}, a well-studied mechanism from neuroscience. A general class of stochastic models has been introduced in \cite{robert_mathematical_2020} to study the stochastic process $(W(t))$. It has been observed experimentally that its dynamics occur on a much slower timescale than that of the main cellular processes. The purpose of this paper is to establish limit theorems for the distribution of $(W(t))$ with respect to the fast timescale of neuronal processes.
The central result of the paper is an averaging principle for the stochastic process $(W(t))$. Mathematically, the key variable is the point process whose jumps occur at the instants of neuronal spikes. A thorough analysis of several of its unbounded additive functionals is achieved in the slow-fast limit. Additionally, technical results on interacting shot-noise processes are developed and used in the general proof of the averaging principle.
Submitted 19 December, 2020; v1 submitted 17 October, 2020;
originally announced October 2020.
-
Stochastic Models of Neural Synaptic Plasticity
Authors:
Philippe Robert,
Gaetan Vignoud
Abstract:
In neuroscience, learning and memory are usually associated to long-term changes of neuronal connectivity. In this context, synaptic plasticity refers to the set of mechanisms driving the dynamics of neuronal connections, called {\em synapses} and represented by a scalar value, the synaptic weight. Spike-Timing Dependent Plasticity (STDP) is a biologically-based model representing the time evolution of the synaptic weight as a functional of the past spiking activity of adjacent neurons.
If numerous models of neuronal cells have been proposed in the mathematical literature, few of them include a variable for the time-varying strength of the connection. A new, general, mathematical framework is introduced to study synaptic plasticity associated to different STDP rules. The system composed of two neurons connected by a single synapse is investigated and a stochastic process describing its dynamical behavior is presented and analyzed. The notion of plasticity kernel is introduced as a key component of plastic neural networks models, generalizing a notion used for pair-based models. We show that a large number of STDP rules from neuroscience and physics can be represented by this formalism. Several aspects of these models are discussed and compared to canonical models of computational neuroscience. An important sub-class of plasticity kernels with a Markovian formulation is also defined and investigated. In these models, the time evolution of cellular processes such as the neuronal membrane potential and the concentrations of chemical components created/suppressed by spiking activity has the Markov property.
Submitted 9 June, 2021; v1 submitted 16 October, 2020;
originally announced October 2020.
-
A Distributed Hierarchy Framework for Enhancing Cyber Security of Control Center Applications
Authors:
Chetan Kumar Kuraganti,
Bryan Paul Robert,
Gurunath Gurrala,
Ashish Joglekar,
Arun Babu Puthuparambil,
Rajesh Sundaresan,
Himanshu Tyagi
Abstract:
Recent cyber-attacks on power grids highlight the necessity to protect the critical functionalities of a control center vital for the safe operation of a grid. Even in a distributed framework one central control center acts as a coordinator in majority of the control center architectures. Such a control center can become a prime target for cyber as well as physical attacks, and, hence, a single point failure can lead to complete loss of visibility of the power grid. If the control center which runs the critical functions in a distributed computing environment can be randomly chosen between the available control centers in a secure framework, the ability of the attacker in causing a single point failure can be reduced to a great extent. To achieve this, a novel distributed hierarchy based framework to secure critical functions is proposed in this paper. The proposed framework ensures that the data aggregation and the critical functions are carried out at a random location, and incorporates security features such as attestation and trust management to detect compromised agents. A theoretical result is proved on the evolution and convergence of the trust values in the proposed trust management protocol. It is also shown that the system is nominally robust so long as the number of compromised nodes is strictly less than one-half of the nodes minus 1. For demonstration, a Kalman filter-based state estimation using phasor measurements is used as the critical function to be secured. The proposed framework's implementation feasibility is tested on a physical hardware cluster of Parallella boards. The framework is also validated using simulations on the IEEE 118 bus system.
Submitted 10 October, 2020;
originally announced October 2020.
-
Personality in Healthcare Human Robot Interaction (H-HRI): A Literature Review and Brief Critique
Authors:
Connor Esterwood,
Lionel P. Robert
Abstract:
Robots are becoming an important way to deliver health care, and personality is vital to understanding their effectiveness. Despite this, there is a lack of a systematic overarching understanding of personality in health care human robot interaction (H-HRI). To address this, the authors conducted a review that identified 18 studies on personality in H-HRI. This paper presents the results of that systematic literature review. Insights are derived from this review regarding the methodologies, outcomes, and samples utilized. The authors of this review discuss findings across this literature while identifying several gaps worthy of attention. Overall, this paper is an important starting point in understanding personality in H-HRI.
Submitted 15 August, 2020;
originally announced August 2020.
-
Computing Bayes: Bayesian Computation from 1763 to the 21st Century
Authors:
Gael M. Martin,
David T. Frazier,
Christian P. Robert
Abstract:
The Bayesian statistical paradigm uses the language of probability to express uncertainty about the phenomena that generate observed data. Probability distributions thus characterize Bayesian analysis, with the rules of probability used to transform prior probability distributions for all unknowns - parameters, latent variables, models - into posterior distributions, subsequent to the observation of data. Conducting Bayesian analysis requires the evaluation of integrals in which these probability distributions appear. Bayesian computation is all about evaluating such integrals in the typical case where no analytical solution exists. This paper takes the reader on a chronological tour of Bayesian computation over the past two and a half centuries. Beginning with the one-dimensional integral first confronted by Bayes in 1763, through to recent problems in which the unknowns number in the millions, we place all computational problems into a common framework, and describe all computational methods using a common notation. The aim is to help new researchers in particular - and more generally those interested in adopting a Bayesian approach to empirical work - make sense of the plethora of computational techniques that are now on offer; understand when and why different methods are useful; and see the links that do exist, between them all.
Submitted 5 December, 2020; v1 submitted 14 April, 2020;
originally announced April 2020.
-
Efficient Behavior-aware Control of Automated Vehicles at Crosswalks using Minimal Information Pedestrian Prediction Model
Authors:
Suresh Kumaar Jayaraman,
Lionel P. Robert Jr.,
Xi Jessie Yang,
Anuj K. Pradhan,
Dawn M. Tilbury
Abstract:
For automated vehicles (AVs) to reliably navigate through crosswalks, they need to understand pedestrians' crossing behaviors. Simple and reliable pedestrian behavior models aid in real-time AV control by allowing the AVs to predict future pedestrian behaviors. In this paper, we present a Behavior-aware Model Predictive Controller (B-MPC) for AVs that incorporates long-term predictions of pedestrian crossing behavior using a previously developed pedestrian crossing model. The model incorporates pedestrians' gap acceptance behavior and utilizes minimal pedestrian information, namely their position and speed, to predict pedestrians' crossing behaviors. The B-MPC controller is validated through simulations and compared to a rule-based controller. By incorporating predictions of pedestrian behavior, the B-MPC controller is able to efficiently plan for longer horizons and handle a wider range of pedestrian interaction scenarios than the rule-based controller. Results demonstrate the applicability of the controller for safe and efficient navigation at crossing scenarios.
Submitted 22 March, 2020;
originally announced March 2020.
-
Analysis and Prediction of Pedestrian Crosswalk Behavior during Automated Vehicle Interactions
Authors:
Suresh Kumaar Jayaraman,
Dawn M. Tilbury,
X. Jessie Yang,
Anuj K. Pradhan,
Lionel P. Robert Jr
Abstract:
For safe navigation around pedestrians, automated vehicles (AVs) need to plan their motion by accurately predicting pedestrians' trajectories over long time horizons. Current approaches to AV motion planning around crosswalks predict only for short time horizons (1-2 s) and are based on data from pedestrian interactions with human-driven vehicles (HDVs). In this paper, we develop a hybrid systems model that uses pedestrians' gap acceptance behavior and constant velocity dynamics for long-term pedestrian trajectory prediction when interacting with AVs. Results demonstrate the applicability of the model for long-term (> 5 s) pedestrian trajectory prediction at crosswalks. Further, we compared measures of pedestrian crossing behaviors in the immersive virtual environment (when interacting with AVs) to those in the real world (results of published studies of pedestrians interacting with HDVs), and found similarities between the two. These similarities demonstrate the applicability of the hybrid model of AV interactions developed from an immersive virtual environment (IVE) for real-world scenarios for both AVs and HDVs.
Submitted 22 March, 2020;
originally announced March 2020.
-
Designing Fair AI for Managing Employees in Organizations: A Review, Critique, and Design Agenda
Authors:
Lionel P. Robert,
Casey Pierce,
Liz Morris,
Sangmi Kim,
Rasha Alahmad
Abstract:
Organizations are rapidly deploying artificial intelligence (AI) systems to manage their workers. However, AI has been found at times to be unfair to workers. Unfairness toward workers has been associated with decreased worker effort and increased worker turnover. To avoid such problems, AI systems must be designed to support fairness and redress instances of unfairness. Despite the attention related to AI unfairness, there has not been a theoretical and systematic approach to developing a design agenda. This paper addresses the issue in three ways. First, we introduce the organizational justice theory, three different fairness types (distributive, procedural, interactional), and the frameworks for redressing instances of unfairness (retributive justice, restorative justice). Second, we review the design literature that specifically focuses on issues of AI fairness in organizations. Third, we propose a design agenda for AI fairness in organizations that applies each of the fairness types to organizational scenarios. Then, the paper concludes with implications for future research.
Submitted 20 February, 2020;
originally announced February 2020.
-
Generalized Poisson Difference Autoregressive Processes
Authors:
Giulia Carallo,
Roberto Casarin,
Christian P. Robert
Abstract:
This paper introduces a new stochastic process with values in the set Z of signed integers. The increments of the process are Poisson differences and the dynamics have an autoregressive structure. We study the properties of the process and exploit the thinning representation to derive stationarity conditions and the stationary distribution of the process. We provide a Bayesian inference method and an efficient posterior approximation procedure based on Monte Carlo. Numerical illustrations on both simulated and real data show the effectiveness of the proposed inference.
Submitted 11 February, 2020;
originally announced February 2020.
-
A Review of Personality in Human Robot Interactions
Authors:
Lionel P. Robert,
Rasha Alahmad,
Connor Esterwood,
Sangmi Kim,
Sangseok You,
Qiaoning Zhang
Abstract:
Personality has been identified as a vital factor in understanding the quality of human robot interactions. Despite this, the research in this area remains fragmented and lacks a coherent framework. This makes it difficult to understand what we know and identify what we do not. As a result, our knowledge of personality in human robot interactions has not kept pace with the deployment of robots in organizations or in our broader society. To address this shortcoming, this paper reviews 83 articles and 84 separate studies to assess the current state of human robot personality research. This review: (1) highlights major thematic research areas, (2) identifies gaps in the literature, (3) derives and presents major conclusions from the literature, and (4) offers guidance for future research.
Submitted 5 February, 2020; v1 submitted 31 January, 2020;
originally announced January 2020.
-
Markov Chain Monte Carlo Methods, a survey with some frequent misunderstandings
Authors:
Christian P. Robert,
Wu Changye
Abstract:
In this chapter, we review some of the most standard MCMC tools used in Bayesian computation, along with vignettes on standard misunderstandings of these approaches taken from Q&A's on the forum Cross Validated answered by the first author.
Submitted 17 January, 2020;
originally announced January 2020.
-
Examining the Effects of Emotional Valence and Arousal on Takeover Performance in Conditionally Automated Driving
Authors:
Na Du,
Feng Zhou,
Elizabeth Pulver,
Dawn M. Tilbury,
Lionel P. Robert,
Anuj K. Pradhan,
X. Jessie Yang
Abstract:
In conditionally automated driving, drivers have difficulty in takeover transitions as they become increasingly decoupled from the operational level of driving. Factors influencing takeover performance, such as takeover lead time and the engagement of non-driving related tasks, have been studied in the past. However, despite the important role emotions play in human-machine interaction and in manual driving, little is known about how emotions influence drivers' takeover performance. This study, therefore, examined the effects of emotional valence and arousal on drivers' takeover timeliness and quality in conditionally automated driving. We conducted a driving simulation experiment with 32 participants. Movie clips were played for emotion induction. Participants with different levels of emotional valence and arousal were required to take over control from automated driving, and their takeover time and quality were analyzed. Results indicate that positive valence led to better takeover quality in the form of a smaller maximum resulting acceleration and a smaller maximum resulting jerk. However, high arousal did not yield an advantage in takeover time. This study contributes to the literature by demonstrating how emotional valence and arousal affect takeover performance. The benefits of positive emotions carry over from manual driving to conditionally automated driving while the benefits of arousal do not.
Submitted 13 January, 2020;
originally announced January 2020.
-
Parallelising MCMC via Random Forests
Authors:
Wu Changye,
Christian P. Robert
Abstract:
For Bayesian computation in big data contexts, the divide-and-conquer MCMC concept splits the whole data set into batches, runs MCMC algorithms separately over each batch to produce samples of parameters, and combines them to produce an approximation of the target distribution. In this article, we embed random forests into this framework and use each subposterior/partial-posterior as a proposal distribution to implement importance sampling. Unlike the existing divide-and-conquer MCMC, our methods are based on scaled subposteriors, whose scale factors are not necessarily restricted to being equal to one or to the number of subsets. Through several experiments, we show that our methods work well with models ranging from Gaussian cases to strongly non-Gaussian cases, and include model misspecification.
Submitted 21 November, 2019;
originally announced November 2019.
-
An Automated Vehicle (AV) like Me? The Impact of Personality Similarities and Differences between Humans and AVs
Authors:
Qiaoning Zhang,
Connor Esterwood,
X. Jessie Yang,
Lionel P. Robert Jr
Abstract:
To better understand the impacts of similarities and dissimilarities in human and AV personalities, we conducted an experimental study with 443 individuals. Generally, similarities in human and AV personalities led to a higher perception of AV safety only when both were high in specific personality traits. Dissimilarities in human and AV personalities also yielded a higher perception of AV safety, but only when the AV was higher than the human in a particular personality trait.
Submitted 11 September, 2019;
originally announced September 2019.
-
IntrinSeqNet: Learning to Estimate the Reflectance from Varying Illumination
Authors:
Grégoire Nieto,
Mohammad Rouhani,
Philippe Robert
Abstract:
This article has been removed by arXiv administrators because the submitter did not have the rights to agree to the license at the time of submission
Submitted 13 June, 2019;
originally announced June 2019.