Discovering Agents

Kenton, Zachary; Kumar, Ramana; Farquhar, Sebastian; Richens, Jonathan; MacDermott, Matt; Everitt, Tom

Computer Science > Artificial Intelligence

arXiv:2208.08345 (cs)

[Submitted on 17 Aug 2022 (v1), last revised 24 Aug 2022 (this version, v2)]

Title:Discovering Agents

Authors:Zachary Kenton, Ramana Kumar, Sebastian Farquhar, Jonathan Richens, Matt MacDermott, Tom Everitt

View PDF

Abstract:Causal models of agents have been used to analyse the safety aspects of machine learning systems. But identifying agents is non-trivial -- often the causal model is just assumed by the modeler without much justification -- and modelling failures can lead to mistakes in the safety analysis. This paper proposes the first formal causal definition of agents -- roughly that agents are systems that would adapt their policy if their actions influenced the world in a different way. From this we derive the first causal discovery algorithm for discovering agents from empirical data, and give algorithms for translating between causal models and game-theoretic influence diagrams. We demonstrate our approach by resolving some previous confusions caused by incorrect causal modelling of agents.

Comments:	Some typos corrected
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2208.08345 [cs.AI]
	(or arXiv:2208.08345v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2208.08345

Submission history

From: Zachary Kenton [view email]
[v1] Wed, 17 Aug 2022 15:13:25 UTC (1,362 KB)
[v2] Wed, 24 Aug 2022 10:01:23 UTC (1,381 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2022-08

Change to browse by:

cs
cs.LG

References & Citations

1 blog link

(what is this?)

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Discovering Agents

Submission history

Access Paper:

References & Citations

1 blog link

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Discovering Agents

Submission history

Access Paper:

References & Citations

1 blog link

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators