Active inference and epistemic value

Friston, Karl, Rigoli, Francesco, Ognibene, Dimitri, Mathys, Christoph, Fitzgerald, Thomas and Pezzulo, Giovanni (2015) Active inference and epistemic value. Cognitive Neuroscience, 6 (4). pp. 187-214. ISSN 1758-8928

Full text not available from this repository. (Request a copy)

Abstract

We offer a formal treatment of choice behavior based on the premise that agents minimize the expected free energy of future outcomes. Crucially, the negative free energy or quality of a policy can be decomposed into extrinsic and epistemic (or intrinsic) value. Minimizing expected free energy is therefore equivalent to maximizing extrinsic value or expected utility (defined in terms of prior preferences or goals), while maximizing information gain or intrinsic value (or reducing uncertainty about the causes of valuable outcomes). The resulting scheme resolves the exploration-exploitation dilemma: Epistemic value is maximized until there is no further information gain, after which exploitation is assured through maximization of extrinsic value. This is formally consistent with the Infomax principle, generalizing formulations of active vision based upon salience (Bayesian surprise) and optimal decisions based on expected utility and risk-sensitive (Kullback-Leibler) control. Furthermore, as with previous active inference formulations of discrete (Markovian) problems, ad hoc softmax parameters become the expected (Bayes-optimal) precision of beliefs about, or confidence in, policies. This article focuses on the basic theory, illustrating the ideas with simulations. A key aspect of these simulations is the similarity between precision updates and dopaminergic discharges observed in conditioning paradigms.

Item Type: Article
Uncontrolled Keywords: active inference,agency,bayesian inference,bounded rationality,free energy,utility theory,information gain,bayesian surprise,epistemic value,exploration,exploitation,dopamine neurons,neural organization,incentive salience,visual-attention,action selection,basal ganglia,trade-off,information,reward,systems
Faculty \ School: Faculty of Social Sciences > School of Psychology
Depositing User: Pure Connector
Date Deposited: 15 Apr 2016 10:01
Last Modified: 22 Apr 2020 01:16
URI: https://ueaeprints.uea.ac.uk/id/eprint/58264
DOI: 10.1080/17588928.2015.1020053

Actions (login required)

View Item View Item