Bruineberg and colleagues provide a thorough review of Markov blankets and their limitations in the context of the free-energy principle (FEP). We wish to complement this by drawing attention to two additional issues that we believe have important consequences for the FEP.
Firstly, contrary to what one might expect, the condition known as “Markov blanket” in the FEP literature is generally not guaranteed by a sensor–motor loop structure. Secondly, the Markov blanket condition needed for the FEP is far stronger than it appears to be. These issues severely limit the scope of applicability of current formulations of the FEP. Fortunately, we believe they can be solved, and give some hints towards a resolution.
As Bruineberg et al. explain, the notion of a Markov blanket arises in the context of graphical models, and in particular, Bayesian networks. In a Bayesian network each node represents a random variable, and their joint distribution factors in a particular way that depends on the topology of the graph (Pearl, Reference Pearl1988).
The literature on FEP is also concerned with graphs that are not Bayesian networks. Each node in these graphs represents a dynamical variable of a system and an edge represents the possibility that one dynamical variable can influence another. These include the adjacency matrix described in Bruineberg et al.'s section 4.2, and also the sensor–motor loop as illustrated in their Figure 2. Typically, the edges in such graphs correspond to nonzero terms in a Jacobian matrix. We will call such graphs influence graphs.
A stationary state defines a joint distribution over the nodes of an influence graph. There is then some resemblance between the influence graphs and Bayesian networks, since both contain nodes that represent random variables and edges that represent influences of some kind.
However, these two types of graph are fundamentally different. Influence graphs are not necessarily acyclic, but more importantly, the theorems in Pearl's formalism do not apply to influence graphs. In particular, one might expect that the sensor–motor loop (Bruineberg et al.'s Fig. 2) would imply the time-synchronous Markov blanket condition
However, this is not the case in general – and this is important because (1) is used in deriving the FEP. This issue has been recently pointed out (Aguilera, Millidge, Tschantz, & Buckley, Reference Aguilera, Millidge, Tschantz and Buckley2021; Biehl, Pollock, & Kanai, Reference Biehl, Pollock and Kanai2021), and while it has been acknowledged in some of the most recent FEP literature it is not as widely known as it should be. We sketch the underlying reason for it in Figure 1.
Figure 1. Top: A “time-unrolled” sensor–motor loop in discrete time. The time-unrolled graph is a Bayesian network, and hence the Pearl framework can be applied. It follows that the current internal and external states, μ 3 and ϕ 3 (blue, orange) are conditionally independent given the nodes in light green, which consist of the past histories of sensor and actuator states, s 1, s 2, s 3 and a 1, a 2, a 3, as well as the initial states μ 0 and ϕ 0. (The dashed line indicates that the initial states might be correlated.) Bottom: The current internal and external states are in general not conditionally independent given only the current sensor and actuator states, s 3 and a 3, because μ 3 and ϕ 3 have common ancestors that are not screened off by these nodes, for example, μ 1 and ϕ 1 (red). This is true regardless of stationarity. If the system is ergodic then the dependence on the initial states will disappear in the infinite limit, so that we can effectively say that μ t and ϕ t are conditionally independent given the infinite past history of the sensorimotor states.
Recent works (e.g., Friston, Heins, Ueltzhöffer, Da Costa, and Parr, Reference Friston, Heins, Ueltzhöffer, Da Costa and Parr2021a; Friston, Da Costa, and Parr, Reference Friston, Da Costa and Parr2021b) have sought to address this by seeking additional conditions or conjectures under which the needed relationship holds. However, the fact that these conditions are highly non-trivial suggests that the scope of the FEP may be much more limited than previously thought.
Furthermore, (1) itself puts a very strong constraint on a system's dynamics. One way to see this is via the data processing inequality (Cover & Thomas, Reference Cover and Thomas2006, p. 34), which imposes that if (1) holds then all information that μ t and ϕ t share needs to be present in (s t, a t). This would mean that the internal and external states could share no more information than is contained in the sensor and motor states at the current time.
But cases where information is stored in the environment and the agent but not in the blanket are ubiquitous. Imagine a friend gives you a phone number written on a piece of paper, which you memorise and then store in a box. The statistical independence between internal and external variables conditioned on active and sensory ones is broken as soon as the piece of paper is away from your sensory input. Once it's out of sight the phone number cannot be stored simultaneously in your internal state and on the piece of paper. As Parr, Da Costa, Heins, Ramstead, and Friston (Reference Parr, Da Costa, Heins, Ramstead and Friston2021) discuss, this need not be true in transients even if it holds in stationary state. Nevertheless it puts an unrealistic constraint on the stationary dynamics, which we don't expect to be applicable to living organisms.
A possible resolution of this limitation follows from Figure 1. Although (1) cannot be assumed for a general sensor–motor loop, we do have the relationship
We expect an analogous result in continuous time. The internal and external states are not conditionally independent given the current sensorimotor states, but, under only mild assumptions, they are conditionally independent given the sensorimotor history. Alternative constructions of blankets that follow these principles are currently being investigated (e.g., Rosas, Mediano, Biehl, Chandaria, and Polani, Reference Rosas, Mediano, Biehl, Chandaria, Polani, Verbelen, Lanillos, Buckley and De Boom2020).
This makes intuitive sense: Your knowledge of the world is not limited by what you can sense at the current moment, but it is limited by what you have been able to sense over your whole lifetime. If a new version of the FEP can be constructed based on this alternative conditional independence relation then it will be more encompassing and will have something close to the broad applicability that was originally intended.
Bruineberg and colleagues provide a thorough review of Markov blankets and their limitations in the context of the free-energy principle (FEP). We wish to complement this by drawing attention to two additional issues that we believe have important consequences for the FEP.
Firstly, contrary to what one might expect, the condition known as “Markov blanket” in the FEP literature is generally not guaranteed by a sensor–motor loop structure. Secondly, the Markov blanket condition needed for the FEP is far stronger than it appears to be. These issues severely limit the scope of applicability of current formulations of the FEP. Fortunately, we believe they can be solved, and give some hints towards a resolution.
As Bruineberg et al. explain, the notion of a Markov blanket arises in the context of graphical models, and in particular, Bayesian networks. In a Bayesian network each node represents a random variable, and their joint distribution factors in a particular way that depends on the topology of the graph (Pearl, Reference Pearl1988).
The literature on FEP is also concerned with graphs that are not Bayesian networks. Each node in these graphs represents a dynamical variable of a system and an edge represents the possibility that one dynamical variable can influence another. These include the adjacency matrix described in Bruineberg et al.'s section 4.2, and also the sensor–motor loop as illustrated in their Figure 2. Typically, the edges in such graphs correspond to nonzero terms in a Jacobian matrix. We will call such graphs influence graphs.
A stationary state defines a joint distribution over the nodes of an influence graph. There is then some resemblance between the influence graphs and Bayesian networks, since both contain nodes that represent random variables and edges that represent influences of some kind.
However, these two types of graph are fundamentally different. Influence graphs are not necessarily acyclic, but more importantly, the theorems in Pearl's formalism do not apply to influence graphs. In particular, one might expect that the sensor–motor loop (Bruineberg et al.'s Fig. 2) would imply the time-synchronous Markov blanket condition
However, this is not the case in general – and this is important because (1) is used in deriving the FEP. This issue has been recently pointed out (Aguilera, Millidge, Tschantz, & Buckley, Reference Aguilera, Millidge, Tschantz and Buckley2021; Biehl, Pollock, & Kanai, Reference Biehl, Pollock and Kanai2021), and while it has been acknowledged in some of the most recent FEP literature it is not as widely known as it should be. We sketch the underlying reason for it in Figure 1.
Figure 1. Top: A “time-unrolled” sensor–motor loop in discrete time. The time-unrolled graph is a Bayesian network, and hence the Pearl framework can be applied. It follows that the current internal and external states, μ 3 and ϕ 3 (blue, orange) are conditionally independent given the nodes in light green, which consist of the past histories of sensor and actuator states, s 1, s 2, s 3 and a 1, a 2, a 3, as well as the initial states μ 0 and ϕ 0. (The dashed line indicates that the initial states might be correlated.) Bottom: The current internal and external states are in general not conditionally independent given only the current sensor and actuator states, s 3 and a 3, because μ 3 and ϕ 3 have common ancestors that are not screened off by these nodes, for example, μ 1 and ϕ 1 (red). This is true regardless of stationarity. If the system is ergodic then the dependence on the initial states will disappear in the infinite limit, so that we can effectively say that μ t and ϕ t are conditionally independent given the infinite past history of the sensorimotor states.
Recent works (e.g., Friston, Heins, Ueltzhöffer, Da Costa, and Parr, Reference Friston, Heins, Ueltzhöffer, Da Costa and Parr2021a; Friston, Da Costa, and Parr, Reference Friston, Da Costa and Parr2021b) have sought to address this by seeking additional conditions or conjectures under which the needed relationship holds. However, the fact that these conditions are highly non-trivial suggests that the scope of the FEP may be much more limited than previously thought.
Furthermore, (1) itself puts a very strong constraint on a system's dynamics. One way to see this is via the data processing inequality (Cover & Thomas, Reference Cover and Thomas2006, p. 34), which imposes that if (1) holds then all information that μ t and ϕ t share needs to be present in (s t, a t). This would mean that the internal and external states could share no more information than is contained in the sensor and motor states at the current time.
But cases where information is stored in the environment and the agent but not in the blanket are ubiquitous. Imagine a friend gives you a phone number written on a piece of paper, which you memorise and then store in a box. The statistical independence between internal and external variables conditioned on active and sensory ones is broken as soon as the piece of paper is away from your sensory input. Once it's out of sight the phone number cannot be stored simultaneously in your internal state and on the piece of paper. As Parr, Da Costa, Heins, Ramstead, and Friston (Reference Parr, Da Costa, Heins, Ramstead and Friston2021) discuss, this need not be true in transients even if it holds in stationary state. Nevertheless it puts an unrealistic constraint on the stationary dynamics, which we don't expect to be applicable to living organisms.
A possible resolution of this limitation follows from Figure 1. Although (1) cannot be assumed for a general sensor–motor loop, we do have the relationship
We expect an analogous result in continuous time. The internal and external states are not conditionally independent given the current sensorimotor states, but, under only mild assumptions, they are conditionally independent given the sensorimotor history. Alternative constructions of blankets that follow these principles are currently being investigated (e.g., Rosas, Mediano, Biehl, Chandaria, and Polani, Reference Rosas, Mediano, Biehl, Chandaria, Polani, Verbelen, Lanillos, Buckley and De Boom2020).
This makes intuitive sense: Your knowledge of the world is not limited by what you can sense at the current moment, but it is limited by what you have been able to sense over your whole lifetime. If a new version of the FEP can be constructed based on this alternative conditional independence relation then it will be more encompassing and will have something close to the broad applicability that was originally intended.
Financial support
M. B. and N. V. acknowledge the support of Grant 62229 from the John Templeton Foundation. The opinions expressed in this publication are those of the author(s) and do not necessarily reflect the views of the John Templeton Foundation. F. R. is supported by the Ad Astra Chandaria Foundation.
Conflict of interest
None.