1 Introduction
The time evolution of dynamical systems is a central topic in ergodic theory, probability theory, geometry and analysis. Similarly, decay properties of dissipative quantum systems also naturally arise in quantum many-body systems, quantum information theory and high energy physics. The aim of this article is to provide a new framework of decay estimates that applies for both classical and quantum systems in the non-ergodic setting. Here, ergodicity means the system admits a unique equilibrium state, also termed primitive in mathematical physics literature, whereas non-ergodic systems admit multiple equilibrium states.
Logarithmic Sobolev inequality (LSI) is a powerful functional inequality in deriving the mixing time of Markovian evolution. LSI was first introduced in the seminal works of Gross [Reference Gross35, Reference Gross34] as an equivalent reformulation of Nelson’s Hypercontractivity (HC) [Reference Nelson58, Reference Nelson59]. It has been widely studied on manifolds and graphs for the deep connections to geometry and concentration phenomenon. However, attempts to translate the notion of hypercontractivity to the matrix-valued setting or the non-ergodic setting failed miserably [Reference Bardet and Rouzé7], due to the lack of uniform convexity of certain noncommutative spaces [Reference Junge and Parcet42]. This results in a roadblock for the standard argument connecting hypercontractivity, entropy decay and mixing time, as well as the lack of tensorization property used in many-body systems.
We propose a new, direct approach to entropy decay that also applies to fully noncommutative, non-ergodic setting. Let $T_t=e^{-Lt}$ be a quantum Markov semigroup on a finite von Neumann algebra ${\mathcal M}$ with generator L (i.e., a semigroup of completely positive trace-preserving maps). We aim to establish the exponential entropy decay,
where $D(\rho \|\sigma ) \hspace {.1cm} = \hspace {.1cm} \tau (\rho \ln \rho -\rho \ln \sigma )$ is the quantum relative entropy for two density operators $\rho ,\sigma $ and $\tau $ can be any normal faithful trace on ${\mathcal M}$ . The equilibrium state $E(\rho )$ associated to any initial density $\rho $ is given by the ergodic mean $\displaystyle E(\rho )=\lim _{t\to \infty }\frac {1}{t}\int _0^t T_s(\rho ) ds$ . It turns out that the simple properties of relative entropy enable us to prove a direct link between positivity order and entropy decay. Indeed, let us for simplicity assume that the semigroup is trace symmetric
Under this assumption, we discover the following entropy difference lemma:
The new quantity $D_{\Phi }(\rho )$ is the loss of von Neumann entropy under a channel map $\Phi $ . Our second ingredient is a stability estimate inspired by the positivity order condition by Gao and Rouzé [Reference Gao and Rouzé31] (see also [Reference LaRacuente46]) that
for some constant $C_{\varepsilon }$ only depending on $\varepsilon $ and the index of the ergodic mean projection E. Now, suppose the condition (1.3) holds for time $t(\varepsilon )$ and find
where we apply (1.2) iteratively to the term $D(\rho \|T_{\frac {(n-1)t(\varepsilon )}{n}}(\rho ))$ . Taking the limit $n\to \infty $ , we derive the inequality
which is the differential version of (1.1) with $\alpha _1= \frac {1}{C_\varepsilon t(\varepsilon )}$ , called the modified logarithmic Sobolev inequality (in short, MLSI). The largest possible constant $\alpha _1$ in (1.1) is called the MLSI constant.
1.1 MLSI for GNS-symmetric semigroups
Many dynamics in quantum information processing are not trace symmetric. One major application of open systems is state preparation by simulating time evolution governed by a Lindbladian
A natural one is the Davies semigroup, which converges to the thermal Gibbs state $\phi =\frac {e^{-\beta H}}{{\text {tr}}(e^{-\beta H} )}$ . For any finite inverse temperature $\beta>0$ , the Davies semigroup is never trace symmetric but satisfies the following detailed balance condition
with respect to the Gibbs state $\phi $ , which we call GNS-symmetry. In this context, a breakthrough result of MLSI constant $\alpha _1$ was made by Gao and Rouzé [Reference Gao and Rouzé31] that
for every GNS symmetric semigroups in finite dimensions. Here, $\lambda (L)$ is the spectral gap of the semigroup generator L, and $C(E)=\inf \{\mu \hspace {.1cm} |\hspace {.1cm} x\le \mu E (x)\hspace {.1cm}, \hspace {.1cm} \text { for all } x\ge 0 \}$ is the Pimsner-Popa index for the condition expectation $\displaystyle E=\lim _{t\to \infty }T_t$ . An important consequence of Gao and Rouzé’s estimate (1.5) is the positivity of the complete MLSI constant $\alpha _c(L)=\inf _{n}\alpha _1(L\otimes \operatorname {\mathrm {\operatorname {id}}}_{\mathbb {M}_n})$ (in short, CMLSI constant),
because the complete Pimsner-Popa index $C_{cb}(E)=\sup _{n}C(E\otimes \operatorname {\mathrm {\operatorname {id}}}_{\mathbb {M}_n})$ is finite in finite dimensions. The CMLSI constant is of particular interest because it satisfies the tensorization property $\alpha _c(T_t\otimes S_t)=\min \{\alpha _c(T_t), \alpha _c(S_t)\}$ , while the MLSI constant $\alpha _1$ does not.
Our ‘positivity order implies entropy decay’ argument above gives an exponential improvement to (1.6) in terms of the dimension constant $C_{cb}(E)$ .
Theorem 1.1 (cf. Theorem 3.2 and 4.10).
Let $T_t:{\mathcal M}\to {\mathcal M}$ be a quantum Markov semigroup GNS-symmetric to a faithful normal state $\phi $ . Then the optimal CMLSI constant satisfies
Here, $\Psi \le _{cp}\Phi $ means $\Psi -\Phi $ is a completely positive map. Moreover, in finite dimensions,
The quantity $t_{cb}$ , called CB return time, is the mixing time in terms of complete positivity order. Similar terms of complete positivity have been also considered in the quantum setting for the study of approximate unitary t-design ([Reference Brandao, Harrow and Horodecki12]). In the fully non-ergodic noncommutative setting, $t_{cb}$ was originally introduced in [Reference Gao, Junge and LaRacuente29] via completely bounded (CB) $L_1\to L_\infty $ norm, whose connection to complete positivity order and spectral gap relies heavily on operator space theory (see Section 3.2).
The proof to GNS-symmetric cases uses the ideas of Haagerup reduction [Reference Haagerup, Junge and Xu36], a method to derive results for type III von Neumann algebras by reducing them to cases of tracial von Neumann algebras. Thanks to this machinery, our estimate in trace-symmetric settings can be salvaged to a GNS-symmetric semigroup on general $\sigma $ -finite von Neumann algebras, including both classical systems and quantum systems. A particular interesting example is a matrix version of the classical n-level death-birth process which admits an invariant state $\rho _{n}\varpropto (e^{-\beta k})_{k=0,..,n}$ and a Lindbladian given by nearest neighbor interactions. In this example, we show that both the spectral gap is are uniformly controlled, and
Hence, both estimates in our Theorem 1.1 are asymptotically tight for this GNS-symmetric example.
1.2 MLSI for matrix-valued functions
Besides the quantum setting, our results also provide interesting MLSI and concentration inequalities for random matrices of arbitrary size. For a classical Markov semigroup $P_t:L_\infty (\Omega ,\mu )\to L_\infty (\Omega ,\mu )$ on some probability space $(\Omega ,\mu )$ , the notion of CMLSI is basically a uniform MLSI for positive matrix-valued random variables $g:\Omega \to {\mathbb M}_n$ of all dimensions $n\ge 1$ ,
Here, $\mu (f)=\int fd\mu $ is the scalar valued mean, $E_\mu (g)=\int gd\mu \in {\mathbb M}_n$ is the matrix valued mean, and ${\text {tr}}$ is the standard matrix trace. In this setting, the CB return time $t_{cb}$ is simply the $L_\infty $ -mixing time
which is accessible by kernel estimates derived from harmonic analysis. As a consequence of Theorem 1.1, we obtain CMLSI for all sub-Laplacians of Hörmander system.
Theorem 1.2. Let $(M,g)$ be a compact connected Riemannian manifold without boundary, and ${\omega } d\operatorname {vol}$ be a probability measure with a smooth density $\omega $ with respect to the volume form $d\operatorname {vol}$ . Suppose $H=\{X_i\}_{i=1}^k\subset TM$ is a family of vectors fields satisfying the Hörmander’s condition that at every point $x\in M$ ,
Then the horizontal heat semigroup $P_t=e^{-\Delta _H t}$ generated by the sub-Laplacian
has CMLSI constant $\alpha _c(\Delta _H)>0$ . Here, $X_i^*$ is the adjoint operator with respect to $L_2(M, \omega d\operatorname {vol})$ .
For scalar-valued functions, the positivity of $\alpha _1(\Delta _H)$ was proved by Ługiewicz and Zegarliński [Reference Lugiewicz and Zegarliński53], using a hypercontractive argument similar from [Reference Diaconis and Saloff-Coste23]. Nevertheless, both [Reference Diaconis and Saloff-Coste23] and [Reference Lugiewicz and Zegarliński53] rely on the Rothaus Lemma [Reference Rothaus68, Reference Bakry3], a crucial step which does not apply for matrix-valued functions (see Section 3.6).
In this setting, our Theorem 1.1 gives a short proof of
for scalar-valued function, and also extends to matrix-valued setting by replacing LSI with CMLSI. A particular interesting example, also covered in [Reference Gao and Gordina28], is the Lie group $M=SU(2)$ with the canonical sub-Laplacian $\Delta _H=-X^2-Y^2$ , where the Lie algebra $\mathfrak {su}(2)$ is spanned by the Pauli matrices $X,Y$ and $Z=\frac {1}{2}[X,Y]$ . The CMLSI of heat semigroups (standard Laplacians) was obtained in [Reference Li, Junge and LaRacuente49, Reference Brannan, Gao and Junge14] using the Ricci curvature lower bound as a crucial tool. Nevertheless, in the sub-elliptic case the Ricci curvature in the degenerate direction of the vector field $H=\{X_i\}_{i=1}^k$ can be interpreted as $-\infty $ . In [Reference Gao and Gordina28], the curvature condition were substituted by a gradient estimate that was first introduced by Driver and Melcher [Reference Driver and Melcher24] for Heisenberg group, later obtained for nilpotent Lie groups [Reference Melcher54] and $SU(2)$ [Reference Baudoin and Bonnefont8]. Our Theorem 1.2 obtains CMLSI for all sub-Laplacian of Hörmander systems, without using any curvature condition. It implies the following uniform CMLSI constant for trace symmetric Lindbladians as ‘representation’ of Hörmander system on Lie groups.
Corollary 1.3. Let G be a compact Lie group and $H=\{X_1,\cdots ,X_k\}$ be a generating set of its Lie algebra $\mathfrak {g}$ . There exists a constant $\alpha _{c}(\Delta _H)>0$ such that for all unitary representation u, the induced quantum Markov semigroup generated by
satisfies $\alpha _{c}(L_H)\ge \alpha _{c}(\Delta _H)>0$ . Here, $d_u$ is the Lie algebra homomorphism induced by u.
1.3 Concentration inequalities
An important application of MLSI is to derive concentration inequalities. This was first discovered by Otto and Villani [Reference Otto and Villani61], later extended to the discrete case by Erbar and Maas [Reference Erbar and Maas26], and more recently to the noncommutative setting in [Reference Rouzé and Datta69, Reference Gao, Junge and LaRacuente29, Reference Carlen and Maas16]. As an application of our MLSI estimate for GNS-symmetric semigroups, we derive concentration inequalities for a general faithful invariant state $\phi $ . Recall that the Lipschitz semi-norm
The Lipschitz semi-norm is defined through the gradient form (or Carré du Champ operator)
Theorem 1.4. Let ${\mathcal M}$ be a $\sigma $ -finite von Neumann algebra and let $T_t=e^{-tL}$ be a GNS- $\phi $ -symmetric quantum Markov semigroup with positive MLSI constant $\alpha _{1}(L)>0$ . Then there exists a universal constant c such that for $2\le p <\infty $ ,
Moreover, for any $t>0$ , there exists a projection e such that
As a special case, we obtain the following matrix concentration inequalities which can be compared to the work of Tropp [Reference Tropp75].
Corollary 1.5. Let $S_1,\cdots ,S_n$ be an independent sequence of random $d\times d$ -matrices such that $\parallel \! S_i-\mathbb {E}S_i \! \parallel _{\infty }\le M \hspace {.1cm} , \hspace {.1cm} a.e.$ Then, we have the matrix Bernstein inequality that for the sum $Z=\sum _{k=1}^nS_k$ ,
and the matrix Chernoff bound
where
In particular, the inequality (1.10) improves the term $M\log d$ in Tropp’s result [Reference Tropp75] to $M\sqrt {\log d}$ . For more details, see Example 5.18.
After the acceptance of this paper, we get to know the sub-Gaussian type estimate (1.10) of matrix concentration was obtained by Huang and Tropp [Reference Huang and Tropp38, Reference Huang and Tropp39] via matrix-valued Poincare inequality and matrix-valued Bakry-Émery curvature condition. Actually, in the introduction of [Reference Huang and Tropp39] they raise the question whether the sub-gaussian estimate can be obtained by matrix-valued Log-Sobolev inequality. Our result answers this question.
1.4 Outline of the paper
We organize our paper as follows to make it accessible for readers from different backgrounds. In Section 2, we provide a brief review of quantum information basics in the setting of tracial von Neumann algebras. We prove our key entropy difference lemma (Lemma 2.1) and an improved data processing inequality (Theorem 2.5). Building upon these results, we discuss the functional inequalities of symmetric quantum Markov semigroups in Section 3. We prove our main Theorem 1.1 in the trace symmetric case and its consequence Theorem 1.2 for classical Markov semigroups. We also illustrate the failure of the matrix-valued logarithmic Sobolev ineuqality in Proposition 3.15. The discussion up to this point does not involve much technicality beyond the basic concepts of finite von Neumann algebras. Readers from quantum information and classical analysis are welcome to consider examples such as the matrix algebra ${\mathbb M}_n$ and matrix-valued functions $L_\infty (\Omega , {\mathbb M}_n)$ .
In Section 4, we dive into the GNS-symmetric cases. Here, we discuss the Haagerup reduction for channels and entropic quantities, deriving Theorem 1.1 (Theorem 4.10 and Corollary 4.13) in its full generality. Section 5 collects applications of our general results Theorem 1.4 and Corollary 1.5. We conclude the paper in Section 6 with some discussions on remaining open questions.
Notations. We use calligraphic letters ${\mathcal M},{\mathcal N}$ for von Neumann algebras and denote ${\mathbb M}_n$ as the algebra of $n\times n$ as complex matrices. We use $\tau $ as the trace on von Neumann algebra, and ${\text {tr}}$ as the standard matrix trace. The identity operator is denoted by $1$ , and the identity map between spaces is denoted as $\operatorname {\mathrm {\operatorname {id}}}$ , sometimes specified with subscript like $1_{\mathcal M}$ and $\operatorname {\mathrm {\operatorname {id}}}_{{\mathbb M}_n}$ . We write $ a^*$ as the adjoint element of a and $\Phi _*$ for a pre-adjoint map of $\Phi $ .
2 Entropy contraction of symmetric Markov maps
2.1 States, channels and entropies
We briefly review some basic information-theoretic concepts in the noncommutative setting. Recall that a von Neumann algebra ${\mathcal M}$ is a unital $*$ -subalgebra of $B(H)$ closed under weak $^*$ -topology. A linear functional $\phi :{\mathcal M}\to \mathbb {C}$ is called a state if it is positive, meaning $\phi (x^*x)\ge 0$ for any $x\in {\mathcal M}$ , and additionally, $\phi (1)=1$ . We say $\phi $ is normal if $\phi $ is weak $^*$ -continuous. Throughout the paper, we will only consider normal states and denote $S({\mathcal M})$ as the normal state space of ${\mathcal M}$ . We write $s(\phi )$ as the support projection of a state $\phi $ , which is the minimal projection e such that $\phi (x)=\phi (exe)\hspace {.1cm}, \forall \hspace {.1cm} x\in {\mathcal M}$ . A normal state $\phi $ is faithful if $s(\phi )=1$ . For two normal states $\rho $ and $\sigma $ , the relative entropy is defined as
where $\xi _\rho $ is a vector implementing the state $\rho $ , and $\Delta (\rho /\sigma )$ is the relative modular operator. This form of definition (2.1) was introduced by Araki [Reference Araki2] for general von Neumann algebras.
In this section, we will focus on the case that ${\mathcal M}$ is a finite von Neumann algebra. Namely, ${\mathcal M}$ is equipped with a normal faithful tracial state $\tau $ . The tracial noncommutative $L_p$ -space $L_p({\mathcal M},\tau )$ is defined as the completion of ${\mathcal M}$ with respect to the p-norm $\parallel \! a \! \parallel _{p}=\tau (|a|^{p})^{1/p}$ . We identify $L_\infty ({\mathcal M})\cong {\mathcal M}$ , and also $ L_1({\mathcal M})\cong {\mathcal M}_*$ via the trace duality
We say $\rho \in L_1({\mathcal M})$ is a density operator if $\rho \ge 0$ and $\tau (\rho )=1$ , which corresponds to a normal state in the above identification. We will often identify normal states with their density operators if no ambiguity. Via this identification, relative entropy reduces to the original definition of Umegaki [Reference Umegaki76],
provided this trace is well defined. For example, for $\rho $ and $\sigma $ in the bounded state space
the Umegaki’s formula is always well defined and finite. For this reason, we will mostly work with bounded states from $S_{B}({\mathcal M})$ and derive results for general case $S({\mathcal M})$ by approximation. When the second state $\sigma =1$ , this gives the entropy functional
Note that the standard convention of von Neumann entropy in quantum information literature is often with an additional negative sign .
We say a linear map $T:{\mathcal M}\to {\mathcal M}$ is a quantum Markov map if T is normal, unital and completely positive. Recall that T is unital if $T( 1)=1$ . The pre-adjoint map $T_*:{\mathcal M}_*\to {\mathcal M}_*$ is called a quantum channel, which sends normal states to normal states. In the tracial setting, $T_*:L_1({\mathcal M})\to L_1({\mathcal M})$ given by
is completely positive and trace-preserving (in short, CPTP). A fundamental inequality about quantum channel is the data processing inequality (also called monotonicity of relative entropy)
The data processing inequality states that two quantum states cannot become more distinguishable under a quantum channel. The data processing inequality remains valid for T being positive but not necessarily completely positive; see [Reference Müller-Hermes and Reeb56, Reference Frenkel27]. The main technical result of this work is an improved data processing inequality for quantum channels under symmetric conditions (Theorem 2.5).
2.2 Entropy contraction for unital quantum channels
We start our discussion on entropy contraction of unital quantum channels. The restriction of $\Phi $ on ${\mathcal M}$ is bounded and normal; thus, $\Phi $ can be viewed as the $L_1$ -norm extension of its restriction $\Phi :{\mathcal M}\to {\mathcal M}$ . By duality, its adjoint $\Phi ^*:{\mathcal M}\to {\mathcal M}$ is a trace-preserving quantum Markov map and hence also extends to a unital quantum channel.
For a state $\rho $ with $H(\rho )<\infty $ , we define the entropy difference of $\Phi $ ,
Non-negativity of the entropy difference $D_\Phi (\rho )\ge 0$ follows from data processing inequality (2.2) and $\Phi (1)=1$ ,
We start with the key lemma in our argument.
Lemma 2.1 (Entropy difference lemma).
Let $\Phi :L_1({\mathcal M})\to L_1({\mathcal M})$ be a unital quantum channel and $\Phi ^*$ be its adjoint. Then for two bounded states $\rho ,\omega \in {S_B}({\mathcal M})$ ,
Proof. By duality, $\Phi ^*$ is also completely positive unital. Then,
where (2) follows from the monotonicity of relative entropy. The inequality (1) uses the operator concavity [Reference Choi18] of logarithm function $t\mapsto \ln t$ that for any positive operator $x\ge 0$ ,
This proves the first inequality. For the second part, it suffices to notice that
where we use the operator concavity $\Phi (\ln x)\le \ln \Phi (x)$ again.
We iterate the above lemma as follows:
Then a relevant question is what would be the limit of $(\Phi ^* \Phi )^n(\rho )$ as $n \to \infty $ . This leads to the multiplicative domain of $\Phi $ . Recall that the multiplicative domain of a unital completely positive map $\Phi $ is
When $\Phi $ is normal, ${\mathcal N}_\Phi \subset \mathcal {M}$ is a von Neumann subalgebra ([Reference Luczak52, Theorem 1]). A linear map ${E:{\mathcal M}\to {\mathcal M}}$ is called a conditional expectation if E is a unital completely positive map and idempotent $E\circ E=E$ . When ${\mathcal M}$ is a finite von Neumann algebra, for any subalgebra ${\mathcal N}\subset {\mathcal M}$ , there always exists a (unique) trace-preserving conditional expectation E onto ${\mathcal N}$ such that
Such E is a unital quantum channel.
Proposition 2.2. Let $\Phi :L_1({\mathcal M})\to L_1({\mathcal M})$ be a unital quantum channel, and let $E:{\mathcal M}\to {\mathcal N}$ be the trace-preserving conditional expectation onto the multiplicative domain ${\mathcal N}:={\mathcal N}_\Phi $ . Then
-
i) $\Phi :{\mathcal N}\to \Phi ({\mathcal N})$ is a $*$ -isomorphism with inverse $\Phi ^*:\Phi ({\mathcal N})\to {\mathcal N}$ . Moreover, $\Phi ({\mathcal N})$ is the multiplicative domain for $\Phi ^*$ , and
(2.4) $$ \begin{align} (\Phi^* \Phi)\circ E=E\circ (\Phi^* \Phi)=E, \hspace{.1cm} E_0\circ \Phi=\Phi\circ E, \end{align} $$where $E_0:{\mathcal M}\to \Phi ({\mathcal N})$ is the trace-preserving conditional expectation onto $\Phi ({\mathcal N})$ . -
ii) $\Phi $ is an isometry on $L_2({\mathcal N})$ . If, in addition, $\parallel \! \Phi (\operatorname {\mathrm {\operatorname {id}}}-E):L_2({\mathcal M})\to L_2({\mathcal M}) \! \parallel _{2}<1$ , then ${E=\lim _{n}(\Phi ^* \Phi )^n}$ as a map from $L_2({\mathcal M})$ to $L_2({\mathcal M})$ .
Proof. It is clear that $\Phi $ is a $*$ -homomorphism on ${\mathcal N}$ . For any $x,y\in L_2({\mathcal N})\subset L_2({\mathcal M})$ ,
Thus, $\Phi ^*\circ \Phi |_{{\mathcal N}}=\operatorname {\mathrm {\operatorname {id}}}_{\mathcal N}$ is the identity map. This verifies $(\Phi ^* \Phi )\circ E=E$ . Since $E^*=E$ , $E\circ (\Phi ^* \Phi )=E$ follows from taking the adjoint. Thus, $\Phi :{\mathcal N}\to \Phi ({\mathcal N})$ is a $*$ -isomorphism with inverse $\Phi ^*$ . Denoting ${\mathcal N}_0$ as the multiplicative domain for $\Phi ^*$ , we have $\Phi ({\mathcal N})\subset {\mathcal N}_0$ . Conversely, we also have $\Phi ^*(N_0)\subset {\mathcal N}$ by switching the role of $\Phi =(\Phi ^*)_*$ . Then $\Phi ({\mathcal N})={\mathcal N}_0$ since $\Phi $ is bijective on ${\mathcal N}$ . For ii), we note that by (2.4),
Therefore,
which goes to $0$ as $n\to \infty $ .
In order to estimate entropic quantities, we will use the approximation in terms of complete positivity. Recall that for a density operator $\sigma \in S({\mathcal M})$ with full support, the Bogoliubov-Kubo-Mori (BKM) metric for $X\in {\mathcal M}$ is defined by
The BKM metric is a Riemannian metric on the space of states with full support that is monotone under any quantum channel $\Psi $ ,
It connects to the relative entropy as follows ([Reference Gao and Rouzé31, Lemma 2.2]):
where $\rho _t=t\rho +(1-t) \sigma $ for $t\in [0,1]$ . It is proved in [Reference Gao and Rouzé31, Lemma 2.1 & 2.2] that if $\rho \le c\sigma $ ,
where $k(c)=\frac {c\ln c-c+1}{(c-1)^2}$ . The above discussion remains valid if $s(\rho )\le s(\sigma )$ and $X\in s(\sigma ){\mathcal M} s(\sigma )$ . For two positive maps $\Psi $ and $\Phi $ , we write $\Phi \le \Psi $ if $\Psi -\Phi $ is positive.
Lemma 2.3. Let E be a conditional expectation (not necessarily trace-preserving) and $\Psi $ be a quantum Markov map such that
Assume that $E\circ \Psi =E$ . Then for any $\rho \in S({\mathcal M})$ ,
In particular, for $\varepsilon =\frac {1}{10}$ ,
Proof. By assumption, $\Psi _*=(1-\varepsilon )E_*+\varepsilon \Psi _0$ for some unital positive map $\Psi _0\le 2E_*$ . We denote $\sigma =E_*(\rho ), \tilde {\sigma }=\Phi _*(\rho )$ and $\omega =\Psi _0(\rho )$ . Then $\tilde {\sigma }=(1-\varepsilon )\sigma +\varepsilon \omega $ . Note that for any bounded state $\sigma \in S_B({\mathcal M})$ , $X\mapsto \sqrt {\gamma _\sigma (X)}$ is a Hilbert space norm. Then by the triangle inequality,
where $\gamma $ can be $\gamma _\phi $ for any bounded state $\phi \in S_B({\mathcal M})$ . Then
Now take $\rho _t=t\rho +(1-t)\sigma $ and $\tilde {\rho }_t=t\rho +(1-t)\tilde {\sigma }$ ,
For the first term, because $\tilde {\rho }_t\le (1+\varepsilon )\rho _t$ ,
For the second term, consider that $\tilde {\rho }_t\ge (1-\varepsilon )(1-t)\sigma $ ,
Here, the inequality (1) above uses $\omega \le 2\sigma $ and (2.6). The inequality (2) above follows from the monotonicity of relative entropy and the fact $\Psi _*(\sigma )=\sigma $ . Combining the estimated above, we obtained
where $k(2)=2\ln 2-1$ . The above inequality is nontrivial for $\varepsilon $ such that
Taking $\varepsilon =0.1$ , the above expression is approximately $0.53>\frac {1}{2}$ .
Remark 2.4. This lemma is related to [Reference LaRacuente47, Corollary 2.15] and is a variant of [Reference Gao and Rouzé31, Theorem 5.3], which proves for GNS symmetric $\Phi $ ,
Compared with [Reference LaRacuente47, Corollary 2.15], the above Lemma assumes a simpler condition and may achieve a stronger constant in certain regimes of interest. The above Lemma improves (2.7) from two points: 1) does not need any symmetric assumption; 2) remove the square in $\Phi _*^2$ . When $\Psi _*=\Phi _*^2$ is a square, (2.7) could yield better bound that for $\varepsilon =0.4$ ,
Putting the above lemma together, we obtain the following entropy contraction of unital quantum channels.
Theorem 2.5. Let $\Phi $ be a unital quantum channel and let $E:{\mathcal M}\to {\mathcal N}$ be the trace-preserving conditional expectation onto the multiplicative domain $\mathcal {N}$ of $\Phi $ . Define the CB return time
Then for any state $\rho \in S({\mathcal M})$ ,
Furthermore, for any finite von Neumann algebra $\mathcal {Q}$ and state $\rho \in S({\mathcal M} \overline {\otimes }\mathcal {Q}),$
Proof. It suffices to consider a bounded state $\rho \in S_B({\mathcal M})$ . Note that by the conditional expectation property (2.3),
where we used the property $\Phi \circ E=E_0 \circ \Phi $ from Proposition 2.2. Moreover, $H(E(\rho ))=H(\Phi \circ E(\rho ))$ as $\Phi $ is a trace-preserving $*$ -isomorphism on ${\mathcal N}$ . Thus, we have
Iterating the entropy difference Lemma 2.1, we have
Now using Lemma 2.3, for $k=k_{cb}(\Phi )$ ,
Rearranging the terms gives the assertion. The general case $\rho \in S({\mathcal M})$ can be obtained via approximation $\rho _\varepsilon =(1-\varepsilon )\rho +\varepsilon 1$ as [Reference Brannan, Gao and Junge14, Lemma A.2]. The same argument applies to $\operatorname {\mathrm {\operatorname {id}}}_{\mathcal Q}\otimes \Phi $ , because the CB return time $k_{cb}(\operatorname {\mathrm {\operatorname {id}}}_{\mathcal Q}\otimes \Phi )=k_{cb}(\Phi )$ is same as of $\Phi $ by the definition.
The above theorem is an improved data processing inequality for the relative entropy between a state $\rho $ and its conditional expectation $E(\rho )$ . Here, ${\mathcal N}$ is the ‘decoherence free’ subalgebra. Indeed, for any two states $\sigma _1,\sigma _2\in {\mathcal N},$
does not decay. Outside the ‘decoherence free’ subalgebra ${\mathcal N}$ , the relative entropy from a state $\rho $ to its projection $E(\rho )$ on ${\mathcal N}$ is strictly contractive under every use of the channel $\Phi $ .
For $\Phi $ being a symmetric quantum Markov map, we have $\Phi =\Phi _*$ . Moreover, Proposition 2.2 reduces to
Then
We can iterate the entropy contraction above and obtain the discrete time entropy decay,
3 Complete modified log-Sobolev inequality for symmetric Markov semigroups
3.1 Functional inequalities
In this section, we discuss a continuous time relative entropy decay for symmetric quantum Markov semigroups. We first review some basics of quantum Markov semigroups. A quantum Markov semigroup $(T_t)_{t\ge 0}:{\mathcal M}\to {\mathcal M}$ is a family of maps satisfying
-
i) for each $t\ge 0$ , $T_t$ is a quantum Markov map (i.e., normal, completely positive and unital)
-
ii) $T_0=\operatorname {\mathrm {\operatorname {id}}}_{\mathcal M}$ and $T_s\circ T_t=T_{s+t}$ for any $s,t \ge 0$ .
-
iii) for $x\in {\mathcal M}$ , $t\mapsto T_t(x)$ is weak $^*$ -continuous.
The generator of the semigroup is defined as
on the domain of L that the limit exists. In this section, we still consider ${\mathcal M}$ as a finite von Neumann algebra equipped with a normal faithful tracial state $\tau $ . Given $(T_t)_{t\ge 0}$ is symmetric (or more specifically, trace-symmetric), that is,
the generator L is a positive, symmetric operator, densely defined on $L_2({\mathcal M})$ . Its kernel is the fixed-point subspace ${\mathcal N}:=\ker (L)=\{x\in {\mathcal M}\hspace {.1cm} | \hspace {.1cm} T_t(x)=x, \forall t\ge 0\}$ , which coincides with the common multiplicative domain of all $T_t$ – hence a von Neumann subalgebra. Moreover, each $T_t$ is an ${\mathcal N}$ -bimodule map
In particular, we have
where $E:{\mathcal M}\to {\mathcal N}$ is the trace-preserving conditional expectation onto the fixpoint algebra ${\mathcal N}$ . We say $(T_t)$ is ergodic if ${\mathcal N}={\mathbb C} 1$ is trivial. Note that in the mathematical physics literature, it is common to use primitive instead of ergodic. In this case, the semigroup admits a unique invariant state – namely, the trace $\tau $ up to normalization. We will consider symmetric quantum Markov semigroups that are not necessarily ergodic.
Recall that a semigroup is equivalently determined by its Dirichlet form
We write $\operatorname {\mathrm {\operatorname {dom}}} (L)$ for the domain of L and $\operatorname {\mathrm {\operatorname {dom}}} ({\mathcal E})$ for the domain of ${\mathcal E}$ . The Dirichlet subalgebra is defined as ${\mathcal A}_{\mathcal E}:=\operatorname {\mathrm {\operatorname {dom}}} ({\mathcal E})\cap {\mathcal M}$ . It was proved [Reference Davies and Lindsay21] that ${\mathcal A}_{\mathcal E}$ is a dense $*$ -subalgebra of ${\mathcal M}$ and a core of $L^{1/2}$ . We denote by
the set of bounded density operators from ${\mathcal A}_{\mathcal E}$ . We now introduce the formal definitions of functional inequalities for quantum Markov semigroups.
Definition 3.1. Let $T_t=e^{-Lt}:{\mathcal M}\to {\mathcal M}$ be a symmetric quantum Markov semigroup and $E:{\mathcal M}\to {\mathcal N}$ be the trace-preserving conditional expectation onto its fixed point space. We say $T_t$ satisfies
-
i) the Poincaré inequality (PI) for $\lambda>0$ if
(3.1) $$ \begin{align} \lambda\parallel \! x-E(x) \! \parallel_{2}^2\,{\le}\, {\mathcal E}(x,x), \hspace{.1cm} \forall x\in {\mathcal A}_{\mathcal E}, \end{align} $$ -
ii) the log-Sobolev inequality (LSI) for $\alpha>0$ if
(3.2) $$ \begin{align} \alpha \tau\big(|x|^2\ln |x^2|-E(|x^2|)\ln E(|x^2|)\big)\le 2{\mathcal E}(x,x), \hspace{.1cm} \forall x\in {\mathcal A}_{\mathcal E}, \end{align} $$ -
iii) the modified log-Sobolev inequality (MLSI) for $\alpha>0$ if
(3.3) $$ \begin{align} 2\alpha D(\rho||E(\rho))\le {\mathcal E}(\rho,\ln\rho), \hspace{.1cm} \forall \rho\in S_B({\mathcal A}_{\mathcal E}), \end{align} $$ -
iv) the complete modified log-Sobolev inequality (CMLSI) for $\alpha>0$ if $\operatorname {\mathrm {\operatorname {id}}}_{\mathcal Q}\otimes T_t$ satisfies $\alpha $ -MLSI inequality for any finite von Neumann algebra ${\mathcal Q}$ .
The optimal (largest possible) constants for PI, LSI, MLSI and CMLSI will be denoted respectively as $\lambda (L), \alpha _2(L),\alpha _{1}(L)$ and $\alpha _{c}(L)$ (or $\lambda , \alpha _2,\alpha _1$ and $\alpha _{c}$ in short if the generator is clear).
The Poincaré inequality (3.1) is equivalent to the spectral gap of L as a positive operator. LSI (3.2) is equivalent to hypercontractivity [Reference Olkiewicz and Zegarlinski60]
MLSI (3.3) is known to be equivalent to the exponential decay of relative entropy ([Reference Bardet5, Theorem 3.2] and [Reference Brannan, Gao and Junge14, Proposition A.3]) that
The equivalence of (3.3) and (3.5) is obtained by differentiating the relative entropy for $T_t$ at $0$ , which leads to the entropy production on the R.H.S of MLSI
It is well known that
The main motivation to consider CMLSI over MLSI and LSI is the tensorization property
which in the quantum cases fails for $\alpha _1$ [Reference Brannan, Gao and Junge14, Section 4.4], and is only known to hold for $\alpha _2$ for limited examples in small dimensions. The main result of this section is Theorem 1.1, which asserts a lower bound
by the inverse of CB return time
Here, we set $\varepsilon =0.1$ for the notation $t_{cb}(\varepsilon )$ in Theorem 1.1 because of Lemma 2.3.
Theorem 3.2. Let $T_t=e^{-Lt}:{\mathcal M}\to {\mathcal M}$ be a symmetric quantum Markov semigroup and $E:{\mathcal M} \to {\mathcal N}$ be the trace-preserving conditional expectation onto the fixed point subalgebra ${\mathcal N}$ . Define the CB return time as
Then
Proof. Let $t_m=t_{cb}(L)/2m$ for some $m\in \mathbb {N}_+$ . Since $T_t$ is symmetric, $T_{t_m}^*T_{t_m}=T_{t_m}T_{t_m}=T_{2t_m}$ . Hence, $T_{t_m}$ has discrete return time $k_{cb}(T_{t_m})= m$ . By the Lemma 2.1, for any $\rho \in S_B({\mathcal M})$ ,
Write $t_{cb}=t_{cb}(L)$ . Now assume further $\rho \in \cup _{t>0} T_t({\mathcal M})\subset \operatorname {\mathrm {\operatorname {dom}}} (L)$ . We have by Theorem 2.5,
The entropy decay for general $\rho \in S({\mathcal M})$ can be obtained by approximation as in the Appendix [Reference Brannan, Gao and Junge14, Appendix]). This proves $\alpha _{1}(L)\ge \frac {1}{2t_{cb}(L)}$ . The same argument applies to $\operatorname {\mathrm {\operatorname {id}}}_{\mathcal Q}\otimes L$ yields the assertion $\alpha _{c}(L)\ge \frac {1}{2t_{cb}(L)}$ .
Remark 3.3. a) For LSI constant $\alpha _2$ , the $\Omega (\frac {1}{t_{cb}})$ lower bounds were obtained for ergodic semigroups in both classical [Reference Diaconis and Saloff-Coste23] and quantum setting [Reference Temme, Pastawski and Kastoryano74]. These bounds as well as our bound for $\alpha _c$ are asymptotic tight (See Example 5.6 and Section 5.3).
b) In [Reference Brannan, Gao and Junge14], a similar estimate $\alpha _c\ge \Omega (\frac {1}{t_{cb}})$ was obtained for semigroups that admits non-negative entropic Ricci curvature lower bound. The entropy Ricci curvature lower bound for quantum Markov semigroup was introduced by Carlen and Mass [Reference Carlen and Maas15] using $\lambda $ -displacement convexity of entropy functionals H w.r.t to certain noncommutative Wasserstein distance, inspired from Lott and Villani [Reference Lott and Villani51], and Sturm’s [Reference Sturm71] work on metric measure spaces. For heat semigroups on Riemmannian manifold, the entropy Ricci curvature lower bound follows from a lower bound of the Ricci curvature tensor. Nevertheless, in the noncommutative case, these entropy Ricci curvature lower bounds for quantum Markov semigroup are in general hard to verify. So far, most examples rely on certain interwining relation $\nabla T_t= e^{-\lambda t}\tilde {T}_t \nabla $ between the semigroup $T_t$ and a gradient operator $\nabla $ (see [Reference Carlen and Maas15, Reference Brannan, Gao and Junge13, Reference Wirth and Zhang81]).
c) Our Theorem 1.1 here does not rely on any curvature conditions, which uses only information theoretic tools such as entropic quantities and inequalities. To the best of our knowledge, this direct proof is even novel in the classical setting. It is worth pointing out that the definition of relative entropy as well as its exponential decay of relative entropy is independent of the choice of the trace, which also shows the naturalness of our approach and the extension to non-tracial von Neumann algebras in Section 4.
3.2 CB return time
We now consider a common scenario where the CB return time $t_{cb}$ is finite. The original motivation for the notion, despite defining using CP (completely positive) order (3.7), is the following characterization using CB (completely bounded) norm. Recall that a linear map $\Psi :{\mathcal M}\to {\mathcal M}$ is called a ${\mathcal N}$ -bimodule map if
Proposition 3.4. Let ${\mathcal N}\subset {\mathcal M}$ be a subalgebra and $E:{\mathcal M}\to {\mathcal N}$ be the trace-preserving conditional expectation. Let $\Psi :{\mathcal M}\to {\mathcal M}$ be an ${\mathcal N}$ -bimodule $*$ -preserving map. For any $\varepsilon>0$ , the following two conditions are equivalent:
-
i) $(1-\varepsilon ) E\le _{cp} \Psi \le _{cp} (1+\varepsilon ) E\hspace {.1cm} $ ;
-
ii) $\parallel \! \Psi -E:L_\infty ^1({\mathcal N}\subset {\mathcal M})\to L_\infty ({\mathcal M}) \! \parallel _{cb}\le \varepsilon $ .
The condition ii) above is the completely bounded norm from the space $L_\infty ^1({\mathcal N}\subset {\mathcal M})$ to ${\mathcal M}$ . $L_\infty ^1({\mathcal N}\subset {\mathcal M})$ is called a conditional $L_\infty $ space, defined as the completion of ${\mathcal M}$ with respect to the norm
where the supremum takes over all $a,b\in L_2({\mathcal N})$ with $\|a\|_2=\|b\|_2=1$ . The operator space structure of $L_\infty ^1({\mathcal N}\subset {\mathcal M})$ is given by the identification
(see [Reference Junge and Parcet42] and [Reference Gao, Junge and LaRacuente30, Appendix]). Proposition 3.4 is relatively self-evident in the ergodic case ${\mathcal N}=\mathbb {C}1$ , $L_\infty ^1({\mathcal N}\subset {\mathcal M})\cong L_1({\mathcal M})$ , which we illustrate below.
Example 3.5 (Classical case).
Let $(\Omega ,\mu )$ be a probability space. Let $P:L_\infty (\Omega )\to L_\infty (\Omega )$ be a linear map with kernel $P(f)(x)=\int _{\Omega }k(x,y)f(y)d\mu (y)$ . It is clear that P is $*$ -preserving (i.e., $P(\bar {f})=\overline {P(f)}$ if k is real); P is a positive map if and only if the kernel function $k\ge 0$ . Recall the expectation map
where $\textbf {1}$ is the unit constant function. The kernel of $E_\mu $ is the constant function $ \textbf {1}$ on the product space $\Omega \times \Omega $ . The following equivalence is self-evident:
To see the equivalence in terms of complete positivity and completely bounded norm in Proposition 3.4, it suffices to notice that every positive (resp. bounded) map to $L_\infty (\Omega )$ is automatically completely positive (resp. completely bounded with same norm [Reference Smith70]).
Example 3.6 (Quantum case).
The above argument also applies to the noncommutative ergodic case ${\mathcal N}=\mathbb {C}1\subset {\mathcal M}$ . The correspondence between the map P and its kernel k generalizes to the isomorphism between the map T and its Choi operator $C_T\in {\mathcal M}^{op}\overline {\otimes }{\mathcal M}$
where ${\mathcal M}^{op}$ is the opposite algebra of ${\mathcal M}$ . The isomorphism $T\mapsto C_T$ is not only positivity-preserving by Choi’s Theorem ( T is CP iff $C_T\ge 0$ ), but also isometric by Effros-Ruan Theorem (see [Reference Effros and Ruan25, Reference Blecher and Paulsen10]),
For the general case of a ${\mathcal N}$ -bimodule map T with a nontrivial ${\mathcal N}$ , the above isomorphism holds with more involved module Choi operator, which we refer to the discussion in Section 4.3 and also [Reference Bardet, Capel, Gao, Lucia, Pérez-García and Rouzé6, Lemma 5.1] and [Reference Gao, Junge and LaRacuente29, Lemma 3.15] for the complete proof of Proposition 3.4.
With Proposition 3.4, the CB-return time can be equivalently defined as
It is known that $t_{cb}$ is finite whenever $T_t$ satisfies the Poincaré inequality and one-point ultra-contractive estimate.
Proposition 3.7. Let $T_t:{\mathcal M}\to {\mathcal M}$ be a symmetric quantum Markov semigroup and $E:{\mathcal M}\to {\mathcal N}$ be the trace-preserving conditional expectation onto the fixed point space. Suppose
-
i) $T_t$ satisfies the Poincaré inequality: $\lambda>0$ such that $\parallel \! T_{t}-E: L_2({\mathcal M})\to L_2({\mathcal M}) \! \parallel _{}\le e^{-\lambda t}\hspace {.1cm} ,\hspace {.1cm} \forall t\ge 0$ ;
-
ii) There exists $t_0\ge 0$ such that $\parallel \! T_{t_0}: L_\infty ^1({\mathcal N}\subset {\mathcal M})\to L_\infty ({\mathcal M}) \! \parallel _{cb}\le C_0$ .
Then $ t_{cb}\hspace {.1cm} \le \hspace {.1cm} \frac {1}{\lambda }\ln (10 C_0)+t_0$ .
Proof. This is now a standard argument similar to [Reference Brannan, Gao and Junge14, Proposition 3.8] and [Reference Gao and Rouzé31, Lemma B.1].
Remark 3.8. For the special case of $t_0=0$ and $T_0=\operatorname {\mathrm {\operatorname {id}}}:{\mathcal M}\to {\mathcal M}$ , we consider
$C_{cb}(E)$ was introduced in [Reference Gao, Junge and LaRacuente30] as the complete bounded version of Popa and Pimsner’s subalgebra index [Reference Pimsner and Popa67],
When ${\mathcal M}$ is finite dimensional, both the index $C(E)$ and $C_{cb}(E)$ are finite and admit the explicit formula [Reference Pimsner and Popa67, Theorem 6.1]. In this case, one can take $t_0=0$ in above Proposition 3.7 and yields
3.3 Classical Markov semigroups
In the remainder of this section, we focus on applications toward classical Markov map. We postpone the discussion of truly noncommutative semigroups to Section 4. Let $T_t=e^{-Lt}:L_\infty (\Omega ,\mu )\to L_\infty (\Omega ,\mu )$ be an ergodic Markov semigroup symmetric to the probability measure $\mu $ . Note that in the ergodic case $L_\infty ^1(\mathbb {C}1\subset L_\infty (\Omega ) )=L_1(\Omega ,\mu )$ , and by Smith’s lemma [Reference Smith70], any bounded map $T:L_1(\Omega ,\mu )\to L_\infty (\Omega ,\mu )$ is automatic completely bounded
Then the CB return time $t_{cb}$ reduces to the standard $L_\infty $ -mixing time
Then by a combination of Theorem 1.1 and Proposition 3.7, we obtain the following theorem.
Theorem 3.9. Let $T_t=e^{-tL}:L_\infty (\Omega ,\mu )\to L_\infty (\Omega ,\mu )$ be an ergodic Markov semigroup symmetric to the probability measure $\mu $ . Suppose
-
i) $T_t$ satisfies $\lambda $ -Poincaré inequality for some $\lambda>0$ : for $f\in \operatorname {\mathrm {\operatorname {dom}}}(L^{1/2})$ ,
(3.10) $$ \begin{align} \lambda\mu(|f-E_\mu(f)|^2)\le \int f(Lf)d\mu .\end{align} $$ -
ii) There exists $t_0>0$ such that
(3.11) $$ \begin{align}\parallel \! T_{t_0}: L_1(\Omega,\mu)\to L_\infty(\Omega,\mu) \! \parallel_{}\le C_0.\end{align} $$
Then
This result can be compared to the bound of Diaconis and Saloff-Coste [Reference Diaconis and Saloff-Coste23, Theoem 3.10], which statesFootnote 1
In particular, $\alpha _1\ge \alpha _{2}\ge \frac {2}{t_b(e^{-2})}$ for the alternative $L_\infty $ -mixing time
By the comparison $e^{-3}<0.1<e^{-2}$ , we have $t_b(e^{-2})\le t_{b}(0.1)\le \frac {3}{2}t_b(e^{-2})$ . Hence, in terms of lower bound for $\alpha _1$ , (3.13) and (3.12) are equivalent up to absolute constants. The difference is that (3.13) lower bounds the LSI constant $\alpha _2$ and our estimate (3.12) bounds the CMLSI constant $\alpha _{c}$ .
For finite Markov chains with $|\Omega |<\infty $ , we have finite index
where $\mu $ is a strictly positive probability density function. It was proved in [Reference Diaconis and Saloff-Coste23] that
Combined with our Theorem 1.1, we obtain the following:
Corollary 3.10. For a finite Markov chain $T_t:l_\infty (\Omega ,\mu )\to l_\infty (\Omega ,\mu )$ symmetric to $\mu $ ,
Proof. Note that $t_b(e^{-2})\le t_{cb}(0.1)\le \frac {3}{2}t_b(e^{-2})$ . Then by Theorem 1.1 and (3.15),
The other lower bound $\alpha _c\ge \frac {\lambda }{2\log (10\parallel \! \mu ^{-1} \! \parallel _{\infty })}$ follows from Theorem 3.9 by choosing $t_0=0$ .
Example 3.11. When $\Omega $ is not finite, here is a simple example with $\alpha _c(L)>0$ , but the ultra-contractivity (3.11) is never satisfied for finite $t_0$ . Take $L=I-E_\mu $ . It generates the so-called depolarizing semigroup
where $\textbf {1}$ is the unit constant function. Then for any $t<\infty $ ,
which is infinite whenever $L_{\infty }(\Omega ,\mu )$ is infinite dimensional. However, it follows from direct calculation that $\alpha _{c}(I-E_\mu )\geq \frac {1}{2}$ .
3.4 Hörmander system
We now discuss the application to Markov semigroups on smooth manifolds generated by sub-Laplacians. Let $(M,g)$ be a d-dimensional compact connected Riemannian manifold without boundary and let $d\mu =\omega d\operatorname {vol}$ be a probability measure with smooth density $\omega $ w.r.t the volume form $d\operatorname {vol}$ . A family of vector fields $H=\{X_i\}_{i=1}^k\subset TM$ with $k\leqslant d$ is called a Hörmander system if at every point $x\in M$ , the tangent space at x can be spanned by the iterated Lie brackets of $X_i$ ’s
By compactness, we can assume there is a global constant $l_H$ such that for every point $x\in M$ , we need at most $l_H$ th iterated Lie bracket in above expression (also called strong Hörmander condition). Denote $\nabla =(X_1, \cdots , X_k)$ and by $X_i^*$ the adjoint of $X_i$ on $L^{2}(M,d\mu )$ . Under the Hörmander condition, the sub-Laplacian
is a symmetric operator on $L^{2}(M,d\mu )$ which generates an ergodic Markov semigroup $P_t=e^{-\Delta _H t}$ , often called the horizontal heat semigroup. Here, $\operatorname {div}_{\mu }(X)$ is the divergence of X w.r.t to $\mu $ . When $H=\{X_i\}_{i=1}^d$ forms an orthonormal frame to the Riemannian metric, $\Delta _H=\Delta $ recovers the (weighted) Laplace-Beltrami operator and $P_t=e^{-\Delta t}$ is the (weighted) heat semigroup on M.
The gradient form (Carré du Champ operator) of $\Delta _H$ is given by
It follows from the product rule of derivatives that $\Gamma $ is diffusive (i.e., $\Gamma (fg,h)=f\Gamma (g,h)+g\Gamma (f,h)$ ). For diffusion semigroups, it is known [Reference Bakry, Gentil and Ledoux4, Theorem 5.2.1] that the MLSI constant $\alpha _1$ and the LSI constant $\alpha _2$ coincide (i.e., $\alpha :=\alpha _1=\alpha _2$ ). The positivity
for any Hörmander’s system $H=\{X_i\}_{i=1}^k$ on a compact connected Riemannian manifold without boundary was proved in [Reference Lugiewicz and Zegarliński53, Theorem 3.1]. Our Theorem 1.2 improves this to $\alpha _c(\Delta _H)>0$ .
Proof of Theorem 1.2.
Recall the following Sobolev-type inequality (see, for example, [Reference Lugiewicz and Zegarliński53, Lemma 2.1]):
where $q=\frac {2dl_H}{dl_H-2}>2$ and $l_H$ is globoal Lie bracket length needed in the strong Hörmander condition. By Varopoulos’ Theorem (see [Reference Varopoulos, Saloff-Coste and Coulhon77, Chapter 2]) on the dimension of semigroups, this implies the following ultra-contractive estimate:
where $m=dl_{H}$ . Also, it was proved in [Reference Lugiewicz and Zegarliński53, Theorem 2.3] that $\Delta _H$ satisfies the Poincaré inequality: $\lambda (\Delta _H)>0$ . Combining these with our Theorem 3.9 yields the assertion.
The Sobolev-type inequality (3.16) is also used in [Reference Lugiewicz and Zegarliński53] by Lugiewicz and Zegarlínski to prove that $\alpha _2(\Delta _H)>0$ . Their proof relies on the Rothaus lemma, and so does the discrete case by Diaconis and Saloff-Coste [Reference Diaconis and Saloff-Coste23]. However, we will see in Section 3.6 that this approach is out of scope for showing the CMLSI constant $\alpha _c(\Delta _H)>0$ .
Example 3.12. The special unitary group $\operatorname {SU}\left ( 2 \right )$ is
where $X,Y,Z$ are the skew-Hermitian Pauli unitary
The Lie algebra is $\mathfrak {su}(2)=\operatorname {span}_{\mathbb {R}}\{X,Y,Z\}$ with Lie bracket rules as
The canonical sub-Riemannian structure is given by $H=\{X,Y\}$ , which is a generating set of $\mathfrak {g}$ because $[X,Y]=2Z$ . The associated sub-Laplacian is
The semigroup $P_t=e^{-\Delta _H t}$ on $\operatorname {SU}\left ( 2 \right )$ has been studied as a prototype of horizontal heat semigroups. In particular, Baudoin and Bonnefont in [Reference Baudoin and Bonnefont8] proved that
for some constant $C>0$ . In [Reference Gao and Rouzé31], Gao and Gordina based on (3.20) proved the CMLSI constant that
The gradient estimate (3.20), as a weaker variant of Bakry-Emery curvature dimension condition, has been found useful to derive CMLSI in [Reference Gao and Rouzé31]. Nevertheless, this weaker gradient estimate is only known for only a limited number of examples in the sub-Riemannian setting [Reference Driver and Melcher24, Reference Melcher54]. Our result avoids this condition and obtains CMLSI for general Hörmander systems.
Example 3.13. Let $n\ge 3$ . The special unitary group $\operatorname {SU}\left ( n \right )$ is
The Lie algebra $\mathfrak {su}(n)$ is the space of all the skew-Hermitian matrices, and a natural basis $\mathfrak {su}(n)$ is given by $\{X_{j,k},Y_{j,k},Z_{k} \hspace {.1cm} | \hspace {.1cm} 1\le j< k\le n\}$ where
which is $n^2-1$ dimensional. Let $V=\{1,\cdots , n\}$ be a vertex set and $E\subset V\times V$ as an edge set. The set
is a generating set if and only if $(V,E)$ is a connected graph. The associated sub-Laplacian
is a generalization of (3.19). Theorem (1.2) implies that $\alpha _c(\Delta _E)>0$ for all connected $(V,E)$ , despite the gradient estimate (3.20) is not known for this type of generator.
3.5 Transference semigroups
Let us discuss an immediate application of $\alpha _c(\Delta _H)>0$ to symmetric Quantum Markov semigroups. Let G be a compact Lie group and $H=\{X_1,\cdots ,X_k\}$ be a generating set of its Lie algebra $\mathfrak {g}$ . Then $\{X_1,\cdots ,X_k\}$ satisfies the Hörmander condition, and its sub-Laplacian $\Delta _H=-\sum _{k}X_i^2$ generates a Markov semigroup $P_t=e^{-\Delta _H t}$ symmetric to the Haar measure. Let $u:G\to {\mathbb M}_n$ be a finite dimensional unitary representation and $d_u:\mathfrak {g}\to i({\mathbb M}_n)_{s.a.}$ be the corresponding Lie algebra morphism. $P_t=e^{-\Delta _H t}$ induces a quantum Markov semigroup $T_t=e^{-L_Ht}:{\mathbb M}_m\to {\mathbb M}_m$ with generator in the Lindbladian form [Reference Lindblad50],
$T_t$ is called a transference semigroup of $P_t$ by the following commuting diagram:
where the transference map $\pi _u$ is a $*$ -endomorphism given by
which embeds ${\mathbb M}_m$ into $L^{\infty }(G, {\mathbb M}_m)$ . Then the quantum semigroup $T_t$ is the restriction of the matrix-valued extension of classical semigroup $P_t\otimes \operatorname {\mathrm {\operatorname {id}}}_{{\mathbb M}_m}$ on the image of $\pi ({\mathbb M}_m)$ . Such a transference relation holds fro any unitary representation. We obtain the following dimension-free estimates both spectral gap and CMLSI constant (see [Reference Gao, Junge and LaRacuente29, Section 4]):
which are independent of the choice of the unitary representation. Then Corollary 1.3 follows immediately from Theorem 1.2.
3.6 Failure of matrix valued log-Sobolev inequality
As mentioned above, a standard analysis approach to MLSI through hypercontractivity or LSI relies on the Rothaus Lemma (see, for example, [Reference Rothaus68, Reference Bakry3])
Here, we show that the Rothaus Lemma, LSI and hypercontractivity all fail for matrix-valued functions for any classical Markov semigroups. This is a strong indication that the approach by Diaconis and Saloff-Coste’s hypercontractive [Reference Diaconis and Saloff-Coste23] estimate (also used in [Reference Lugiewicz and Zegarliński53]) cannot be used in proving lower bounds for the CMLSI constants.
The following lemma calculates the derivatives of the entropy functional $H(\rho )=\tau (\rho \log \rho )$ . Recall the BKM metric of a operator $X\in {\mathcal M}$ at a base state $\rho $ is
Lemma 3.14. Let $t\mapsto \rho _t \in S_B({\mathcal M}), t\in (a,b)$ be a smooth family of bounded density operator. Define the function $F(t)=H(\rho _t)=\tau (\rho _t\log \rho _t)$ . Then
where $\rho _t'$ and $\rho _t"$ are the first and second order derivative of $\rho _t$ .
Proof. The formula for $F'$ follows from [Reference Wirth79, Lemma 5.8]. For the second derivative, recall the noncommutative chain rule
By calculating the second derivative, we obtain the second assertion
Proposition 3.15. Let $T_t=e^{-tL}:L_\infty (\Omega ,\mu )\to L_\infty (\Omega ,\mu )$ be an ergodic symmetric Markov semigroup. Let $\alpha _R,\alpha _2,\alpha _h$ be the optimal (largest) constant such that the following inequalities hold for any $f\in L_\infty (\Omega ,{\mathbb M}_2)\cap {\mathcal A}_{\mathcal E}$ ,
where $E_\mu (f)=(\int f d\mu )1_\Omega $ is the expectation map and $\hat {f}=f-E_\mu (f)$ is the mean zero part of f. Then $\alpha _R=\alpha _2=\alpha _h=0$ .
Proof. We write $\tau (f)=\frac {1}{2}\int {\text {tr}}(f)d\mu $ for the normalized trace on $L_\infty (\Omega ,{\mathbb M}_2)$ . We start with constant $\alpha _R$ in the Rothaus lemma. Without loss of generality, we may assume there is a measurable set $X\subset \Omega $ such that $\mu (X)=r$ for some $0<r<1$ . Let $\eta \in (0,1)$ . Then $h_0=(1-r)1_X-r 1_{X^c}$ is a real mean zero function. Consider the matrix valued function $f_\varepsilon =f+\varepsilon h$ where
where f is a constant matrix valued function. Then $E_\mu f_\varepsilon =f, \hat {f}_\varepsilon =\varepsilon h$ and
Then $E_\mu (|f_\varepsilon |^2)=f^2+\varepsilon ^2 h^2$ and $E_\mu (|\hat {f}_\varepsilon |^2)=E_\mu ( h^2)\varepsilon ^2$ . Using Lemma 3.14, the Taylor expansion of the left-hand side of (LSI) is
where we used the fact $\tau (h\log f)=0$ and $\tau (h^2\log f-E_\mu (h^2)\log f)=0$ . For the right-hand side of the Rothaus lemma, we find
While both $D(h^2||E_\mu ( h^2))$ and $\parallel \! h \! \parallel _{2}^2$ are finite, we have
Note that we can choose $\eta \to 1$ and $\frac {1}{2\eta }\ln (\frac {1+\eta }{1-\eta })\to +\infty $ , which implies $\alpha _R=0$ . The same example applies to LSI by choosing a mean zero function $h_0$ such that ${\mathcal E}(h_0,h_0)<\infty $ . For the hypercontractivity, for $p\ge 2$ we recall the norms
where the infimum takes over all positive invertible $x,y\in {\mathbb M}_2$ with unit $2r$ -norm for $\frac {1}{r}=\frac {1}{2}-\frac {1}{p}$ . Since $T_t$ is a bimodule map for $\mathbb {C}1\otimes {\mathbb M}_2\subset L_\infty (\Omega ,{\mathbb M}_2)$ , we can equivalently consider the norm
where the asymmetric amalgamated $L_2^a({\mathbb M}_2, L_p(\Omega ))$ space is equipped with norm
In particular,
and we have
Now define $p(t)=2q(t)=1+e^{2\alpha _h t}$
By assumption $G(t)\le 1$ , we have
which implies $\alpha _h\le \alpha _2=0$ . Note, however, that $\alpha _h\ge 0$ because $T_t$ is always contractive on $L_2({\mathbb M}_2, L_2(\Omega ))$ . Hence, $\alpha _h=0$ , and the proof is complete.
Remark 3.16. Similar to [Reference Bardet and Rouzé7, Corollary 5.2], the above proposition implies that for $p\neq 2$ , neither $L_2^a({\mathbb M}_2, L_p(\Omega ))$ or $L_2({\mathbb M}_2, L_p(\Omega ))$ are uniformly convex.
4 Entropy contraction for GNS symmetric quantum channels
4.1 State symmetric quantum channels
Let ${\mathcal M}$ be a von Neumann algebra and $\phi $ a normal faithful state. We have the GNS cyclic representation $\{\pi _\phi , H_\phi , \eta _\phi \}$ , which is a $*$ -isomorphism $\pi _\phi :{\mathcal M}\to H_\phi $ with a cyclic and separating vector $\eta _\phi $ such that
By identifying ${\mathcal M}\cong \pi _\phi ({\mathcal M})$ , the modular automorphism group $\alpha ^\phi _t$ for $t\in \mathbb {R}$ is defined as
where $\Delta $ is the modular operator of $\phi $ , defined as follows:
We consider the following two symmetric conditions with respect to a state $\phi $ .
Definition 4.1. We say a quantum Markov map $\Phi :{\mathcal M} \to {\mathcal M}$ is GNS-symmetric with respect to $\phi $ (in short, GNS- $\phi $ -symmetric) if
ii) We say $\Phi $ is KMS-symmetric with respect to $\phi $ (in short, KMS- $\phi $ -symmetric) if
Correspondingly, we call the pre-adjoint $\Phi _*:{\mathcal M}_*\to {\mathcal M}_*$ a GNS- or KMS- $\phi $ -symmetric quantum channel.
Both definitions are generalizations of the detailed balance condition for classical Markov chains and imply that $\phi =\phi \circ \Phi =\Phi _*(\phi )$ is an invariant state of $\Phi $ . It is proven in [Reference Giorgetti, Parzygnat, Ranallo and Russo32, Reference Wirth80] that the GNS- $\phi $ -symmetric quantum Markov map is equivalent to KMS- $\phi $ -symmetric plus that $\Phi $ commutes with the modular group
The commutation to modular group is also called Accardi-Cecchini condition in [Reference Giorgetti, Parzygnat, Ranallo and Russo32] for a study of quantum Bayes rule [Reference Parzygnat62, Reference Parzygnat and Russo65, Reference Parzygnat and Buscemi63, Reference Parzygnat and Fullwood64].
For simplicity, we will consider a semifinite von Neumann algebra ${\mathcal M}$ equipped with a normal faithful semi-finite trace $\tau $ , but our discussion applies to general von Neumann algebras with proper interpretation of notations. In the tracial setting, we can write $\phi (x)=\tau (d_\phi x)$ using the density operator $d_\phi $ of $\phi $ . Then the modular automorphism group is given by
Let $\Phi _*:L_1({\mathcal M})\to L_1({\mathcal M})$ be the pre-adjoint quantum channel via trace duality. The KMS- $\phi $ -symmetry is equivalent to
For $1\le p\le \infty $ , the weighted $L_p$ -space $L_p({\mathcal M},\phi )$ is the completion of ${\mathcal M}$ under the norm
where $\parallel \! y \! \parallel _{p}=\tau (|y|^p)^{1/p}$ is the tracial p-norm. For $p=2$ , $L_p({\mathcal M},\phi )$ is a Hilbert space with KMS-inner product $\parallel \! x \! \parallel _{2,\phi }^2=\langle \Delta ^{\frac {1}{4}}x\eta _\phi ,\Delta ^{\frac {1}{4}}x\eta _\phi \rangle $ . By equation (4.1), $\Phi $ is also a contraction on $L_1({\mathcal M},\phi )$ , and hence a contraction on $L_p({\mathcal M},\phi )$ for all $1\le p\le \infty $ by complex interpolation.
The lemma below is an analog of Proposition 2.2.
Proposition 4.2. Let $\Phi :{\mathcal M}\to {\mathcal M}$ be a GNS- $\phi $ -symmetric quantum Markov map for a normal faithful state $\phi $ . Denote ${\mathcal N}$ as the multiplicative domain of $\Phi $ . Then
-
i) ${\mathcal N}$ is invariant under $\alpha _t^\phi $ . Hence, there exists a $\phi $ -preserving normal conditional expectation ${E:{\mathcal M}\to {\mathcal N}}$ .
-
ii) $\Phi |_{\mathcal N}$ is an involutive $*$ -automorphism satisfying
(4.2) $$ \begin{align} \Phi^2\circ E=E\circ \Phi^2=E, \hspace{.1cm} E\circ \Phi=\Phi\circ E. \end{align} $$Moreover, $\Phi ^2$ is a ${\mathcal N}$ -bimodule map satisfying $\Phi ^2(axb)=a\Phi ^2(x)b$ for any $a,b\in {\mathcal N}$ and $x\in {\mathcal M}$ .
-
iii) $\Phi $ is an isometry on $L_2({\mathcal N},\phi )$ . If, in addition,
$$\begin{align*}{\parallel \! \Phi(\operatorname{\mathrm{\operatorname{id}}}-E):L_2({\mathcal M},\phi)\to L_2({\mathcal M},\phi) \! \parallel_{2}<1},\end{align*}$$then $E=\lim _{n}\Phi ^{2n}$ as a map from $L_2({\mathcal M},\phi )$ to $L_2({\mathcal M},\phi )$ .
Proof. It suffices to explain i). The rest follows similar as Proposition 2.2 (see also [Reference Gao and Rouzé31, Lemma 2.5] for the finite dimensional case). Indeed, since $\Phi $ commutes with $\alpha _t^\phi $ , for $x\in {\mathcal N}$ ,
Multiplicativity on the other side is similar, implying $ \alpha _t^\phi (x)\in {\mathcal N}$ . By Takesaki’s theorem [Reference Takesaki72], there exists $\phi $ -preserving conditional expectation satisfying the defining property
from which the GNS- $\phi $ -symmetry follows.
4.2 Haagerup’s reduction
A von Neumann algebra ${\mathcal M}$ is called type III if it does not admit a nontrivial semifinite trace. We briefly review the basics of Haagerup’s construction and refer to [Reference Haagerup, Junge and Xu36] for more details. The key idea is to consider the additive subgroup $G=\bigcup _{n\in {\mathbb N}} 2^{-n}\mathbb {Z} \subset \mathbb {R}$ of the automorphism group. Let ${\mathcal M}\subset B(H)$ be a von Neumann algebra and $\phi $ be a normal faithful state. One can define the crossed product by the action $\alpha ^\phi :G\curvearrowright {\mathcal M}$
$\hat {{\mathcal M}}$ can be considered as the von Neumann subalgebra $ \hat {{\mathcal M}}=\{\pi ({\mathcal M}),\lambda (G)\}" \subset {\mathcal M}\overline \otimes B(\ell _2(G))$ generated by the embeddings
Basically, $\pi $ is the transference homomorphism ${\mathcal M}\to \ell _\infty (G,{\mathcal M})$ , and $\lambda $ is the left regular representation on $\ell _2(G)$ . The set of finite sums $\{\sum _g a_g \lambda (g)\hspace {.1cm} | \hspace {.1cm} a_g \in {\mathcal M} \}\subset \hat {{\mathcal M}}$ forms a dense $w^*$ -subalgebra of $\hat {{\mathcal M}}$ . In the following, we identify ${\mathcal M}$ with $\pi ({\mathcal M})\subset \hat {M}$ (resp. a with $\pi (a)$ ) and view ${\mathcal M}\subset \hat {{\mathcal M}}$ as a subalgebra. The state $\phi $ admits a natural extension as a normal faithful state on $\hat {{\mathcal M}}$
Moreover,
is the canonical normal conditional expectation such that $\phi \circ E_{\mathcal M}\hspace {.1cm} = \hspace {.1cm} \hat {\phi }$ .
The main object in Haagerup’s construction is an increasing family of subalgebras
given by the centralizer algebra $\hat {{\mathcal M}}_{\psi _n}$ for a suitable family of states $\psi _n$ so that $\bigcup _n {\mathcal M}_n$ is $w^*$ -dense in $\hat {{\mathcal M}}$ . The state $\psi _n$ is defined via a Radon-Nikodym density w.r.t to $\hat {\phi }$
Here, $\text {Log}$ is the principal branch of the logarithmic function with $0\le \text {Log}(z)<2\pi $ . Each subalgebra ${\mathcal M}_n$ contains $\lambda (G)$ , and there exists normal conditional expectation $E_{{\mathcal M}_n}:\hat {{\mathcal M}}\to {\mathcal M}_n$ . Indeed, by the definition of $\psi _n$ , the modular group $\alpha _t^{\psi _n}$ is $2^{-n}$ periodic. The explicit form (see [Reference Haagerup, Junge and Xu36, Lemma 2.3]) is given by
The normalized state $\tau _n=\frac {\psi _n}{\psi _n(1)}$ is a normalized trace on ${\mathcal M}_n$ . The key properties of ${\mathcal M}_n$ are summarized in [Reference Haagerup, Junge and Xu36, Theorem 2.1 & Lemma 2.7], which we state below.
Theorem 4.3. With above notations, ${\mathcal M}_n$ is an increasing family of von Neumann subalgebras satisfying the following properties
-
(1) Each $({\mathcal M}_n,\tau _n)$ is a finite von Neumann algebra.
-
(2) $\bigcup _{n\ge 1} {\mathcal M}_n$ is weak $^*$ -dense in $\hat {{\mathcal M}}$ .
-
(3) There exists a $\hat {\phi }$ -preserving normal faithful conditional expectation $E_{{\mathcal M}_n}:\hat {{\mathcal M}}\to {\mathcal M}_n$ such that
$$\begin{align*}\hat{\phi}\circ E_{{\mathcal M}_n}=\hat{\phi}\hspace{.1cm}, \hspace{.1cm} \alpha_t^{\hat{\phi}}\circ E_{{\mathcal M}_n}= E_{{\mathcal M}_n}\circ \alpha_t^{\hat{\phi}}. \end{align*}$$Moreover, $E_{{\mathcal M}_n}(x)\to x$ in $\sigma $ -strong topology for any $x\in \hat {{\mathcal M}}$ .
We now look at the Haagerup reduction on the states. For a state $\rho \in S({\mathcal M})$ , $\hat {\rho }=\rho \circ E_{\mathcal M}$ is the canonical extension on $\hat {{\mathcal M}}$ . We denote $\rho _n:=\hat {\rho }|_{{\mathcal M}_n}\in {\mathcal M}_{n,*}$ as the restriction state of $\hat {\rho }$ on the subalgebra ${\mathcal M}_n \subset \hat {{\mathcal M}}$ . Note that the predual ${\mathcal M}_{n,*}$ can be viewed as a subspace of $\hat {{\mathcal M}}_*$ via the embedding
Via this identification, $\rho _n=\hat {\rho }|_{{\mathcal M}_n}\circ E_{{\mathcal M}_n}=\hat {\rho }\circ E_{{\mathcal M}_n}=E_{{\mathcal M}_n,*}(\hat {\rho })\in \hat {{\mathcal M}}_*$ . Moreover, by the weak $^*$ -density of the family ${\mathcal M}_n$ , $\rho _n\to \hat {\rho }$ converges in the weak topology. An immediate consequence is the following approximation of relative entropy.
Lemma 4.4. Let $\rho $ and $\sigma $ be two normal states of ${\mathcal M}$ . Then
Proof. Let $\iota :{\mathcal M}\subset \hat {{\mathcal M}}$ be the inclusion map. Because $\hat {\rho }=\rho \circ E_{\mathcal M}$ is an extension of $\rho $ , $\iota _*(\hat {\rho })=\hat {\rho }|_{\mathcal M}=\rho $ , and similarly for $\sigma $ . Both $\iota : {\mathcal M} \to \hat {{\mathcal M}}$ and $E_{\mathcal M}:\hat {{\mathcal M}}\to {\mathcal M}$ are quantum Markov maps. Then by the data processing inequality,
Thus, $D(\rho ||\sigma )=D(\hat {\rho }||\hat {\sigma })$ . As for the limit, we have
where the equality follows from the lower semi-continuity of relative entropy (see, for example, [Reference Hiai37, Theorem 2.7]). The second inequality is another use the data processing inequality.
We shall also apply the Haagerup’s reduction on GNS-symmetric maps. Let $\Phi :{\mathcal M}\to {\mathcal M}$ be a GNS- $\phi $ -symmetric quantum Markov map. Its canonical extension map
is also a GNS- $\hat {\phi }$ -symmetric quantum Markov map. Indeed, $\hat {\Phi }=\Phi \otimes \operatorname {\mathrm {\operatorname {id}}}_{B(\ell _2(G))}|_{\hat {{\mathcal M}}}$ is the restriction of $\Phi \otimes \operatorname {\mathrm {\operatorname {id}}}_{B(\ell _2(G))}$ on $\hat {{\mathcal M}}\subset {\mathcal M}\overline {\otimes } B(\ell _2(G)$ . It is clear that $\hat {\Phi }$ has the multiplicative domain
where ${\mathcal N}$ is the multiplicative domain of $\Phi $ . In particular, this crossed product is well defined because $\alpha ^\phi _t({\mathcal N})={\mathcal N}$ . Moreover, the $\hat {\phi }$ -preserving conditional expectation $\hat {E}:\hat {{\mathcal M}}\to \hat {{\mathcal N}}$ is nothing but the canonical extension of $E:{\mathcal M}\to {\mathcal N}$ .
Recall that we write $E_{\mathcal M}$ and $E_n$ as the normal conditional expectations from $\hat {{\mathcal M}}$ onto ${\mathcal M}$ and ${\mathcal M}_n$ , respectively. The next lemma shows that the extension $\hat {\Phi }$ is well compatible with the approximation family ${\mathcal M}_n$ .
Lemma 4.5. Let $\Phi :{\mathcal M}\to {\mathcal M}$ be a GNS- $\phi $ -symmetric quantum Markov map. With the notations above,
-
i) $\hat {\Phi }$ commutes with $E_{\mathcal M}, \hat {E}$ and $E_{{\mathcal M}_n}$ . In particular, $\hat {\Phi }({\mathcal M}_n)\subset {\mathcal M}_n$ .
-
ii) The restriction $\Phi _n=\hat {\Phi }|_{{\mathcal M}_n}$ is a normal unital completely positive map symmetric with respect to the tracial state $\tau _n$ .
-
iii) Let ${\mathcal N}_n\subset {\mathcal M}_n$ be the multiplicative domain for $\Phi _n$ . Then the restriction map $E_n:=\hat {E}|_{{\mathcal M}_n}:{\mathcal M}_n\to {\mathcal N}_n$ is the $\tau _n$ -preserving conditional expectation.
Proof. The relation $\hat {\Phi }\circ E_{\mathcal M}=E_{\mathcal M}\circ \hat {\Phi }$ is clear from the definition of $\hat {\Phi }$ , and $\hat {\Phi }\circ \hat {E}=\hat {E}\circ \hat {\Phi }$ follows from Lemma 4.2. Recall that $\psi _n(x)=\hat {\phi }(e^{-a_n}x)$ with density operator $e^{-a_n}\in \lambda (G)"$ and $\lambda (G)$ is in the centralizer of $\hat {\phi }$ [Reference Haagerup, Junge and Xu36, Lemma 2.3]. Then
for the unitary $u(t)=e^{-ita_n}$ . Note that $\hat {\Phi }$ commutes with $\alpha _t^{\hat {\phi }}$ by GNS- $\hat {\phi }$ -symmetry, and also commutes with $\text {ad}_{u(t)}$ because $u(t)\in \lambda (G)"$ is in $\hat {\Phi }$ ’s multiplicative domain. Thus, $\hat {\Phi }$ commutes with $\alpha _t^{\psi _n}$ and hence the conditional expectation $E_{{\mathcal M}_n}=2^{-n}\int _0^{2^{-n}}\alpha _t^{\psi _n}$ . This proves i).
For ii), we note that for $x,y\in {\mathcal M}_n$ ,
where we use the fact that $\hat {\Phi }$ is GNS- $\hat {\phi }$ -symmetric and $e^{-a_n}\in \lambda (G)"$ is in the fixed point subspace of $\hat {\Phi }$ . Finally, iii) follows from applying i) and ii) to $\hat {E}$ .
To summarize the lemma above, we have the following commuting diagrams:
Basically, $\Phi _n$ is a family of trace symmetric channels approximating $\hat {\Phi }$ , which is in turn a natural extension of $\Phi $ . The same picture holds for the conditional expectations $E_n,\hat {E}$ and E.
4.3 Entropy contraction
We shall now discuss the entropy contraction of GNS- $\phi $ -symmetric channels. The first step is to extend the entropy difference Lemma 2.1. Define the state space that is bounded with respect to $\phi $ ,
For all $\rho \in S_B({\mathcal M},\phi )$ , $D(\rho ||\phi ) <\infty $ is finite. Such $S_B({\mathcal M},\phi )$ is a dense subset of $S({\mathcal M})$ because for any $\rho $ and $0<\varepsilon <1$ , $\rho _\varepsilon =(1-\varepsilon )\rho +\varepsilon \phi \in S_B({\mathcal M})$ . For $\rho \in S_B({\mathcal M})$ , we define the entropy difference for a GNS- $\phi $ -symmetric quantum channel $\Phi _*$ as
By data processing inequality and $\Phi _*(\phi )=\phi $ , $D_{\Phi _*}(\rho )\ge 0$ . In the trace symmetric case, $D_{\Phi _*}(\rho )=D(\rho ||1)-D(\Phi _*(\rho )||1)=H(\rho )-H(\Phi _*(\rho ))$ as in Section 2. Let E be the conditional expectation onto the multiplicative domain of $\Phi $ . By the chain rule [Reference Petz66, Theorem 2] that for any E invariant state $\psi \circ E=\psi $ ,
we have the alternative expressions $D_{\Phi _*}(\rho )=D(\rho ||E_*(\rho ))-D(\Phi _*(\rho )||\Phi _*E_*(\rho )), $ where we used the property $\Phi _*E_*=E_*\Phi _*$ in Proposition 4.2.
Lemma 4.6. Let $\Phi _*$ be a GNS- $\phi $ -symmetric quantum channel. For any state $\rho ,\omega \in S_{b}({\mathcal M},\phi )$ ,
Proof. Recall that we use $\rho _n=\hat {\rho }|_{{\mathcal M}_n}=E_{{\mathcal M}_n,*}(\hat {\rho })$ and $\omega _n=\hat {\omega }|_{{\mathcal M}_n}=E_{{\mathcal M}_n,*}(\hat {\omega })$ as the restriction states on finite von Neumann algebra ${\mathcal M}_n\subset \hat {{\mathcal M}}$ obtained from the Haagerup reduction. By Lemma 4.5, we know that $\Phi _n=\hat {\Phi }|_{{\mathcal M}_n}$ is a quantum Markov map symmetric with respect to the tracial state $\tau _n$ . Thus, by Lemma 2.1 in the tracial case,
where we identify $\Phi _{n}=\Phi _{n,*}$ by trace symmetry. Here, since $\Phi _n=\Phi |_{{\mathcal M}_n}$ is GNS-symmetric to $\phi _n=\phi |_{{\mathcal M}_n}$ ,
By the definitions of $\Phi _n$ and $\rho _n$ , and the hat “ $\:\hat {}\:$ ” notation for states on $\hat {{\mathcal M}}$ ,
Then by Lemma 4.4, we can approximate every entropic term
The next lemma shows the CB-return time is also compatible with Haargerup reduction.
Lemma 4.7. Let $\Psi :{\mathcal M}\to {\mathcal M}$ be a GNS- $\phi $ -symmetric quantum Markov map and E be the conditional expectation on its multiplicative domain. Suppose
Then for all $n\in {\mathbb N}$ ,
Moreover, if $0.9 E \le _{cp} \Psi \le _{cp} 1.1E$ and $\Psi \circ E=E$ , then for any $\rho \in S_{B}({\mathcal M},\phi )$ ,
Proof. The CP order inequality follows from the fact that both maps $E_n$ and $\Psi _n$ are the restriction of $E\otimes \operatorname {\mathrm {\operatorname {id}}}$ and $\Psi \otimes \operatorname {\mathrm {\operatorname {id}}}$ on the subalgebra ${\mathcal M}_n\subset \hat {{\mathcal M}}\subset {\mathcal M}\overline {\otimes } B(\ell _2(G))$ . Then the entropy inequality can be obtained by the tracial case Lemma 2.3 and approximation as in Lemma 4.6.
We then extend the entropy contraction to the GNS-symmetric case.
Theorem 4.8. Let $\Phi :{\mathcal M}\to {\mathcal M}$ be a GNS- $\phi $ -symmetric quantum Markov map and E be the $\phi $ -preserving conditional expectation onto its multiplicative domain ${\mathcal N}$ . Define
Then, for any $\sigma $ -finite von Neumann algebra ${\mathcal Q}$ , state $\rho \in S({\mathcal M}\overline {\otimes } {\mathcal Q})$ ,
Proof. For $\rho \in S_{B}({\mathcal M},\phi )$ , the proof is same as the tracial case Theorem 2.5 by using Lemma 4.6 and Lemma 4.7 above. The general case $\rho \in S({\mathcal M})$ can be approximated by $\rho _\varepsilon = (1-\varepsilon )\rho + \varepsilon \phi $ .
Recall that in finite dimensions, the MLSI is defined as the supremum of $\alpha $ such that
The right-hand side $I_L(\rho )$ is the entropy production, and the equivalence to entropy decay relies on the de Bruijn identity
In infinite dimensions, the de Bruijn identity (4.6) is less justified even in $B(H)$ with $\dim (H)=+\infty $ (see discussions in [Reference Huber, K¨onig and Vershynina40, Reference König and Smith44]). To avoid this issue, we define the MLSI on Type III von Neumann algebra as follows.
Definition 4.9. For a GNS- $\phi $ -symmetric quantum Markov semigroup $T_t=e^{-tL}:{\mathcal M}\to {\mathcal M}$ , we define the modified log-Sobolev (MLSI) constant $\alpha _1(L)$ as the largest constant $\alpha $ such that
where E is the $\phi $ -preserving conditional expectation onto the fixed point subalgebra ${\mathcal N}$ . The complete MLSI constant is then defined as $\alpha _c(L):=\sup _{\mathcal {{\mathcal Q}}}\alpha (L\otimes \operatorname {\mathrm {\operatorname {id}}}_{{\mathcal Q}})$ , where the supremum is over all $\sigma $ -finite von Neumann algebra ${\mathcal Q}$ .
This definition of MLSI also does not depend on any choice of reference state $\phi $ (see Lemma 4.16). With this definition, we obtain the first half of Theorem 1.1, which is restated below.
Theorem 4.10. Let $T_t=e^{-tL}:{\mathcal M}\to {\mathcal M}$ be a GNS- $\phi $ -symmetric quantum Markov semigroup. Denote $t_{cb} \hspace {.1cm} = \hspace {.1cm} \inf \{t>0 \hspace {.1cm}|\hspace {.1cm} 0.9 E\le _{cp}T_t \le _{cp} 1.1 E\} $ . Then
Namely, for any $\sigma $ -finite von Neumann algebra ${\mathcal Q}$ and state $\rho \in S({\mathcal M}\overline {\otimes } {\mathcal Q})$ , we have the exponential decay of relative entropy
Remark 4.11. In the above Haagerup’s reduction, both $\hat {\Phi }$ and $\Phi _n$ are always non-ergodic even given $\Phi $ is ergodic. From this point of view, our consideration for non-ergodic cases is essential even for ergodic $\Phi $ . It also indicates that Haagerup’s reduction does not work for LSI/hypercontractivity.
As we have seen in Proposition 3.7 for the tracial case, a combination of heat kernel estimates and spectral gap allows us to bound CB return time. The same analysis remains valid in the GNS- $\phi $ -symmetric case. For $1\le p\le \infty $ , we define the $\phi $ -weighted conditional $L_\infty ^p({\mathcal N}\subset {\mathcal M}, \phi )$ space as the completion of ${\mathcal M}$ under the norm
For a GNS-symmetric ${\mathcal N}$ -bimodule map $\Psi :{\mathcal M}\to {\mathcal M}$ , the equivalence in Proposition 3.4 also holds,
Based on that, we have an analog of Proposition 3.7.
Proposition 4.12. Let $T_t:{\mathcal M}\to {\mathcal M}$ be a GNS- $\phi $ -symmetric quantum Markov semigroup and $E:{\mathcal M}\to {\mathcal N}$ be the $\phi $ -preserving conditional expectation onto the fixed point space. Suppose
-
i) the $\lambda $ -Poincaré inequality that $\parallel \! T_{t}-E: L_2({\mathcal M},\phi )\to L_2({\mathcal M},\phi ) \! \parallel _{}\le e^{-\lambda t}\hspace {.1cm} ,\hspace {.1cm} \forall t\ge 0$ ;
-
ii) there exists $t_0$ such that $\parallel \! T_{t_0}: L_\infty ^1({\mathcal N}\subset {\mathcal M},\phi )\to L_\infty ({\mathcal M}) \! \parallel _{cb}\le C_0$ .
Then $ t_{cb}\hspace {.1cm} \le \hspace {.1cm} \frac {1}{\lambda }\ln (10 C_0)+t_0$ . In particular, if $C_{cb}(E)<\infty $ , $ t_{cb}\hspace {.1cm} \le \hspace {.1cm} \frac {1}{\lambda }\ln (10 C_{cb}(E))$ .
Proof. The argument is similar to the tracial cases by using the property of $L_\infty ^p({\mathcal N}\subset {\mathcal M},\phi )$ for general von Neumann algebra established in [Reference Junge and Parcet42]. See also [Reference Bardet, Capel, Gao, Lucia, Pérez-García and Rouzé6, Section 5] for the argument in finite dimensional GNS-symmetric cases.
4.4 Applications to finite quantum Markov chains
Let $T_t=e^{-Lt}:{\mathbb M}_d\to {\mathbb M}_d$ be a quantum Markov semigroup on matrix algebra ${\mathbb M}_d$ . Its generator L admits the following Lindbladian form ([Reference Gorini, Kossakowski and Sudarshan33, Reference Lindblad50]):
where $h,V_j\in {\mathbb M}_d$ and $h=h^*$ is Hermitian. When $T_t$ is GNS-symmetric, one has the following simplified form [Reference Alicki1, Reference Kossakowski, Frigerio, Gorini and Verri45] that
where $\{V_j\}=\{V_j\}^*$ is an orthogonal set with respect to trace inner product and the eigenvector of modular group $\alpha _t^{\phi }(V_j)=e^{-iw_j t}V_j\hspace {.1cm}.$ In finite dimensions, the completely Pimsner-Popa index $C_{cb}(E)$ is always finite. Combining Theorem 4.10 and Proposition 4.12, we obtain the second half of Theorem 1.1 restated as below.
Corollary 4.13. For finite dimensional GNS-symmetric quantum Markov semigroups,
Corollary 4.13 improves the bound $\alpha _c \ge \frac {\lambda }{2C_{cb}(E)}$ in the previous work of Gao and Rouzé [Reference Gao and Rouzé31].
Remark 4.14. In the ergodic case ${\mathcal N}=\mathbb {C}1$ , the conditional expectation $E_\phi (x)=\phi (x)1$ has index
The above bound (4.9) gives
This can be compared to the bound
proved by Diaconis and Saloff-Coste [Reference Diaconis and Saloff-Coste23] for symmetric classical Markov semigroups. In the quantum case, it is only obtained for unital semigroups [Reference Kastoryano and Temme43] and $d=2$ [Reference Beigi, Datta and Rouzé9]. For both classical and quantum depolarizing semigroups $L(x)=x-\phi (x)1$ , this bound is known to be optimal for $\alpha _2$ , which lower bounds $\alpha _1$ . Our results gives a general $\mathcal {O}(\frac {\lambda }{\parallel \! \phi ^{-1} \! \parallel _{\infty }})$ lower bound for $\alpha _1$ for non-ergodic cases and also the complete constant $\alpha _c$ .
Remark 4.15. The Corollary 3.10 shows that the CMLSI constant $\alpha _{c}$ for a classical Markov semigroup is lower bounded by LSI constant $\alpha _2$ up to a $O(\log \log \parallel \! \mu ^{-1} \! \parallel _{\infty })$ term. This argument does not work for Quantum Markov semigroup $T_t:\mathbb {M}_d\to {\mathbb M}_d$ on matrix algebras, although (3.15) remains valid for ergodic quantum Markov semigroups. The difference is that for matrix algebra, the bounded return time
and the CB return time of completely bounded norm
are quite different. In the classical setting, we used the fact
So the $t_b(e^{-2})$ and $t_{cb}(0.1)$ are comparable by absolute constants. In the noncommutative setting, we only have
In the trace symmetric case, $\parallel \! \mu ^{-1} \! \parallel _{\infty }=d$ and $t_{cb}(0.1)\le \frac {3}{2}t_b(e^{-2}) +\ln d$ ,
which is worse than the lower bound in the previous remark as $\alpha _2\le \lambda $ .
4.5 Independence of invariant state
The next lemma shows that the GNS-symmetry is also independent of the choice of invariant state $\phi $ .
Lemma 4.16. Let $T:{\mathcal M}\to {\mathcal M}$ be a GNS- $\phi $ -symmetric quantum Markov map for a normal faithful state $\phi $ . Denote $E:{\mathcal M}\to {\mathcal N}$ as the $\phi $ -preserving conditional expectation onto the multiplicative domain. Suppose $\psi $ is an another normal faithful state invariant under E (i.e., $\psi \circ E=\psi $ ). Then $T:{\mathcal M}\to {\mathcal M}$ is also GNS- $\psi $ -symmetric.
Proof. Without loss of generality, we assume $\psi \le C\phi $ for some $C>0$ . We first view them as the states on the subalgebra ${\mathcal N}$ by restriction. By [Reference Takesaki73, Theorem 3.17], there exists $h\in {\mathcal N}$ such that
This identity actually also holds for $y\in {\mathcal M}$ . Indeed, because of $\phi \circ E=\phi $ and $\psi \circ E=\psi $ ,
Moreover, one can replace h by $T(h)$ , because
where we use the fact that $T^2(h)=h$ . Thus, the GNS-symmetry with respect to $\psi $ follows that for $x,y\in {\mathcal M}$ ,
where we used the multiplicative property of $T(axb)=T(a)T(x)T(b)$ for $a,b\in {\mathcal N}$ . The general case can be obtained via $\psi _{\varepsilon }=(1-\varepsilon )\psi +\varepsilon \phi $ .
We remark that if one has convergence $\lim _{n}\Phi ^{2n}=E$ in $L_2$ -norm, the above E-invariant condition $\phi \circ E=\phi $ can be replaced by $\phi =\Phi ^{2}\circ \phi $ .
Note that the left-hand side of (4.8) only relies on complete positivity. Indeed, the $L_\infty ^1({\mathcal N}\subset {\mathcal M}, \phi )$ norm at the right hand-side is also independent of the choice of the invariant state $\phi =\phi \circ E$ .
Lemma 4.17. Let $\phi $ be a normal faithful state and $E:{\mathcal M}\to {\mathcal N}$ be a $\phi $ -preserving conditional expectation. Suppose $\psi =\psi \circ E$ is another normal faithful state preserved by E. Then,
The identity extends to all $x\in L_\infty ^p({\mathcal N}\subset {\mathcal M}, \phi ) $ .
Proof. Note that if both $\phi $ and $\psi $ are E invariant, then $d_{\psi }^{-\frac {1}{2p}}d_\phi ^{\frac {1}{2p}}$ is affiliated to ${\mathcal N}$ . Indeed, as argued in Lemma 4.16, if $\psi \le C\phi $ , then $d_\psi = hd_\phi h^*$ for some $h\in {\mathcal N}$ , and the general case follows from approximation $\psi \le \frac {1}{\varepsilon }((1-\varepsilon )\phi +\varepsilon \psi )$ . Then we have
Denote $a_1=d_{\psi }^{-\frac {1}{2p}}d_\phi ^{\frac {1}{2p}}a$ and $b_1=bd_\phi ^{\frac {1}{2p}}d_{\psi }^{-\frac {1}{2p}}$ . For $x\in {\mathcal M}$ ,
where the supremum are for $a,b\in {\mathcal N}$ .
Remark 4.18. For finite ${\mathcal M}$ , one particular invariant state of E used in [Reference Bardet and Rouzé7, Reference Bardet, Capel, Gao, Lucia, Pérez-García and Rouzé6] is $\phi _{{\text {tr}}}=E_*(1)$ . This state is convenient because $\phi _{{\text {tr}}}|_{\mathcal N}$ is a trace. Then by Lemma 4.17, we have
where we used the fact $L_p({\mathcal N},\phi _{\text {tr}})$ is a tracial $L_p$ -space. We will use this point to simplify the discussion in Section 5.4.
5 Applications and examples
5.1 Entropy contraction coefficients
In this section, we discuss the implications of our results on contraction coefficients studied in [Reference Diaconis and Saloff-Coste23, Reference Del Moral, Ledoux and Miclo22, Reference Müller-Hermes, Stilck França and Wolf57, Reference Gao and Rouzé31]. These are analogs of functional inequalities for a single quantum channel.
Definition 5.1. Let $\Phi :{\mathcal M}\to {\mathcal M}$ be a quantum Markov map GNS- $\phi $ -symmetric to a normal faithful state $\phi $ and $E:{\mathcal M}\to {\mathcal N}$ be the $\phi $ -preserving conditional expectation onto the multiplicative domain of $\Phi $ . We define
-
i) the $L_2$ -contraction coefficient:
(5.1) $$ \begin{align}\lambda(\Phi):=\parallel \! \Phi(\operatorname{\mathrm{\operatorname{id}}}-E):L_2({\mathcal M},\phi)\to L_2({\mathcal M},\phi) \! \parallel_{}. \end{align} $$ -
ii) the entropy contraction coefficient:
$$\begin{align*}\alpha(\Phi):=\sup_{\rho} \frac{D(\Phi_*(\rho)||\Phi_*\circ E_*(\rho))}{D(\rho||E_*(\rho))}.\end{align*}$$ -
iii) the complete entropy contraction coefficient $\alpha _c(\Phi ):=\sup _{{\mathcal Q}}\alpha (\operatorname {\mathrm {\operatorname {id}}}_{\mathcal Q}\otimes \Phi )$ where the supremum is over all $\sigma $ -finite von Neumann algebras ${\mathcal Q}$ .
The condition $\lambda (\Phi )<1$ can be viewed as a Poincaré inequality for a quantum channel $\Phi $ , which implies the exponential convergence in $L_2$ ,
Similarly, the entropy contraction coefficient gives the convergence in relative entropy
The complete constant $\alpha _c(\Phi )$ controls not only the entropy contraction of $\Phi $ but also $\operatorname {\mathrm {\operatorname {id}}}_{\mathcal Q}\otimes \Phi $ with any environment system $\mathcal {Q}$ . This leads to the tensorization property of $\alpha _c$ that for two GNS-symmetric quantum channels [Reference Gao and Rouzé31],
For classical Markov maps, the tensorization property (5.2) is known to also hold for the non-complete constant $\alpha $ . Nevertheless, for the quantum Markov map (channel), this is not the case, and $\alpha (\Phi )$ in general can be strictly less than $\alpha _c(\Phi )$ (see [Reference Brannan, Gao and Junge14, Section 4.4]).
In finite dimensions, the existence of strictly contractive constant $\alpha _c(\Phi )<1$ was obtained in [Reference Gao and Rouzé31, Theorem 4.1]. Our results give an explicit estimate for $\alpha _c(\Phi )$ .
Corollary 5.2. Let $\Phi $ be a GNS-symmetric quantum Markov map,
Remark 5.3. In the ergodic trace symmetric case ${\mathcal N}=\mathbb {C}1$ and ${\mathcal M}={\mathbb M}_d$ , we have the trace map $E(x)={\text {tr}}(x)\frac {1}{d}$ and the CB-index $C_{cb}(E)= d^2$ . The above estimate implies
This can be compared to [Reference Müller-Hermes, Stilck França and Wolf57, Theorem 4.2] and [Reference Kastoryano and Temme43, Corollary 27],
where $\alpha _{2}(\operatorname {\mathrm {\operatorname {id}}}-\Phi ^2)$ is the LSI constant of $\operatorname {\mathrm {\operatorname {id}}}-\Phi ^2$ as a generator of quantum Markov semigroup. The two upper bounds in (5.3) and (5.4) are comparable, as both are asymptotically $\Theta (\frac {-\ln \lambda (\Phi )}{\ln d})$ . The strength of our results is that (5.3) also bounds the complete constant $\alpha _c(\Phi )$ which has the tensorization property.
Remark 5.4. Our Lemma 2.1 implies
where $\alpha _1$ is MLSI constant of the semigroup generator $(\operatorname {\mathrm {\operatorname {id}}}-\Phi ^*\Phi )$ . For a classical Markov map, it was proved by Del Moral, Ledoux and Miclo [Reference Del Moral, Ledoux and Miclo22] that there exists a universal constant $0<c<1$ such that
To the best of our knowledge, the above upper bound in (5.5) is open in the quantum case.
5.2 Graph random walks
Let $G=(V,E)$ be a finite undirected graph with $|V|=d$ and the edge set $E\subset V\times V$ . The discrete time random walk on G is a finite Markov chain given by the stochastic matrix
Here, $d(u)$ is the degree of vertex $u\in V$ . Then $K_G:l_\infty (V)\to l_\infty (V)$ is a Markov map. The $K_G$ admits a unique station distribution $\pi (u)=\frac {d(u)}{2m}$ , where $|E|=m$ . It is clear that $K_G$ is symmetric to the measure $\pi $ , also called reversible. Hence, $K_G$ is an ergodic unital channel on $L_\infty (V,\pi )$ as $\pi (K_G(f))=\pi (f)$ . The expectation map is $E_\pi (f)=\pi (f)1$ whose index is
$K_G$ is connected and $E_\pi $ are symmetric operators on $L_2(V,\pi )$ and
if $K_G$ not bipartite (in the bipartite case $K_G$ has eigenvalue $-1$ ). Then our results imply
Example 5.5 (Cyclic graphs).
Let us consider the cyclic graph $C_d=(V,E)$ with $d\ge 4$ where $V=\{1,\cdots ,d\}$ and $E=\{(j,j+1)| j=1,\cdots , d \}$ . Here, the addition is understood in the sense of ‘mod d’. Then
As $C_d$ is 2-regular, $K_{C_d}$ is symmetric to the uniform distribution $\pi (i)=1/d$ . It is known that $K_{C_d}$ has spectrum
The associated eigenvector is $e_j=\frac {1}{\sqrt {d}}(1,\omega ^j,\omega ^{2j},\cdots , \omega ^{(d-1)j})$ where $\omega =\exp (\frac {2\pi i}{d})$ . When $d=2m+1$ is odd, $\pi $ is the unique stationary measure, and $E_\pi $ is the projection onto the vector $e_0$ . We have
By triangle inequality, we have
where $C>0$ is some absolute constant by fact that the Wallis integrals $W_k=\int _{0}^{\pi /2}\cos ^k(x)dx\sim \sqrt {\frac {\pi }{2k}}$ . Thus,
and (5.6) implies
By Miclo’s result (5.5), this is asymptotically tight because the MLSI constant $\alpha _1(I-K_G^2)\sim \mathcal {O}(d^{-2})$ (see Example 5.6 below for detials). The similar asymptotic estimate also holds for even circle $d=2m$ .
For the continuous time random walk, we consider $w:E\to (0,\infty )$ to be a positive weighted function on the edge set E. The (weighted) graph Laplacian is given by the matrix
$L_G$ generates the continuous time random walk $T_t=e^{-L_G t}$ as a Markov semigroup and is symmetric with to the uniform distribution $\pi $ on V. $T_t$ is ergodic if and only if $G=(V,E)$ is connected. The expectation map $E_\pi (f)=\pi (f)\textbf {1}$ has index $C_{cb}(E_\pi )=d$ . Then Corollary 4.13,
This lower bound of $\alpha _c(L_G)$ has better dependence on the dimension d than [Reference Li, Junge and LaRacuente49, Lemma 5.2].
Example 5.6 (Cyclic graphs).
Let us again consider the cyclic graph $C_d$ with d vertices. For the uniformly weighted case $w_e\equiv 1$ , $L_{C_d}$ is a circulant matrix
Thus, $L_{C_d}=2(I-K_{C_d})$ where $K_{C_d}$ is the random walk kernel in Example 5.5, and $ L_{C_d}$ has spectrum $\lambda _j=2(1-\cos \frac {2\pi j}{d})$ . As discussed in [Reference Diaconis and Saloff-Coste23, Example 3.6],
Choosing $t_0=d^2$ , we have
Thus, by Theorem 1.1,
This shows that for this example, our inverse of $t_{cb}$ bound for $\alpha _c$ is tight up to absolute constant. Note that the LSI constant $\alpha _2(L_{C_d})$ is also of $\Theta (\frac {1}{d^2})$ .
We refer to [Reference Diaconis and Saloff-Coste23, Reference Bobkov and Tetali11] more examples on spectral gap $\lambda $ , Log-Sobolev constants $\alpha _2$ , $\alpha _1$ , and $L_\infty $ mixing time $t_b$ of finite Markov chains.
5.3 A noncommutative Birth-Death process
Let us illustrate our estimate with a noncommutative birth-death process. This example is a generalization of graph Laplacians on matrix algebras (see [Reference Li, Junge and LaRacuente49, Reference Junge, LaRacuente and Rouzé41] for similar constructions). To fix the notation, let $G=(V, E)$ be an undirected graph with $n=|V|$ vertices and edge set E. For each edge $(r,s)\in E$ , we introduce the edge Lindbladian on ${\mathbb M}_n$ ,
where $e_{rs}\in {\mathbb M}_n$ is the matrix unit with $1$ at the $(r,s)$ position. The total Lindbladian is a weighted sum over the edge set E,
where we assume $\beta _{rs}=-\beta _{sr}$ and $w(r,s)=w(s,r)>0$ for the GNS-symmetry condition. Note that for $j\neq k$ ,
Let us collect some relevant facts of such a Lindbladian L as noncommutative extension of graph Laplacian.
-
i) Denote $\ell _{\infty }(V)\subset {\mathbb M}_n$ as the diagonal subalgebra. $L(\ell _{\infty }(V))\subset \ell _{\infty }(V)$ , and $L|_{\ell _{\infty }(V)}$ is a weighted graph Laplacian;
-
ii) For $r\neq s$ , the matrix unit $e_{rs}$ is an eigenvector of L
$$\begin{align*}L(e_{rs}) \hspace{.1cm} = \hspace{.1cm} \gamma_{rs} e_{rs} ,\end{align*}$$where $\gamma _{rs}=2(\sum _{(r,j)\in E}w(r,j)e^{\beta _{rj}/2}+\sum _{(k,s)\in E}w(k,s)e^{-\beta _{ks}/2})$ . -
iii) $\ker (L)\subset \ell _{\infty }(V)$ , and $\ker (L)=\mathbb {C}1$ if $\mathcal {G}=(V, E)$ is connected.
-
iv) Let $\mu =(\mu _k)\in \ell _{\infty }(V)$ be a density operator in the diagonal subalgebra. Then L is GNS- $\mu $ -symmetric if $e^{\beta _{rs}}=\mu _s/\mu _r$ for any $s\neq r$ .
Assume $L=\sum _{(s,r)\in E}L_{sr}$ is an ergodic graph Lindbladian satisfying GNS- $\mu $ -symmetric condition for a diagonal density operator $\mu $ . Denote $E_d$ as the projection onto diagonal subalgebra. We can decompose the semigroup $T_t=e^{-tL}$ on the diagonal part and off diagonal part.
It is clear from i) and ii) that $T_tE_d$ is a classical graph random walk and $T_t(\operatorname {\mathrm {\operatorname {id}}}-E_d)$ is a Schur multiplier on ${\mathbb M}_n$ . Using this decomposition, we consider the CB-return time of the semigroup
satisfying
where $t^{diag}_{cb}$ and $t^{off}_{cb}$ are the CB-return time for the diagonal part $T_{t}E_{d}$ and off diagonal part $T_t(\operatorname {\mathrm {\operatorname {id}}}-E_d)$ , respectively, where
For the diagonal part, $t_{cb}^{diag}(\epsilon )$ is a classical $L_\infty $ mixing time, i.e. the smallest t such that
For the off-diagonal term, we deduce from the Effros-Ruan isomorphism that a Schur multiplier map
Note that for each t,
is a symmetric matrix with positive entry. A standard application of Schur’s lemma for matrices with positive entries implies
which gives us an estimate for the off diagonal term $t_{cb}^{off}(\epsilon )$ .
Now we consider the birth-death process on a finite state space $V=\{1,\cdots ,n\}$ , which we denote as $L^{BD}_n$ . The corresponding edge E set consists of only successive vertices $E=\{(j,j+1)| 1\le j\le n-1\}$ . The simplest case chooses the uniform weight $w(r,s)=1$ for $(r,s)\in E$ and allows only one Bohr frequency $e^{-\beta }=\frac {\mu _{j}}{\mu _{j+1}}$ , and the resulting stationary measure is the well-studied thermal state
where $Z_{\beta }=\sum _{j=1}^{n} e^{-\beta j}$ is the normalization constant. In this case, $\gamma _{rs}=8(\cosh \beta ) t$ , and the off diagonal CB norm can be estimated by
Thus, $t_{cb}^{off}(\varepsilon )\le C_1(\beta )n$ for some constant $C_1(\beta )$ depending on $\beta $ . For the classic part, we refer to [Reference Miclo55] and [Reference Chen17] for the fact that the spectral gap is of order $O(1)$ ; that is,
for all $n\in {\mathbb N}$ . For the commutative system on the diagonal part, this implies (see also [Reference Diaconis and Saloff-Coste23])
(for $\varepsilon =e^{-2}$ , but here, the actual value of $\varepsilon $ does not change the asymptotic estimate). However, we have based on [Reference Miclo55] that
Combining the diagonal and off diagonal part, we know $t_{cb}(L^{BD}_n)\sim n$ . It turns out CMLSI constant has asymptotic $\alpha _c(L^{BD}_n)\sim \frac {1}{n}$ , which indicates our estimate $\alpha _{c}\ge \frac {1}{2t_{cb}}$ is asymptotically tight for this example.
Theorem 5.7. For $\beta> 0$ , there exist constants $c(\beta ), C(\beta )>0$ such that the CMLSI constant of noncommutative birth-death process $L_n^{BD}$ satisfies
The same $\Theta (\frac {1}{n})$ asymptotic holds for $t_{cb}(L^{BD}_n)^{-1}$ .
Proof. It suffices to show that
For this, we consider the function in the commutative system on the diagonal
so that $\rho :=f\mu $ represents a probability density. The relative entropy term satisfies
Our density is $\rho \equiv (\frac {1}{n})$ , and the reference density is $\mu (k)=\frac {e^{-\beta k}}{Z(\beta )}$ .
Denote $a_k=|k\rangle \langle k+1|$ . On the diagonal, we have
We have
Note that
Then we have the entropy production
for some constant $c(\beta )$ only depending on $\beta $ . This holds for $n\hspace {.1cm} \ge \hspace {.1cm} n_0$ large enough.
Remark 5.8. When $\beta> 0$ , $\sum _{k=1}^n e^{-\beta k} = O(1)$ is a geometric series. In the case that $\beta = 0$ , the above birth-death process reduces to a ‘broken’ version of the cyclic graph (linear graph) as in Example 5.6 with $\alpha _c(L_n) \sim 1/n^2$ .
5.4 Noncommutative concentration inequality
In this section, we show that $\text {CMLSI}$ of a GNS- $\phi $ -symmetric semigroup implies concentration inequalities for the state $\phi $ . The key quantity in the discussion is the Lipschitz semi-norm
where the gradient form (or Carré du Champ operator) is
Note that $\|\cdot \|_{\text {Lip}}$ is a semi-norm (satisfying triangle inequality) because $\Gamma _L$ is completely positive bilinear form. Our first lemma is to show that $\|x\|_{\text {Lip}}$ can be approximated by Haagerup reduction.
Lemma 5.9. Let $x\in {\mathcal M}$ . Then for all $n\in {\mathbb N}$ ,
Proof. Recall the conditional expectation $E_{{\mathcal M}_n}:\hat {{\mathcal M}}\to {\mathcal M}_n$ is given by
Note that $\alpha _t^{\psi _n}$ is an inner automorphism on ${\mathcal M}\rtimes _\alpha 2^{-n}\mathbb {Z}\cong L_\infty (\mathbb {T}, {\mathcal M}) $ . We note that for a modular automorphism $\alpha _t$ such that $L\alpha _t=\alpha _tL$ ,
which implies $\parallel \! x \! \parallel _{\text {Lip}}=\parallel \! \alpha _t(x) \! \parallel _{\text {Lip}}$ . Here, both $\alpha _t^{\hat {\phi }}$ and $\alpha _t^{\psi _n}$ commute with $\hat {L}=\operatorname {\mathrm {\operatorname {id}}}_{\mathbb {T}}\otimes L$ by the GNS-symmetricness of $\hat {L}$ . Then by triangle inequality,
Lemma 5.10. Let ${\mathcal M}_0,{\mathcal N}\subset {\mathcal M}$ be two subalgebras and $\phi $ be a normal faithful state. Suppose $E_{0}:{\mathcal M}\to {\mathcal M}_0$ and $E:{\mathcal M}\to {\mathcal N}$ are $\phi $ -preserving conditional expectations onto ${\mathcal M}_0$ and ${\mathcal N}$ , respectively. Suppose $E\circ E_0=E_0\circ E$ satisfy the commuting square condition
where ${\mathcal N}_{0}\subset {\mathcal N}$ is a subalgebra. Then for any $p\in [1,\infty ]$ and any $x\in {\mathcal M}$ ,
In other words, $L_\infty ^p({\mathcal N}_0\subset {\mathcal M}_0,\phi )\subset L_\infty ^p({\mathcal N}\subset {\mathcal M},\phi )$ as a $1$ -complemented subspace with projection $E_0$ .
Proof. We can assume $\phi =\phi _{\text {tr}}$ in the Remark 4.18. Using commuting square assumption, we know $E_0(a)\in {\mathcal N}_0$ for $a\in {\mathcal N}$ . By definition,
where the supremum is for all $a,b$ in the corresponding subalgebra with $\parallel \! a \! \parallel _{\phi ,p}=\parallel \! b \! \parallel _{\phi ,p}=1$ . Now it suffices to show the other direction
for $x\in {\mathcal M}_0$ . For that, we revoke that for $\frac {1}{p}+\frac {1}{q}=1$ , $L_1^{p'}({\mathcal N}\subset {\mathcal M})\subset L_\infty ^p({\mathcal N}\subset {\mathcal M}, \phi )^*$ is as a weak $^*$ -dense subspace [Reference Junge and Parcet42, Proposition 4.5]. Here, for $x\in {\mathcal M}$ ,
where the infimum is over all factorization $y=azb$ with $a,b\in {\mathcal N}, z\in {\mathcal M}$ . The duality pairing is given by the KMS inner product,
Indeed, it was proved in [Reference Junge and Parcet42, Corollary 3.13] that
is a contraction by the commuting square condition. Therefore, for $x\in {\mathcal N}_0$ , by the KMS- $\phi $ -symmetry of $E_0$ ,
Lemma 5.11. For $x\in {\mathcal M}$ , $\lim _{n}\|E_{{\mathcal M}_n}(x)\|_{L_\infty ^p({\mathcal N}_n\subset {\mathcal M}_n, \psi _n)}= \|x\|_{L_\infty ^p({\mathcal N}\subset {\mathcal M}, \phi )}$ .
Proof. Recall the commuting square condition $E_{{\mathcal M}_n}\circ \hat {E}=\hat {E}\circ E_{{\mathcal M}_n}$ . By Lemma 4.17 & 5.10,
The other direction follows from the weak $^*$ -convergence $E_{{\mathcal M}_n}(x) \to x$ . Fix $\frac {1}{q}+\frac {1}{p}=1$ . For any $\varepsilon>0$ , there exists $a_0,b_0\in \hat {{\mathcal N}}$ and $y_0\in \hat {{\mathcal M}}$ such that
By the weak $^*$ -density, we can choose $n_1,n_2,n_3$ and $n_4\ge \max \{n_1,n_2,n_3\}$ inductively such that
Since $E_{{\mathcal M}_{n}}(\hat {{\mathcal N}})={\mathcal N}_n$ (see the commuting diagram after Lemma 4.5), we have
by the KMS- $\hat {\phi }$ -symmetry of $E_{{\mathcal M}_n}$ . Then, for $n\ge n_4=\max \{n_1,n_2,n_3,n_4\}$ ,
This proves
Finally, the assertion follows from
as a consequence of $E_0\circ \hat {E}=\hat {E}\circ E_0$ by Lemma 5.10.
Now we restate and prove Theorem 1.4.
Theorem 5.12. Let ${\mathcal M}$ be a $\sigma $ -finite von Neumann algebra and $T_t=e^{-tL}$ be a GNS- $\phi $ -symmetric quantum Markov semigroup. Suppose $T_t$ satisfies $\text {MLSI}$ with parameter $\alpha>0$ . There exists an universal constant c such that for $2\le p <\infty $ ,
Proof. We first show that if $T_t$ satisfies $\alpha $ -MLSI, so does the approximation semigroup.
Indeed, as we see in the discussion above, ${\mathcal M}_n\subset {\mathcal M}\rtimes _{\alpha ^{\phi }_t} 2^{-n}\mathbb {Z}\cong L_\infty (\mathbb {T}, {\mathcal M})$ , and the extension $\hat {T}_t=T_t\otimes \operatorname {\mathrm {\operatorname {id}}}_{\mathbb {T}}$ has $\alpha $ -MLSI (because $L_\infty (\mathbb {T})$ is a commutative space). Note that since ${\mathcal M}_n\subset {\mathcal M}\rtimes _{\alpha ^{\phi }_t} 2^{-n}\mathbb {Z}\subset {\mathcal M}\rtimes _{\alpha ^{\phi }_t} G$ , the restriction $E_{{\mathcal M}_n}:{\mathcal M}\rtimes _{\alpha ^{\phi }_t} 2^{-n}\mathbb {Z}\to {\mathcal M}_n$ is also a conditional expectation. Then for any $\rho ,\sigma \in S({\mathcal M}_n)$ , we have
Using the commutation relation $T_{n,t}\circ E_{{\mathcal M}_n}=E_{{\mathcal M}_n}\circ \hat {T}_t$ and $E_{{\mathcal M}_n}\circ \hat {E}=E_n\circ E_{{\mathcal M}_n} $ , we have for $\rho \in S({\mathcal M}_n)$
Thus, $T_{n,t} $ has $\alpha $ - $\text {MLSI}$ on ${\mathcal M}_n$ . Note that $T_{n,t}$ is both GNS- $\hat {\phi }$ -symmetric for the extension state $\hat {\phi }$ and also symmetric for the trace $\psi _n$ . Now, we may use the tracial version of the concentration inequality [Reference Gao, Junge and LaRacuente29, Theorem 6.10] that for $x\in {\mathcal M}_n$ ,
Now by the approximation of Lemma 5.11 and independence of $L_\infty ^p({\mathcal N}_n\subset {\mathcal M}_n)$ on the reference state, for $x\in {\mathcal M}$ ,
The other inequality
is clear from definition of $L_\infty ^p({\mathcal N}\subset {\mathcal M},\phi )$ .
For Gaussian type concentration property, we introduce the following definition.
Definition 5.13. For an operator O, we say that
if there exists a projection e such that
The next lemma is a Chebyshev inequality for $\phi $ -weighted $L_p$ norm.
Lemma 5.14. Let $x\in L_p({\mathcal M},\phi )$ and $1<p<\infty $ . Then
Proof. We start with a positive element $x=y^2$ and assume $\|x\|_{p,\phi }=M$ . Then we have
Recall the asymmetric Kosaki $L_p$ -space
and the complex interpolation relation [Reference Junge and Parcet42]
and the relation between real and complex interpolation
By the definition of real interpolation space, for every $s>0$ , we have a decomposition $y \hspace {.1cm} = \hspace {.1cm} y_1+ y_2$ such that
Then by Chebychev’s inequality for the spectral projection $e=e_{[0,a]}(y_2^*y_2)$ , we have
Choose $a=s^{2/p}M$ and deduce that
Then for $t=4a$ and
For an arbitrary x, we may write $x=x_1x_2$ such that
Then for each $s>0$ , we have decomposition
with
We then use the Chebychev inequality for $e=e_{[0,a]}(x_{12}^{*}x_{12}+x_{22}^*x_{22})$ ,
Take $a=s^{2/p}M$ ,
Thus, for $t=4s^{2/p}M$ , by Chebychev’s inequality for e,
Corollary 5.15. Let $T_t=e^{-tL}$ be a GNS- $\phi $ -symmetric quantum Markov semigroup. Suppose $T_t$ satisfies $\alpha $ -MLSI. Then for any $x\in {\mathcal M}$ and $t>0$ ,
where c is a universal constant as in Theorem 5.12.
Proof. By Lemma 5.14 and Theorem 5.12, we have
Minimizing over p gives $p=\frac {1}{e}(\frac {\alpha t}{4c\parallel \! x \! \parallel _{\text {Lip}}})^2$ , which implies
Remark 5.16. In the ergodic case, the above results can be compared to [Reference Rouzé and Datta69, Theorem 8], which states that for self-adjoint $x=x^*$ ,
with a different Lipschitz norm $\parallel \! \cdot \! \parallel _{\tilde {\text {Lip}}}^2$ . Our Corollary 5.15 here uses a more natural definition of the Lipschitz norm and applies to non-ergodic cases. Nevertheless, the projection we have for
is not necessarily a spectral projection $e_{\{|x-E(x)|>t\}}$ and will depend on the state $\phi $ .
Remark 5.17. In the operator valued setting, let ${\mathcal Q}$ be any finite von Neumann algebra and $T_t\otimes \operatorname {\mathrm {\operatorname {id}}}_{{\mathcal Q}}$ be the amplification semigroup on ${\mathcal Q}\overline {\otimes } {\mathcal M}$ . The conditional expectation for $T_t\otimes \operatorname {\mathrm {\operatorname {id}}}_{{\mathcal Q}}$ is $E\otimes \operatorname {\mathrm {\operatorname {id}}}_{{\mathcal Q}}$ . Note that by Lemma 4.17, $T_t\otimes \operatorname {\mathrm {\operatorname {id}}}_{{\mathcal Q}}$ is GNS-symmetric to the product state $ \phi \otimes \sigma $ , for any state $\sigma \in S({\mathcal Q})$ and any invariant state $\phi \in E_*(S({\mathcal M}))$ . This means we obtain
for any product state $\phi \otimes \sigma $ of this specific form. The projection of course depends on both $\phi $ and $\sigma $ .
We illustrate our result with a special case as matrix concentration inequalities.
Example 5.18 (Matrix concentration inequality).
Let $S_1,\cdots ,S_n$ be an independent sequence of random $d\times d$ -matrices $S_1,\cdots ,S_n$ such that
Tropp in [Reference Tropp75, Corollary 6.1.2] proved the following matrix Bernstein inequality that for the sum ${Z=\sum _{k=1}^nS_k}$ ,
and the matrix Chernoff bound
where
Now to apply our results, we recall that the depolarizing semigroup with generator $L(f):=(I-E_\mu )(f)=f-\mu (f)\textbf {1}_\Omega $ on any probability space $(\Omega ,\mu )$ has $\alpha _c\ge \frac {1}{2}$ (a simple fact by convexity of relative entropy). For a random matrix $f:\Omega \to {\mathbb M}_d$ , the Lipschitz norm is
where $\hat {f}=f-E_\mu (f)$ is the mean zero part.
Now we consider for each $k=1,\cdots , n$ , $S_k:\Omega _k\to M_d$ as a random matrix on $(\Omega _k,\mu _k)$ . Then on the product space $(\Omega ,\mu )=(\Omega _1,\mu _1)\times \cdots \times (\Omega _n,\mu _n)$ , we have by Theorem 5.12 for $Z=\sum _{k}S_k$
where $\parallel \! \cdot \! \parallel _{p}$ is the p-norm for the normalized trace ( ${\text {tr}}(1)=1$ ). Applying (5.9) and optimizing p gives
For the matrix Chernoff bound, we use Corollary 5.15
6 Final discussion
1. Positivity and complete positivity. The central quantity in this work is the CB return time $t_{cb}$ and $k_{cb}$ defined via complete positivity. Alternatively, one can consider positive maps and positivite mixing time. Indeed, the entropy difference Lemma 2.1
holds for a positive unital trace-preserving map $\Phi $ . This is because the operator concavity
of the logarithmic function holds for any unital positive map $\Phi $ [Reference Choi18], and the monotonicity of relative entropy
was proved for any positive trace-preserving map $\Phi $ in [Reference Müller-Hermes and Reeb56] (see also [Reference Frenkel27]). Thus, both inequalities used in the proof of Lemma 2.1 hold for positive maps. Also, the conditions in Lemma 2.3 also only require positivity order
where $\Phi \ge \Psi $ means $\Phi -\Psi $ is a positive map but not necessarily completely positive. Combining these two relaxed lemmas for positive maps, we have an analog of Theorem 1.1.
Theorem 6.1. i) For a positive unital trace-preserving map $\Phi :{\mathcal M}\to {\mathcal M}$ ,
ii) For a trace symmetric positive unital semigroup $T_t=e^{-tL}:{\mathcal M}\to {\mathcal M}$ ,3
Applying the above theorem to $\Phi \otimes \operatorname {\mathrm {\operatorname {id}}}_{\mathcal {Q}}$ and $T_t\otimes \operatorname {\mathrm {\operatorname {id}}}_{\mathcal {Q}}$ for any finite von Neumann algebra $\mathcal {Q}$ actually yields our main Theorem 1.1 for trace symmetric cases. It remains open whether this observation holds for GNS-symmetric cases.
Problem 6.2. Does Theorem 6.1 with positivity conditions hold for GNS-symmetric cases?
The obstruction is that in the Haagerup reduction, we need the complete positivity and CB return time $k_{cb}(\Phi )$ of $\Phi $ to imply positivity and positivity mixing time $k(\Phi )$ of the extension $\hat {\Phi }$ , similar for the semigroup $T_t$ . One possible approach is to avoid using Haagerup reduction, and prove Lemma 4.6 directly.
The comparison between positivity and complete positivity has a deep root in the entanglement theory of quantum physics (see [Reference Chruściński and Pascazio20]). From the mathematical point of view, although the positivity looks a more flexible condition, it lacks connection to CB norms as Proposition 3.4. Indeed, there is no non-complete analog of Choi’s theorem [Reference Choi19]
Therefore, despite that the estimate of $\alpha _1(L)$ only requires $t(L)$ , our kernel estimate Proposition 3.7 only applies to $t_{cb}(L)$ .
2. GNS and KMS symmetry. Both GNS-symmetry and KMS-symmetry are noncommutative generalizations for the detailed balance condition of classical Markov chains. As observed in [Reference Carlen and Maas15], GNS-symmetry is the strongest generalization of detail balance condition, and KMS is the weakest, which means the assumption of GNS-symmetry is the most restrictive. It is natural to ask whether our main results (c.f. Theorem 4.10 & 4.8) can be obtained for KMS-symmetric channels or semigroups.
Problem 6.3. Do entropy decay results Theorem 4.10 and 4.8 or the entropy difference Lemma 4.6 hold for KMS-symmetric maps?
The key property of a GNS-symmetric map $\Phi $ is the commutation with modular group $\Phi \circ \alpha _t^\phi =\alpha _t^\phi \circ \Phi $ . This has been used to ensure the compatibility of Haagerup reduction with channel and semigroups (see Lemma 4.5). One can ask whether the same commuting diagram Figure 1 can be obtained for KMS Markov maps. That will allow us to use Haagerup reduction to obtain the entropy difference Lemma (4.6) for KMS-symmetric channels. Another approach is, again, to avoid using Haagerup reduction and prove the KMS-case directly. At the moment of writing, this is not unclear to us even on finite dimensional matrix algebras.
From a mathematical physics perspective, it is also interesting to explore the relative entropy decay beyond GNS symmetry. For instance, one has a Lindbladian of the form $x \mapsto i[h, x] + L(x)$ such that L is GNS-symmetric and the adjoint action $\text {ad}(e^{-iht})$ commutes with L. Then the associated semigroup is $e^{-iht}e^{-tL}(\cdot )e^{iht}$ , which has the same entropy decay as $e^{-tL}$ . Such Lindbladians are considered in [Reference LaRacuente48]. Indeed, there is also numerical and theoretical evidence that adding an nonzero Hamiltonian part can destroy the exponential entropy decay. We refer to [Reference LaRacuente48] for more discussion on entropy decay beyond symmetry conditions.
3. MLSI and CMLSI constant. By the results of this work and also previous works [Reference Li, Junge and LaRacuente49, Reference Brannan, Gao and Junge14, Reference Gao and Gordina28], we now know the positivity of CMLSI constant $\alpha _c>0$ for many cases of classical Markov semigroups with the (non-complete) MLSI constant $\alpha>0$ . That is, $\alpha \ge \alpha _c>0$ for
-
i) finite Markov chains [Reference Li, Junge and LaRacuente49, Reference Gao and Gordina28];
-
ii) heat semigroups on manifold with curvature lower bound [Reference Brannan, Gao and Junge14];
-
iii) sub-Laplacians of Hörmander system on a compact Riemannian manifold.
It remains open whether MLSI constant $\alpha $ and CMLSI constant $\alpha _c$ coincide for classical semigroups. This would be in the similar spirit that the bounded norm (resp. positivity) and the complete bounded norm (resp. complete positivity) coincide for a classical map on $L_\infty (\Omega ,\mu )$ .
Problem 6.4. Does $\alpha =\alpha _c$ for a classical symmetric Markov semigroup $T_t:L_\infty (\Omega ,\mu )\to L_\infty (\Omega ,\mu )$ ?
For a quantum Markov semigroup, a counterexample is the qubit depolarizing semigroup
which has $\frac {1}{2}\le \alpha _c(T_t)<\alpha (T_t)=1$ because of entangled states [Reference Brannan, Gao and Junge14, Section 4.3]. It is natural to ask whether $\alpha _c<\alpha $ also holds for classical depolarizing channel.
Another interesting example is the heat semigroup on the unit torus $\mathbb {T}=\{z\in \mathbb {C}\hspace {.1cm} | \hspace {.1cm} |z|=1\}$ ,
It was proved by [Reference Weissler78] that $\alpha (P_t)=\lambda (P_t)=1$ . The best known bound for CMLSI is $\alpha _c(P_t)\ge \frac {1}{6}$ . It is open whether the gap can be closed.
Problem 6.5. Does the heat semigroup $P_t$ on the torus $\mathbb {T}$ have $\alpha _c(P_t)=\alpha (P_t)=1$ ?
Acknowledgements
LG is grateful to the support of NSF grant DMS-2154903 and AMS-Simons Travel Grants. Nicholas LaRacuente was supported as an IBM Postdoc at The University of Chicago. MJ was partially supported by NSF Grant DMS-2247114 and NSF RAISE-TAQS 1839177. HL acknowledges support by the DFG cluster of excellence 2111 (Munich Center for Quantum Science and Technology).