1 Early history
The term gyrokinetics was first used by Catto & Tsang (Reference Catto and Tsang1977) slightly before the gyrokinetic change of variables was introduced by Catto (Reference Catto1978). Catto & Tsang (Reference Catto and Tsang1977) derived a linearized collision operator for perpendicular wavelengths comparable to a gyroradius by using a radial WKB (Wentzel–Kramer–Brillouin) or eikonal approximation for the waves. Their procedure made use of the electrostatic efforts of Rutherford & Frieman (Reference Rutherford and Frieman1968) and Taylor & Hastie (Reference Taylor and Hastie1968) that treated small perpendicular wavelengths by using an eikonal technique rather than employing a gyrokinetic change of variables. However, the Catto (Reference Catto1978) gyrokinetic change of variables streamlined these earlier treatments of magnetized plasmas and was adopted and appreciated even in the era before codes (BC) since it avoided the need for complicated trajectory integral solutions of the kinetic equation. Indeed, the new gyrokinetic variables retained gyration and drift effects while removing fast time variation associated with gyrofrequency effects. Moreover, the gyrokinetic equation reduced to the Hazeltine (Reference Hazeltine1973) drift kinetic equation in the long wavelength limit.
The gyrokinetic change of variables removes the gyrophase as a velocity space variable by averaging out the rapid gyration motion associated with gyrofrequency time scales. It allows elongated structure along magnetic field lines for low frequency fluctuations as observed in magnetically confined and space plasmas, while allowing short, gyroradius wavelengths across the magnetic field lines. The remaining two velocity space variables enter as the magnetic moment adiabatic invariant and a near constant of the motion energy for typical analytic calculations, while turbulence simulations normally employ a parallel velocity variable along with the magnetic moment. The gyrokinetic orderings are consistent with equilibrium descriptions, low frequency electromagnetic stability, and the evaluation of collisional fluxes, as well as turbulent transport. Recent advances are allowing some simulations to be performed on the much longer turbulent and collisional transport time scales needed to evolve radial profiles with momentum transport retained to properly determine the axisymmetric radial electric field in a tokamak.
An early electromagnetic gyrokinetic treatment neglecting compressional effects appeared shortly after gyrokinetic variables were introduced (Hitchcock & Hazeltine Reference Hitchcock and Hazeltine1978), and soon after fully electromagnetic linear descriptions appeared, both without the use of gyrokinetic variables (Antonsen & Lane Reference Antonsen and Lane1980) and with the use of gyrokinetic variables (Catto, Tang & Baldwin Reference Catto, Tang and Baldwin1981). In addition, nonlinear electromagnetic treatments appeared that employed unperturbed gyrokinetic variables in tokamak geometry (Hitchcock & Hazeltine Reference Hitchcock and Hazeltine1978; Frieman & Chen Reference Frieman and Chen1982), and nonlinear electrostatic formulations in slab geometry (Lee Reference Lee1983) retaining perturbed gyrokinetic variables (Dubin et al.
Reference Dubin, Krommes, Oberman and Lee1983). Hahm, Lee & Brizard (Reference Hahm, Lee and Brizard1988) extended the perturbed gyrokinetic variables to nonlinear electromagnetics in a uniform magnetic field
$\boldsymbol{B}$
, and Hahm (Reference Hahm1988) and Brizard (Reference Brizard1989) made nonlinear generalizations to tokamak geometry.
Two early codes appeared in the 1980s as well. The first was the linear continuum eigenvalue gyrokinetic code FULL (Rewoldt, Tang & Chance Reference Rewoldt, Tang and Chance1982), and the second was the nonlinear gyrokinetic particle-in-cell (PIC) code of Lee (Reference Lee1983) in slab geometry. Years later, Kotschenreuther, Rewoldt & Tang (Reference Kotschenreuther, Rewoldt and Tang1995) built the first initial value, fully implicit linear delta f gyrokinetic code in tokamak geometry GKS (for GyroKinetic Stability), with delta f indicating that only the correction to a Maxwellian was being found gyrokinetically. GKS was extended by Dorland et al. (Reference Dorland, Jenko, Kotschenreuther and Rogers2000) to become the first electromagnetic nonlinear continuum delta f gyrokinetic tokamak turbulence code GS2. GS2 also retained trapped and passing particles and was fully parallelized.
The earliest nonlinear delta f flux tube PIC codes were electrostatic (Sydora Reference Sydora1995; Dimits et al.
Reference Dimits, Williams, Byers and Cohen1996). The later work developed the code that became PG3EQ and discovered an important discrepancy between gyrokinetic and gyrofluid descriptions of ion temperature gradient (ITG) turbulence. It revealed the key role of zonal flow, which is a robust, temporally varying
$\boldsymbol{E}\times \boldsymbol{B}$
or electric field
$\boldsymbol{E}$
drift due to an axisymmetric radial electric field with strong radial variation, but no poloidal variation. It became apparent that zonal flow acts to regulate and limit ITG turbulent transport, especially near marginal stability. This insight led to the understanding that short wavelength turbulence gives rise to nonlinear beating that deposits some energy into the axisymmetric radial electric field variation whose poloidal variation damps to a residual steady state level to become the zonal flow. Further insights into zonal flow behaviour (absent from gyrofluid codes of the time) came from code simulations and comparisons (Dimits et al.
Reference Dimits, Bateman, Beer, Cohen, Dorland, Hammett, Kim, Kinsey, Kotschenreuther and Kritz2000), leading to an understanding that there is a nonlinear Dimits shift away from the ITG linear stability threshold. The Dimits shift occurs because the generated zonal flow nonlinearly suppresses ITG turbulence near marginality. As a result, the onset of large turbulent transport is delayed and shifted to a nonlinear threshold value higher than the linear prediction.
An analytic linear initial value check was developed by Rosenbluth & Hinton (Reference Rosenbluth and Hinton1998) to verify whether the ion polarization effects associated with magnetic drift are being properly retained in gyrokinetic codes. It yields a non-zero residual level of radial electric field (and thereby zonal flow) when the electric field is linearly perturbed. Later, Sugama & Watanabe (Reference Sugama and Watanabe2006) successfully described the transient damping of poloidal variation by geodesic acoustic modes (GAMs) to a residual steady state zonal flow. Other local delta f gyrokinetic continuum codes soon became available, like GYRO (Candy & Waltz Reference Candy and Waltz2003), GENE (Dannert & Jenko Reference Dannert and Jenko2005), GKV (Sugama & Watanabe Reference Sugama and Watanabe2006) and GKW (Peeters et al. Reference Peeters, Strintzi, Camenen, Angioni, Casson, Hornsby and Snodin2009), and even global PIC codes such as ELMFIRE (Heikkinen et al. Reference Heikkinen, Kiviniemi, Kurki-Suonio, Peeters and Sipilä2001). Additional global delta f PIC codes like GTC (Lin et al. Reference Lin, Ethier, Hahm and Tang2002), GEM (Chen & Parker Reference Chen and Parker2003), GTS (Wang et al. Reference Wang, Lin, Tang, Lee, Ethier, Lewandowski, Rewoldt, Hahm and Manickam2006) and ORB5 (Jolliet et al. Reference Jolliet, Bottino, Angelino, Hatzky, Tran, McMillan, Sauter, Appert, Idomura and Villard2007), were developed soon after GS2, PG3EQ and ELMFIRE.
2 Recent theoretical insights
Intrinsic ambipolarity means that the radial electric field cannot be determined from
$\langle \unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{J}\rangle =0$
, or, upon integrating, from
$\langle \boldsymbol{J}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle =0$
, since this constraint is automatically or intrinsically satisfied within a flux function in the electrostatic potential that is determined by momentum transport. Here
$\boldsymbol{J}$
is the current density and
$\langle \cdots \rangle$
denotes a flux surface average with
$\unicode[STIX]{x1D713}$
the poloidal flux function. Sugama et al. (Reference Sugama, Okamoto, Horton and Wakatani1996) demonstrated that neoclassical and turbulent transport are separately intrinsically ambipolar in a tokamak. Although their treatment is fully electromagnetic, some details relied on using a quasilinear treatment and Krook collision operator. These limitations where removed electrostatically by Parra & Catto (Reference Parra and Catto2009). Appendix A of Sugama & Horton (Reference Sugama and Horton1998) also showed intrinsic ambipolarity electromagnetically in a tokamak with sonic toroidal flow within a quasilinear treatment of the Onsager symmetry. Intrinsic ambipolarity implies that the radial electric field is set by the need to satisfy toroidal angular momentum conservation in an turbulent tokamak as suggested by Catto et al. (Reference Catto, Simakov, Parra and Kagan2008) and verified by Parra & Catto (Reference Parra and Catto2008), as well as the need to satisfy important symmetry properties of the gyrokinetic equation (Parra, Barnes & Catto Reference Parra, Barnes and Catto2011; Sugama et al.
Reference Sugama, Watanabe, Nunami and Nishimura2011).
Until 2008 it was widely assumed that the gyrokinetic equation and quasineutrality were all that was required to obtain a complete electrostatic solution. However, by much more carefully verifying the suggestion of Catto et al. (Reference Catto, Simakov, Parra and Kagan2008), Parra & Catto (Reference Parra and Catto2008) demonstrated that much greater care was required to describe momentum transport in a tokamak, and that very small errors would lead to fake torques (Parra & Catto Reference Parra and Catto2010b ,Reference Parra and Catto c ) and result in unphysical momentum transport. The controversy that ensued seems to have subsided as the result of more general gyrokinetic descriptions of turbulent momentum transport (Parra & Catto Reference Parra and Catto2010a ,Reference Parra and Catto b ; Parra et al. Reference Parra, Barnes and Catto2011; Parra et al. Reference Parra, Barnes, Calvo and Catto2012; Parra & Barnes Reference Parra and Barnes2015a ,Reference Parra and Barnes b ). However, the numerical implementation of a completely general description of momentum transport to the requisite order remains incomplete. The remaining subtlety is that delta f descriptions rely on scale separation, and therefore, require certain global features in the vicinity of a flux surface (Parra & Barnes Reference Parra and Barnes2015a ). Full f descriptions avoid scale separation, but cannot yet be formulated to high enough order in the gyrokinetic variables (Parra et al. Reference Parra, Barnes, Calvo and Catto2012). For example, they are roughly equivalent to trying to solve the short mean free path, long wavelength limit of the gyrokinetic equation to very high order to avoid using a Chapman–Enskog approach.
3 Kinetic equation in a tokamak
To keep the presentation as simple as possible while retaining geometric effects, the electrostatic limit of an axisymmetric tokamak is considered first. The magnetic field is taken as

with
$\unicode[STIX]{x1D713}$
,
$\unicode[STIX]{x1D701}$
and
$\unicode[STIX]{x1D717}$
the poloidal flux function (with
$2\unicode[STIX]{x03C0}\unicode[STIX]{x1D713}$
the poloidal flux), the toroidal angle variable, and
$\unicode[STIX]{x1D717}$
the poloidal angle variable in straight field line coordinates (
$\boldsymbol{B}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}=I/qR^{2}$
), and where
$q=q(\unicode[STIX]{x1D713})$
is the safety factor and
$I=I(\unicode[STIX]{x1D713})=RB_{t}$
. Here
$R$
is the major radius and
$B_{t}$
is the toroidal magnetic field. The Clebsch representation uses
$\unicode[STIX]{x1D6FC}=\unicode[STIX]{x1D701}-q\unicode[STIX]{x1D717}$
.
The Fokker–Planck equation written in the spatial and velocity space variables
$\boldsymbol{r},\boldsymbol{v},t$
is

where
$f=f(\boldsymbol{r},\boldsymbol{v},t)$
is the distribution function,
$\boldsymbol{E}=-\unicode[STIX]{x1D735}\unicode[STIX]{x1D6F7}$
is the electric field in the electrostatic limit, and
$C$
denotes the Fokker–Planch collision operator. Also,
$Z$
is the charge number and
$M$
is the mass, with
$e$
the charge on a proton and
$c$
the speed of light. The mass ratio difference implies that it is usually convenient to think of the kinetic equation as being for the ions as electrons more closely follow field lines.
The axisymmetry of the magnetic field means that it will at times be convenient to employ the canonical angular momentum (
$-Ze\unicode[STIX]{x1D713}_{\ast }/c$
) in the form

where
$\unicode[STIX]{x1D6FA}=ZeB/Mc$
and
$v_{\Vert }=\boldsymbol{b}\boldsymbol{\cdot }\boldsymbol{v}$
. A departure of
$\unicode[STIX]{x1D6F7}$
from axisymmetry means that
$\unicode[STIX]{x1D713}_{\ast }$
is no longer a constant of the motion since

where
$\boldsymbol{a}\boldsymbol{\cdot }\unicode[STIX]{x1D735}(R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701})\boldsymbol{\cdot }\boldsymbol{a}=0$
for an arbitrary vector
$\boldsymbol{a}$
is used. The unperturbed vector potential
$\text{}\underline{\boldsymbol{A}}$
is related to the flux function by
$\unicode[STIX]{x1D713}=\unicode[STIX]{x1D713}(\boldsymbol{r},t)=-R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\boldsymbol{\cdot }\text{}\underline{\boldsymbol{A}}$
and
$\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D713}=\unicode[STIX]{x1D735}\times \text{}\underline{\boldsymbol{A}}$
.
Similarly, the total energy,

is not a constant of the motion because of the time variation of
$\unicode[STIX]{x1D6F7}$
, which gives

The Fokker–Planck kinetic equation is the starting point of all gyrokinetic (and drift kinetic) treatments.
4 Gyrokinetic orderings and assumptions
The gyrokinetic orderings assume a strongly magnetized plasma so the ion gyroradius
$\unicode[STIX]{x1D70C}_{i}$
is small compared to the minor radius
$a$
of the device, that is,

with
$\unicode[STIX]{x1D70C}_{i}=v_{i}/\unicode[STIX]{x1D6FA}_{i}$
,
$v_{i}=(2T_{i}/M_{i})^{1/2}$
the ion thermal speed and
$\unicode[STIX]{x1D6FA}_{i}=ZeB/M_{i}c$
the ion cyclotron frequency. The minor radius
$a$
is assumed to be a typical radial scale length, and is normally assumed much larger than the poloidal gyroradius
$\unicode[STIX]{x1D70C}_{pi}=Bv_{i}/B_{p}\unicode[STIX]{x1D6FA}_{i}\simeq qR\unicode[STIX]{x1D70C}_{i}/a$
. The electrons are even more strongly magnetized.
In addition, for turbulent and collisional transport in tokamaks the frequencies,
$\unicode[STIX]{x1D714}$
, of interest are normally assumed to be small compared to the ion cyclotron frequency. The typical frequencies of interest are near the diamagnetic drift frequency

associated with drift waves, where
$k_{\bot }$
is a typical perpendicular wavenumber. The ion and electron drift frequencies are comparable as
$\unicode[STIX]{x1D70C}_{i}v_{i}\sim \unicode[STIX]{x1D70C}_{e}v_{e}$
, with
$\unicode[STIX]{x1D70C}_{e}$
and
$v_{e}$
the electron gyroradius and thermal speed.
In the core of a tokamak collisions are normally weak so it is convenient to assume that the poloidal gyroradius is small compared to the ion mean free path
$\unicode[STIX]{x1D706}=v_{i}/\unicode[STIX]{x1D708}_{ii}$
,

where
$\unicode[STIX]{x1D708}_{ii}$
is the ion–ion collision frequency, and ions and electrons have comparable mean free paths so no distinction is made.
Gyrokinetics allows

by expanding in the small parameter

Moreover,
$k_{\bot }\unicode[STIX]{x1D70C}_{e}\sim 1$
can be allowed so electron temperature gradient modes can be treated as well. In addition, the fluctuations are assumed to have parallel wavenumbers
$k_{\Vert }$
such that

where
$qR$
is referred to as the connection length. Notice that
$\unicode[STIX]{x1D70C}_{pi}/\unicode[STIX]{x1D706}=qR\unicode[STIX]{x1D70C}_{i}/\unicode[STIX]{x1D706}a$
implies the general collisional ordering
$\unicode[STIX]{x1D706}\sim qR$
is allowed for both ions and electrons. These orderings are compatible with the banana regime ordering
$\unicode[STIX]{x1D706}\gg qR$
in the core of a tokamak in which parallel streaming dominates over collisions. Moreover, they permit charges to stream faster along a field line than they magnetically drift poloidally in a flux surface, that is,
$qR/v_{i}\ll a^{2}/\unicode[STIX]{x1D70C}_{i}v_{i}$
, since
$\unicode[STIX]{x1D70C}_{pi}/a\ll 1$
. Only on the longer collisional or turbulent transport scale do charges transport across flux surfaces.
It is normally useful to write the electrostatic potential as

where
$\bar{\unicode[STIX]{x1D6F7}}$
and
$\tilde{\unicode[STIX]{x1D6F7}}$
, respectively, are slowly and rapidly varying functions of space and time. For a sonic (
${\sim}v_{i}$
) electric field drift
$Ze\bar{\unicode[STIX]{x1D6F7}}/T\sim 1/\unicode[STIX]{x1D6FF}\gg 1$
. Normally, however, the drift due to the electric field is at the level of the diamagnetic drift speed (
${\sim}\unicode[STIX]{x1D6FF}v_{i}$
) giving
$Ze\bar{\unicode[STIX]{x1D6F7}}/T\sim 1$
.
It is also often convenient to employ scale separation to split the distribution function up into slow and fast spatio-temporal portions by writing

Typically
$\unicode[STIX]{x2202}\tilde{f}/\unicode[STIX]{x2202}t\sim \unicode[STIX]{x1D714}_{\ast }\tilde{f}$
and
$\unicode[STIX]{x2202}\tilde{\unicode[STIX]{x1D6F7}}/\unicode[STIX]{x2202}t\sim \unicode[STIX]{x1D714}_{\ast }\tilde{\unicode[STIX]{x1D6F7}}$
, with
$\bar{f}$
and
$\bar{\unicode[STIX]{x1D6F7}}$
varying approximately on the transport scale of gyroDohm diffusion
$D_{gB}\sim \unicode[STIX]{x1D70C}_{i}^{2}v_{i}/a$
, that is,
$\bar{f}^{-1}\unicode[STIX]{x2202}\bar{f}/\unicode[STIX]{x2202}t\sim D_{gB}/a^{2}$
.
Moreover, it is useful to order

which is consistent with the so-called mixing length estimate
$\unicode[STIX]{x1D735}_{\bot }\,\bar{f}\sim \unicode[STIX]{x1D735}_{\bot }\,\tilde{f}$
. The preceding ordering for
$\tilde{f}/\bar{f}\sim Ze\tilde{\unicode[STIX]{x1D6F7}}/T$
was introduced by Dimits, Lodestro & Dubin (Reference Dimits, Lodestro and Dubin1992). Using it gives the fluctuating electric drift to be of order

Also, rapid temporal variation on the diamagnetic drift frequency scale, and slow temporal variation on the gyroBohm transport scale gives
$(\unicode[STIX]{x2202}\bar{f}/\unicode[STIX]{x2202}t)/(\unicode[STIX]{x2202}\tilde{f}/\unicode[STIX]{x2202}t)\sim (\unicode[STIX]{x2202}\bar{\unicode[STIX]{x1D6F7}}/\unicode[STIX]{x2202}t)/(\unicode[STIX]{x2202}\tilde{\unicode[STIX]{x1D6F7}}/\unicode[STIX]{x2202}t)\sim \unicode[STIX]{x1D6FF}$
for

5 Gyrokinetic variables
When transforming to gyrokinetic variables there are a few choices for the velocity space variables, but the usual choice for the spatial variation is the mixed variable that relates the guiding centre or gyrocentre location
$\boldsymbol{R}$
to the particle position
$\boldsymbol{r}$
, namely,

with
$\unicode[STIX]{x1D746}$
the gyroradius vector. The velocity variable is defined in cylindrical velocity space variables
$v_{\bot },\unicode[STIX]{x1D711},v_{\Vert }$
such that the charge spirals about the magnetic field line as it streams along it to lowest order. Then

with
$\unicode[STIX]{x1D711}$
the gyrophase, and
$\boldsymbol{e}_{\unicode[STIX]{x1D713}}=\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}/|\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}|$
and
$\boldsymbol{e}_{\times }=\boldsymbol{b}\times \boldsymbol{e}_{\unicode[STIX]{x1D713}}$
orthonormal unit vectors.
There are three main reasons why this gyrokinetic change of variables is useful. The first is that to lowest order the guiding centre tries to stream along the magnetic field, that is,

To obtain the preceding, the velocity space variable result
$\unicode[STIX]{x1D735}_{v}\boldsymbol{v}=\overset{\leftrightarrow }{I}$
is employed, just as in spatial variables
$\unicode[STIX]{x1D735}\boldsymbol{r}=\overset{\leftrightarrow }{I}$
. A convenient form of the unit dyad is
$\overset{\leftrightarrow }{I}=\boldsymbol{e}_{\unicode[STIX]{x1D713}}\boldsymbol{e}_{\unicode[STIX]{x1D713}}+\boldsymbol{e}_{\times }\boldsymbol{e}_{\times }+\boldsymbol{b}\boldsymbol{b}$
.
The preceding means that the gyrokinetic variable
$\boldsymbol{R}$
allows the ordering

Consequently, unlike drift kinetics (Hazeltine Reference Hazeltine1973), gyrokinetics is able to retain short perpendicular wavelengths by allowing
$\boldsymbol{k}_{\bot }\boldsymbol{\cdot }\boldsymbol{v}_{\bot }\sim \unicode[STIX]{x1D6FA}$
.
It is convenient to employ scale separation to assume
$\boldsymbol{B}$
,
$\bar{\unicode[STIX]{x1D6F7}}$
and
$\bar{f}$
are slow functions of space such that Taylor expansions may be employed to obtain


and

where the leading correction to the expansion of
$\bar{f}$
is needed to treat diamagnetic effects.
The fluctuating quantities
$\widetilde{\unicode[STIX]{x1D6F7}}$
and
$\widetilde{f}$
contain rapid spatial variation and need not be Taylor expanded. This feature is another important difference between gyrokinetics and drift kinetics. For example, if

then a gyroaverage at fixed gyrocenter location
$\boldsymbol{R}$
,

may be employed to obtain

As a result, the gyroaverage of the electrostatic potential holding the guiding centre fixed differs from the electrostatic potential at fixed particle location, that is,
$\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}\neq \tilde{\unicode[STIX]{x1D6F7}}(\boldsymbol{r})$
.
The third key reason that the gyrokinetic change of variables is so useful is found by working a bit harder. Assuming a steady state or weakly time varying magnetic field,

Using
$\langle \boldsymbol{v}\rangle _{R}=v_{\Vert }\boldsymbol{b}$
and
$\langle \boldsymbol{v}\boldsymbol{v}\rangle _{R}=(v_{\bot }^{2}/2)(\overset{\leftrightarrow }{I}-\boldsymbol{b}\boldsymbol{b})+v_{\Vert }^{2}\boldsymbol{b}\boldsymbol{b}$
gives the important result

where the gyroaveraged electric drift is

and the magnetic drift is as in drift kinetics

while the parallel velocity correction is

To more easily obtain
$u_{\Vert }$
and the
$\unicode[STIX]{x1D735}B$
and curvature drifts, it is useful to notice that

and then use
$\unicode[STIX]{x1D735}\times (\unicode[STIX]{x1D6FA}^{-1}\boldsymbol{b})=\unicode[STIX]{x1D6FA}^{-1}(\unicode[STIX]{x1D735}\times \boldsymbol{b}+\boldsymbol{b}\times \unicode[STIX]{x1D735}\ell nB)$
and
$(\overset{\leftrightarrow }{I}-\boldsymbol{b}\boldsymbol{b})\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}=\boldsymbol{b}\times (\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{b})$
to find

The combination

appearing in the canonical angular momentum is the radial gyrokinetic variable, with the poloidal and toroidal gyrokinetic variables similarly defined as

and

The drift kinetic canonical angular momentum is

a constant of the motion of the drift kinetic equation. Occasionally, it is convenient to use the canonical angular momentum
$\unicode[STIX]{x1D713}_{\ast }$
as the radial variable rather than the gyrokinetic variable
$\unicode[STIX]{x1D6F9}$
(Kagan & Catto Reference Kagan and Catto2008, Reference Kagan and Catto2010).
Analytically, it is usually most convenient to use the velocity space variables of total energy,

magnetic moment,

and gyrophase
$\unicode[STIX]{x1D711}$
.
In these variables a so-called full f form of the gyrokinetic equation can be obtained by gyroaveraging

For slow temporal evolution and global variation of the magnetic field


and, differentiating
$\boldsymbol{v}\boldsymbol{\cdot }\boldsymbol{e}_{\unicode[STIX]{x1D713}}=v_{\bot }\cos \unicode[STIX]{x1D711}$
and
$\boldsymbol{v}\boldsymbol{\cdot }\boldsymbol{e}_{\times }=v_{\bot }\sin \unicode[STIX]{x1D711}$
,

These last three expressions are the same as for drift kinetics. However, gyroaveraging holding
$\boldsymbol{R}$
fixed instead of
$\boldsymbol{r}$
, gives the lowest-order expressions


and

where the lowest-order result
$\boldsymbol{v}_{\bot }\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D6F7}=(\unicode[STIX]{x2202}\boldsymbol{v}/\unicode[STIX]{x2202}\unicode[STIX]{x1D711})\boldsymbol{\cdot }\boldsymbol{b}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D6F7}=-\unicode[STIX]{x1D6FA}\unicode[STIX]{x2202}\unicode[STIX]{x1D6F7}/\unicode[STIX]{x2202}\unicode[STIX]{x1D711}|_{R}$
and
$Ze\bar{\unicode[STIX]{x1D6F7}}/T\sim 1$
are used, and
$\langle \boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{e}_{\unicode[STIX]{x1D713}}\boldsymbol{\cdot }\boldsymbol{e}_{\times }+v_{\Vert }v_{\bot }^{-2}\boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{b}\times \boldsymbol{b}\boldsymbol{\cdot }\boldsymbol{v}\rangle _{R}\simeq v_{\Vert }(2\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{e}_{\unicode[STIX]{x1D713}}\boldsymbol{\cdot }\boldsymbol{e}_{\times }+\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b})/2\sim v_{\Vert }/R\ll \unicode[STIX]{x1D6FA}$
varies on the scale of the magnetic field so is small. Expanding f in
$\unicode[STIX]{x1D6FF}$
by writing
$f=f^{(0)}+f^{(1)}+\cdots \,,$
taking
$-\unicode[STIX]{x1D6FA}\unicode[STIX]{x2202}f^{(0)}/\unicode[STIX]{x2202}\unicode[STIX]{x1D711}=0$
to lowest order, and then gyroaveraging the next-order equation to annihilate the
$f^{(1)}$
term, gives the simplest full f gyrokinetic equation to be

Unfortunately, this direct approach is risky since the result contains terms of a few different orders as noted below the equation (recall
$k_{\bot }a\gg 1$
and
$\unicode[STIX]{x1D708}\ll \unicode[STIX]{x1D714}_{\ast }$
), and it ignores higher-order corrections from
$f^{(1)}$
and the gyrokinetic change of variables. In addition, most of
$f^{(0)}$
is normally a Maxwellian. Consequently, the derivation of the gyrokinetic equation will be refined in the next sections by being more clever and deriving what is referred to as a local or
$\unicode[STIX]{x1D6FF}f=\tilde{f}$
gyrokinetic equation. The derivation takes advantage of the scale separation between the fluctuations and the background.
6 Gyrokinetics for the slowly varying portion of the distribution function
When the unperturbed distribution function is slowly varying in time and space with
$\unicode[STIX]{x2202}\bar{f}/\unicode[STIX]{x2202}\unicode[STIX]{x1D711}=0$
, only the lowest-order equation

need be solved, as long as
$\unicode[STIX]{x1D708}\bar{f}\gg \unicode[STIX]{x2202}\bar{f}/\unicode[STIX]{x2202}t$
,
$\langle \boldsymbol{v}_{E}\rangle _{R}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\bar{f}/v_{\Vert }\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\bar{f}\sim \boldsymbol{v}_{M}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\bar{f}/v_{\Vert }\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\bar{f}\sim \unicode[STIX]{x1D70C}_{pi}/a\ll 1$
and

The preceding implies that
$v_{\Vert }\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}_{R}\,\bar{f}\simeq v_{\Vert }\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\bar{f}$
and
$\langle C\{\,\bar{f}\}\rangle _{R}\simeq \langle C\{\,\bar{f}\}\rangle _{r}=C\{\langle \,\bar{f}\rangle _{r}\}\simeq C\{\,\bar{f}\}$
. A subscript r on a gyroaverage indicates that it is to be performed at fixed
$\boldsymbol{r}$
rather than fixed
$\boldsymbol{R}$
. As
$\tilde{\unicode[STIX]{x1D6F7}}$
is small compared to
$\bar{\unicode[STIX]{x1D6F7}}$
it will be convenient at times to use the alternate energy variable

Using the lowest-order form of the kinetic equation
$v_{\Vert }\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\bar{f}=C\{\,\bar{f}\}$
, either a flux surface average,

for the passing, or a transit average,

for the trapped, can be employed to annihilate the parallel streaming term. It is convenient to retain the
$\unicode[STIX]{x1D701}$
integrals in the transit average, and also notice
$\overline{v_{\Vert }/B}=1/\langle B/v_{\Vert }\rangle$
for the passing, with
$\overline{v_{\Vert }/B}=0$
for the trapped. The constraints obtained for the passing by flux surface averaging,

and trapped by transit averaging,

must be satisfied by the lowest-order distribution function as is shown next.
To verify that these can only be satisfied by a Maxwellian, multiply the lowest-order kinetic equation by
$\ell n\bar{f}$
and integrate over all velocity space and flux surface average to obtain the Boltzmann H theorem on a flux surface,

The preceding form for the H theorem follows from
$\text{d}^{3}v\rightarrow 2\unicode[STIX]{x03C0}\text{d}\bar{E}\text{d}\unicode[STIX]{x1D707}B/v_{\Vert }$
and

because the flux surface average of the divergence of any vector
$\boldsymbol{Q}$
,

gives

with
$V^{\prime }=\oint \text{d}\unicode[STIX]{x1D717}\text{d}\unicode[STIX]{x1D701}/\boldsymbol{B}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}$
and where
$\boldsymbol{B}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}$
is independent of
$\unicode[STIX]{x1D701}$
. The entropy production only vanishes for a local Maxwellian
$f_{M}$
, for which
$C\{\,\bar{f}\}=0$
and, therefore,
$v_{\Vert }\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\bar{f}=0$
. Consequently, the steady state solution for a stationary confined plasma with well-defined flux or total pressure surfaces is

with the density
$n$
and pseudo-density
$\unicode[STIX]{x1D702}$
related by

In the core of a tokamak
$n$
,
$T$
and
$\bar{\unicode[STIX]{x1D6F7}}$
are lowest-order flux functions. The collision operator must be Galilean invariant so a drifting Maxwellian is also allowed. However, without large momentum input, the ion drift velocity is small and at the diamagnetic level
${\sim}\unicode[STIX]{x1D6FF}v_{i}$
so a stationary Maxwellian can be employed.
The preceding demonstration is valid for ions without any modification since
$C$
is then the ion–ion collision operator. For electrons both like and unlike electron–ion collisions must be retained. However, momentum relaxation occurs on the electron–ion time scale so friction rapidly relaxes the electrons to the same drift velocity as the ions. A speed expansion of the electron–ion collision operator can then be performed and the Galilean invariance of the like collision operator used to determine that the electrons are also Maxwellian to lowest order.
The next-order correction to the Maxwellian in a tokamak is found by using axisymmetry. To find this correction it is convenient to define

so that Taylor expansion gives

For an axisymmetric steady state

since
$\dot{\unicode[STIX]{x1D713}}_{\ast }=0$
and
$\dot{\bar{E}}=0$
. Consequently, letting

and using
$\langle C_{1}\{(\unicode[STIX]{x1D713}_{\ast }-\unicode[STIX]{x1D713})\unicode[STIX]{x2202}f_{M}/\unicode[STIX]{x2202}\unicode[STIX]{x1D713}\}\rangle _{R}\simeq -C_{1}\{(Iv_{\Vert }/\unicode[STIX]{x1D6FA})\unicode[STIX]{x2202}f_{M}/\unicode[STIX]{x2202}\unicode[STIX]{x1D713}\}$
, the gyrokinetic equation for
$\overline{h}$
is

with
$C_{1}$
denoting the linearized collision operator.
To lowest-order collisions are normally weak in the core, giving
$v_{\Vert }\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\overline{h}=0$
. Transit averaging the next-order equation for the trapped (subscript
$t$
) over a full bounce annihilates the streaming term leaving

As there is no drive,

A flux surface average is used to annihilate the streaming term in the next-order equation for the passing (subscript
$p$
) leaving

The solution of this constraint is the neoclassical response
$\bar{h}_{p}=\bar{h}_{p}(\unicode[STIX]{x1D713},\bar{E},\unicode[STIX]{x1D707},\unicode[STIX]{x1D70E})$
responsible for banana regime transport. It must be odd in
$v_{\Vert }=\unicode[STIX]{x1D70E}|v_{\Vert }|$
like the drive. As it depends only on
$\unicode[STIX]{x1D713}$
,
$\bar{E}$
, and
$\unicode[STIX]{x1D707}$
it will be shown later that it does not matter in the gyrokinetic equation for the fluctuations even though
$\bar{h}_{p}\sim (Iv_{\Vert }/\unicode[STIX]{x1D6FA})\unicode[STIX]{x2202}f_{M}/\unicode[STIX]{x2202}\unicode[STIX]{x1D713}\sim (Mc/Ze)R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\boldsymbol{\cdot }\boldsymbol{v}\unicode[STIX]{x2202}f_{M}/\unicode[STIX]{x2202}\unicode[STIX]{x1D713}$
.
Before going on to derive the gyrokinetic equation for the fluctuations some particle transport and ambipolarity considerations are worth stressing.
7 Ambipolarity and particle transport considerations
The convenient way to evaluate the collisional and turbulent particle fluxes is to use the
$R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}$
component of the momentum conservation equation

Forming the
$R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}$
component and including coarse grain averages in time and radius with the flux surface average gives

where the density and flow velocity are defined as
$n=\int \text{d}^{3}vf$
and
$n\boldsymbol{V}=\int \text{d}^{3}v\boldsymbol{v}f$
, and

with
$\unicode[STIX]{x1D714}_{\ast }\unicode[STIX]{x1D70F}\gg 1$
and
$\boldsymbol{k}_{\bot }\boldsymbol{\cdot }\boldsymbol{e}_{\unicode[STIX]{x1D713}}\unicode[STIX]{x1D6FE}\gg |\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}|$
. The coarse grain average removes very fast time and very short radial scale variation. The species pressure
$p=(M/3)\int \text{d}^{3}vv^{2}f=nT$
does not contribute to
$\langle MR^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\boldsymbol{\cdot }\int \text{d}^{3}vf\boldsymbol{v}\boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle$
, making the term small. Indeed, the off-diagonal stress tensor and time derivative only enter at higher order when the radial flux of toroidal angular momentum is evaluated. Therefore, only the turbulent and collisional fluxes survive in

In a quasineutral plasma (
$\unicode[STIX]{x1D6F4}Zen=0$
) this form makes it clear that turbulent and collisional particle transport are separately intrinsically ambipolar, and when added yield

where
$\boldsymbol{J}$
is the current density and
$\unicode[STIX]{x1D6F4}$
is the species sum. This ambipolarity condition is required in a quasineutral plasma for which
$\unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{J}=0$
must satisfy the constraint
$\langle \unicode[STIX]{x1D735}\boldsymbol{\cdot }\boldsymbol{J}\rangle =0$
with the radial current vanishing at the magnetic axis. Overall momentum conservation for Coulomb collisions requires

as first noted by Kovrizhnykh (Reference Kovrizhnykh1969) and Rutherford (Reference Rutherford1970). The form of
$\langle n\boldsymbol{V}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle$
implies that the turbulent particle flux is

and the collisional particle flux is

with the
$\boldsymbol{v}_{\bot }$
the contribution from classical collisions due to gyromotion and the
$v_{\Vert }\boldsymbol{b}$
term the neoclassical portion caused by finite orbit departures from flux surfaces. The separate intrinsic ambipolarity of turbulent (
$\unicode[STIX]{x1D6F4}\unicode[STIX]{x1D6E4}_{\text{turb}}=0$
, using quasineutrality) and collisional (
$\unicode[STIX]{x1D6F4}\unicode[STIX]{x1D6E4}_{\text{coll}}=0$
, using collisional momentum conservation) particle transport in a tokamak was first demonstrated by Sugama et al. (Reference Sugama, Okamoto, Horton and Wakatani1996). Their treatment is fully electromagnetic, but some details relied on using a quasilinear treatment and Krook collision operator. These limitations where removed electrostatically by Parra & Catto (Reference Parra and Catto2009). Appendix A of Sugama & Horton (Reference Sugama and Horton1998) also showed intrinsic ambipolarity electromagnetically in a tokamak with sonic toroidal flow within a quasilinear treatment of the Onsager symmetry.
In a completely axisymmetric tokamak it is necessary to satisfy conservation of total toroidal angular momentum to higher order. Quasineutrality, ambipolarity and total collisional momentum conservation require that the lowest-order flux function behaviour of
$\bar{\unicode[STIX]{x1D6F7}}$
adjusts to satisfy conservation of toroidal angular momentum

where the time derivative term is inserted for completeness. In the absence of a momentum source (as assumed here) the steady state equation reduces to the condition that the radial flux of toroidal angular momentum vanish on each flux surface,

as there is no flux of radial momentum at the magnetic axis. The species sum is dropped as the electron contribution is usually negligible.
8 Gyrokinetic equation for the fluctuations
The conventional
$\unicode[STIX]{x1D6FF}f=\tilde{f}$
gyrokinetic equation is really not an equation for
$\unicode[STIX]{x1D6FF}f=\tilde{f}$
at all. Instead it is the gyrokinetic equation for the non-adiabatic portion of
$\unicode[STIX]{x1D6FF}f=\tilde{f}$
obtained by letting

with

and
$\tilde{h}$
the non-adiabatic response. Expanding the
$E$
dependence about

for small
$\tilde{\unicode[STIX]{x1D6F7}}$
, and expanding
$\unicode[STIX]{x1D713}_{\ast }$
about
$\unicode[STIX]{x1D713}$
yields

Then

with
$-(Ze\tilde{\unicode[STIX]{x1D6F7}}/T)f_{M}$
the adiabatic part or Maxwell–Boltzmann response of
$\tilde{f}$
, and

Inserting
$f=f_{\ast }+h$
into the Fokker–Planck equation and using

gives

where the lowest-order linearized collision operator must have the property that
$C_{1}\{f_{M}\}=0$
. The drive terms for turbulence on the right side are time variation and departure from axisymmetry.
Subtracting out the slowly varying background terms gives the Fokker–Planck equation for the fluctuating response to be

with
$\unicode[STIX]{x1D6F7}=\bar{\unicode[STIX]{x1D6F7}}+\tilde{\unicode[STIX]{x1D6F7}}$
on the left side and
$\tilde{h}=\tilde{h}(\boldsymbol{r},\bar{E},\unicode[STIX]{x1D707},\unicode[STIX]{x1D711},t)$
. Changing to gyrokinetics variables, using
$\tilde{h}=\tilde{h}(\boldsymbol{R},\bar{E},\unicode[STIX]{x1D707},t)$
as
$\unicode[STIX]{x2202}\tilde{h}/\unicode[STIX]{x2202}\unicode[STIX]{x1D711}|_{R}=0$
to lowest order, and then annihilating the next-order equation, yields the desired form of the
$\unicode[STIX]{x1D6FF}f=\tilde{f}$
gyrokinetic equation for the non-adiabatic response to be

where

and

In the linear limit, Laplace transforming in time and Fourier decomposing in toroidal angle gives the replacements
$\unicode[STIX]{x2202}\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}/\unicode[STIX]{x2202}t\rightarrow -\text{i}\unicode[STIX]{x1D714}\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}$
and
$\unicode[STIX]{x2202}\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}/\unicode[STIX]{x2202}\unicode[STIX]{x1D701}\rightarrow -\text{i}\ell \langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}$
. Defining

recovers the usual instability drive
$\unicode[STIX]{x1D714}-\unicode[STIX]{x1D714}_{\ast }^{T}$
of linear gyrokinetics (Catto Reference Catto1978). Notice that
$\unicode[STIX]{x1D714}_{\ast }^{T}$
only depends on the toroidal mode number
$\ell$
.
Any neoclassical response
$\bar{h}_{p}=\bar{h}_{p}(\unicode[STIX]{x1D713},\bar{E},\unicode[STIX]{x1D707},\unicode[STIX]{x1D70E})$
in
$\bar{h}$
is unimportant as the gyroaverage of

gives

since
$\boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\tilde{\unicode[STIX]{x1D6F7}}\simeq \boldsymbol{v}_{\bot }\boldsymbol{\cdot }\unicode[STIX]{x1D735}_{R}\tilde{\unicode[STIX]{x1D6F7}}=-\unicode[STIX]{x1D6FA}\unicode[STIX]{x2202}\tilde{\unicode[STIX]{x1D6F7}}/\unicode[STIX]{x2202}\unicode[STIX]{x1D711}|_{R}$
means
$\langle \boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}\simeq 0$
, and along with
$\langle \boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle _{R}=0$
gives
$\langle \dot{\unicode[STIX]{x1D707}}\rangle _{R}=0$
to lowest order.
When a simple local Fourier decomposition of
$\tilde{\unicode[STIX]{x1D6F7}}$
is used then

giving
$\tilde{h}\propto \text{e}^{\text{i}\boldsymbol{k}\boldsymbol{\cdot }\boldsymbol{R}}$
. Therefore, when perturbed quasineutrality,

is evaluated a second set of Bessel functions appears from

where
$\langle \cdots \rangle _{r}$
is the gyroaverage at fixed
$\boldsymbol{r}$
.
In a stellarator
$\unicode[STIX]{x1D6F9}=\unicode[STIX]{x1D713}+\unicode[STIX]{x1D6FA}^{-1}\boldsymbol{v}\times \boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}$
is used in place of
$\unicode[STIX]{x1D713}_{\ast }$
so that

and

As a result, the gyrokinetic equation for a stellarator is

In the
$\unicode[STIX]{x1D6FF}f$
tokamak and stellarator equations for
$\widetilde{h}$
all terms are normally of the same order in a turbulent plasma, and the
$cB^{-1}\boldsymbol{b}\times \unicode[STIX]{x1D735}_{R}\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}$
term in
$\langle \boldsymbol{v}_{E}\rangle _{R}$
is the only nonlinearity (Hitchcock & Hazeltine Reference Hitchcock and Hazeltine1978; Frieman & Chen Reference Frieman and Chen1982; Lee Reference Lee1983). This nonlinearity leads to gyroBohm transport as shown in the next section.
9 GyroBohm transport from the fluctuation gyrokinetic equation and zonal flow
In this section the critical balance procedure of Barnes, Parra & Schekochihin (Reference Barnes, Parra and Schekochihin2011) is used to demonstrate that the turbulent transport is gyroBohm within a geometrical factor.
To begin, the eddy size
$\unicode[STIX]{x1D6E5}$
is taken as the measure of the typical value of
$k_{\bot }^{-1}$
, that is

and thereby the drives are ordered as

The adiabatic and non-adiabatic responses are comparable when
$\tilde{h}/f_{M}\sim Ze\tilde{\unicode[STIX]{x1D6F7}}/T$
.
According to the mixing length estimate the nonlinear terms become important when
$\unicode[STIX]{x1D735}_{\bot }\tilde{h}\sim \unicode[STIX]{x1D735}_{\bot }\,f_{M}$
or

Balancing the nonlinear drift term with parallel streaming

then yields the eddy size estimate

The eddy turnover time
$\unicode[STIX]{x1D70F}$
is estimated by balancing the time derivative with the parallel streaming term,

giving

The turbulent diffusivity
$D_{\text{turb}}\sim \unicode[STIX]{x1D6E5}^{2}/\unicode[STIX]{x1D70F}$
is then estimated to be

The preceding crude estimate is an aspect ratio larger than gyroBohm. More details with comparisons to ion temperature gradient simulation results are in Barnes et al. (Reference Barnes, Parra and Schekochihin2011). The turbulent levels found by Parra & Barnes (Reference Parra and Barnes2015a ) for ITG momentum diffusivity are also consistent with (9.8).
Nonlinear beating deposits some of the turbulent energy by an inverse cascade into a temporally varying axisymmetric radial mode with a sheared toroidal
$\boldsymbol{E}\times \boldsymbol{B}$
drift velocity referred to as a zonal flow. To reach the residual zonal flow level evaluated by Rosenbluth & Hinton (Reference Rosenbluth and Hinton1998), the poloidal dependence of the initial condition gives rise to a geodesic acoustic mode (GAM) that damps away. Sugama & Watanabe (Reference Sugama and Watanabe2006) were able to describe this transient behaviour as it damped to the steady state zonal flow level. The residual zonal flow potential has no poloidal variation so no further Landau damping occurs once the transient geodesic acoustic mode (GAM) response to this mode coupling has Landau damped away. The zonal flow shear suppresses turbulence and is responsible for the nonlinear Dimits upshift from the linear threshold for the onset of the ion temperature gradient (ITG) mode. The Rosenbluth & Hinton (Reference Rosenbluth and Hinton1998) zonal flow test demonstrated the need for simulations to properly retain ion polarization effects associated with the magnetic drifts. They did this by solving the linear axisymmetric initial value problem associated with the transit average of the gyrokinetic equation

By specifying poloidally varying gyroaveraged initial conditions they proved that the time asymptotic solution did not damp to zero, but rather relaxed to a residual zonal flow level of

when
$k_{\bot }\unicode[STIX]{x1D70C}_{pi}\ll 1$
and
$\unicode[STIX]{x1D700}\ll 1$
. Subsequent work considered the role of collisions (Hinton & Rosenbluth Reference Hinton and Rosenbluth1999; Xiao, Catto & Molvig Reference Xiao, Catto and Molvig2007), shaping (Xiao & Catto Reference Xiao and Catto2006), arbitrary
$k_{\bot }\unicode[STIX]{x1D70C}$
(Xiao et al.
Reference Xiao, Catto and Molvig2007; Monreal et al.
Reference Monreal, Calvo, Sánchez, Parra, Bustos, Könies, Kleiber and Görler2016), electromagnetic effects (Catto, Parra & Pusztai Reference Catto, Parra and Pusztai2017) and extensions to stellarators (Monreal et al.
Reference Monreal, Calvo, Sánchez, Parra, Bustos, Könies, Kleiber and Görler2016 and references therein). Xiao’s work is briefly summarized in Xiao, Catto & Dorland (Reference Xiao, Catto and Dorland2007), where comparisons with numerical simulations are presented.
The next section generalizes gyrokinetics to retain electromagnetic effects.
10 Electromagnetic effects in the local gyrokinetic equation
Electromagnetic effects in a local
$\unicode[STIX]{x1D6FF}f$
gyrokinetic description are most conveniently handled by writing the fluctuating magnetic field
${\displaystyle\mathop{\boldsymbol{B}}\limits_{{\sim}}}$
as

then relating its parallel component to the fluctuating vector potential
${\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}$
by

In this way,
$\tilde{A}_{\Vert }$
and
$\tilde{B}_{\Vert }$
turn out to be the two useful dependent variables needed to describe electromagnetic effects. The fluctuating Ampere’s law

then gives the parallel component to be

and the perpendicular component as

The gyrokinetic equation now contains

with

The
$\unicode[STIX]{x1D735}_{v}\bar{E}=M\boldsymbol{v}$
term is easier to treat so is considered first. Removing the adiabatic piece as before leaves the gyroaverage of the fluctuating terms

namely,

The gyroaverage
$\langle \tilde{\unicode[STIX]{x1D712}}\rangle _{R}$
can be evaluated when
$\tilde{\unicode[STIX]{x1D6F7}}$
,
${\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}$
and
${\displaystyle\mathop{\boldsymbol{B}}\limits_{{\sim}}}$
are proportional to
$\text{e}^{\text{i}\boldsymbol{k}\boldsymbol{\cdot }\boldsymbol{r}}$
. Recalling
$\langle \text{e}^{-\text{i}\boldsymbol{k}\boldsymbol{\cdot }\unicode[STIX]{x1D746}}\rangle _{R}=J_{0}(k_{\bot }v_{\bot }/\unicode[STIX]{x1D6FA})$
and using
$\langle \boldsymbol{v}_{\bot }\text{e}^{-\text{i}\boldsymbol{k}\boldsymbol{\cdot }\unicode[STIX]{x1D746}}\rangle _{R}=\text{i}k_{\bot }^{-1}v_{\bot }\boldsymbol{k}\times \boldsymbol{b}J_{1}(k_{\bot }v_{\bot }/\unicode[STIX]{x1D6FA})$
gives

where
$\tilde{B}_{\Vert }\simeq \text{i}\boldsymbol{b}\boldsymbol{\cdot }\boldsymbol{k}\times {\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}_{\bot }$
. Then using
$\tilde{\unicode[STIX]{x1D712}}=\unicode[STIX]{x1D6F4}_{\boldsymbol{k}}\tilde{\unicode[STIX]{x1D712}}_{\boldsymbol{k}}\text{e}^{\text{i}\boldsymbol{k}_{\bot }\boldsymbol{\cdot }\boldsymbol{r}}$
results in the Fourier transform expression

where
$\tilde{\unicode[STIX]{x1D6F7}}(\boldsymbol{R},t)$
,
$\tilde{A}_{\Vert }(\boldsymbol{R},t)$
and
$\tilde{B}_{\Vert }(\boldsymbol{R},t)$
are proportional to
$\text{e}^{\text{i}\boldsymbol{k}\boldsymbol{\cdot }\boldsymbol{R}}$
and their
$\boldsymbol{k}_{\bot }$
subscripts are suppressed for notational simplicity. To verify that the
$k_{\bot }v_{\bot }/\unicode[STIX]{x1D6FA}\ll 1$
limit is recovered correctly for the
$\tilde{B}_{\Vert }$
term as well as for
$\tilde{\unicode[STIX]{x1D6F7}}-c^{-1}v_{\Vert }\tilde{A}_{\Vert }$
, use
$\boldsymbol{v}_{\bot }=(\unicode[STIX]{x2202}\boldsymbol{v}/\unicode[STIX]{x2202}\unicode[STIX]{x1D711})\times \boldsymbol{b}$
to integrate by parts, then use
$\boldsymbol{v}_{\bot }\boldsymbol{\cdot }\unicode[STIX]{x1D735}_{R}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}_{\bot }=-\unicode[STIX]{x1D6FA}\unicode[STIX]{x2202}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}_{\bot }/\unicode[STIX]{x2202}\unicode[STIX]{x1D711}|_{R}$
to find

The preceding results imply the orderings

giving

with
$v_{e}$
and
$v_{i}$
the electron and ion thermal speeds. Recalling
$Ze\tilde{\unicode[STIX]{x1D6F7}}/T\sim 1/k_{\bot }a$
,
$c\tilde{\unicode[STIX]{x1D6F7}}\sim v_{e}\tilde{A}_{\Vert }$
gives

and

Consequently, electron dynamics appears to allow electromagnetic effects to enter at very small levels. However, typically only the transit average response of the electrons matters, bringing the electron parallel current down to the ion level. As a result, the parallel Ampere’s law gives

which for
$k_{\bot }\unicode[STIX]{x1D70C}_{i}$
yields
$v_{i}\tilde{A}_{\Vert }/c\sim \unicode[STIX]{x1D6FD}\tilde{\unicode[STIX]{x1D6F7}}$
, where
$\unicode[STIX]{x1D6FD}$
is the plasma over magnetic pressure,
$\unicode[STIX]{x1D6FD}=8\unicode[STIX]{x03C0}nT/B^{2}$
. The transit averaged electron response from
$\unicode[STIX]{x2202}\tilde{A}_{\Vert }/\unicode[STIX]{x2202}t$
competes with the electrostatic potential terms in quasineutrality when
$\unicode[STIX]{x1D714}\tilde{A}_{\Vert }/k_{\Vert }c\sim \tilde{\unicode[STIX]{x1D6F7}}$
. Combining these two estimates for
$\unicode[STIX]{x1D714}\sim \unicode[STIX]{x1D714}_{\ast }\sim k_{\bot }\unicode[STIX]{x1D70C}_{i}v_{i}/a$
and
$k_{\Vert }\sim qR$
, suggests that finite
$\unicode[STIX]{x1D6FD}$
effects due to shear Alfvén effects enter when

Within other geometrical factors it is very roughly the
$\unicode[STIX]{x1D6FD}$
value at which trapped electron modes make the transition to kinetic ballooning modes (Pueschel, Kammerer & Jenko Reference Pueschel, Kammerer and Jenko2008).
To estimate when
$\tilde{B}_{\Vert }$
enters, the perpendicular Ampere’s law is used by noting the ion response is what matters in the perpendicular current because of the gyroaverage contained in the integral over velocity space. As a result, it gives

Then estimating
$\tilde{B}_{\Vert }/B\sim 1/k_{\bot }a\sim \tilde{f}/f_{M}$
yields the much higher plasma over magnetic pressure of

as the value of
$\unicode[STIX]{x1D6FD}$
when compressional Alfvén effects should be kept.
The
$\unicode[STIX]{x1D735}_{v}\unicode[STIX]{x1D713}_{\ast }=-(Mc/Ze)R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}$
term must also be evaluated. However,
$\unicode[STIX]{x1D713}\rightarrow \unicode[STIX]{x1D713}+\tilde{\unicode[STIX]{x1D713}}$
with
$\tilde{\unicode[STIX]{x1D713}}=-R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\boldsymbol{\cdot }{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}$
must be used in
$\unicode[STIX]{x1D713}_{\ast }$
when forming
$\dot{\unicode[STIX]{x1D713}}_{\ast }$
so that
$\unicode[STIX]{x2202}\tilde{\unicode[STIX]{x1D713}}/\unicode[STIX]{x2202}t$
cancels the
$-c^{-1}\unicode[STIX]{x2202}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}/\unicode[STIX]{x2202}t$
in the fluctuating electric field term, leading to the consideration of just

Employing

with the
$\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\tilde{A}_{\Vert }$
term ignored as small, gives

In addition,

where the gyroaverage of
$\boldsymbol{v}_{\bot }\boldsymbol{\cdot }\unicode[STIX]{x1D735}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}_{\bot }=\boldsymbol{v}_{\bot }\boldsymbol{\cdot }\unicode[STIX]{x1D735}_{R}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}_{\bot }=-\unicode[STIX]{x1D6FA}\unicode[STIX]{x2202}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}_{\bot }/\unicode[STIX]{x2202}\unicode[STIX]{x1D711}|_{R}$
leaves only

As a result,
$\langle \tilde{\unicode[STIX]{x1D712}}\rangle _{R}$
is again obtained, with

Therefore, the right side of the electromagnetic gyrokinetic equation is found to be

On the left side of the gyrokinetic equation the electromagnetic nonlinear term must be evaluated, namely

where the
$\unicode[STIX]{x2202}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}/\unicode[STIX]{x2202}t$
is small and
$\langle \boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}\rangle _{R}=0$
. As a result,
$\langle \boldsymbol{v}_{\bot }\text{e}^{-\text{i}\boldsymbol{k}\boldsymbol{\cdot }\unicode[STIX]{x1D746}}\rangle _{R}=\text{i}k_{\bot }^{-1}v_{\bot }\boldsymbol{k}\times \boldsymbol{b}J_{1}$
gives

Using the preceding results, the
$\unicode[STIX]{x1D6FF}f=\tilde{f}$
form for the nonlinear electromagnetic gyrokinetic equation for the non-adiabatic response is found to be

with

and

Consequently, to recover the electromagnetic form from the electrostatic form,
$\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}$
is simply replaced by
$\langle \tilde{\unicode[STIX]{x1D712}}\rangle _{R}$
.
Employing
$\tilde{h}=\unicode[STIX]{x1D6F4}_{\boldsymbol{k}-\boldsymbol{k}^{\prime }}\tilde{h}_{\boldsymbol{k}-\boldsymbol{k}^{\prime }}\text{e}^{\text{i}(\boldsymbol{k}_{\bot }-\boldsymbol{k}_{\bot }^{^{\prime }})\boldsymbol{\cdot }\boldsymbol{R}}$
and
$\tilde{\unicode[STIX]{x1D712}}=\unicode[STIX]{x1D6F4}_{\boldsymbol{k}^{\prime }}\tilde{\unicode[STIX]{x1D712}}_{\boldsymbol{k}^{\prime }}\text{e}^{\text{i}\boldsymbol{k}_{\bot }^{^{\prime }}\boldsymbol{\cdot }\boldsymbol{r}}$
the nonlinear term becomes

Then the Fourier transformed version of the electromagnetic gyrokinetic equation is

with
$\tilde{h}=\unicode[STIX]{x1D6F4}_{\boldsymbol{k}}\tilde{h}_{\boldsymbol{k}}\text{e}^{\text{i}\boldsymbol{k}_{\bot }\boldsymbol{\cdot }\boldsymbol{R}}$
,
$\text{e}^{\text{i}\boldsymbol{k}_{\bot }\boldsymbol{\cdot }\boldsymbol{r}}=\text{e}^{\text{i}k_{\unicode[STIX]{x1D713}}\unicode[STIX]{x1D713}+\text{i}k_{\unicode[STIX]{x1D6FC}}(\unicode[STIX]{x1D701}-q\unicode[STIX]{x1D717})}$
,
$\boldsymbol{k}_{\bot }=k_{\unicode[STIX]{x1D713}}\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}+k_{\unicode[STIX]{x1D6FC}}\unicode[STIX]{x1D735}(\unicode[STIX]{x1D701}-q\unicode[STIX]{x1D717})$
, and
$\boldsymbol{b}\boldsymbol{\cdot }\boldsymbol{k}_{\bot }\times \boldsymbol{k}_{\bot }^{^{\prime }}=B(k_{\unicode[STIX]{x1D713}}^{^{\prime }}k_{\unicode[STIX]{x1D6FC}}-k_{\unicode[STIX]{x1D713}}k_{\unicode[STIX]{x1D6FC}}^{^{\prime }})$
.
Only the
$\boldsymbol{k}\times \boldsymbol{b}$
component of the perpendicular Ampere’s law,

matters, since the adiabatic response does not contribute and both sides of the
$\boldsymbol{k}_{\bot }$
component vanish because
$\langle \boldsymbol{k}_{\bot }\boldsymbol{\cdot }\boldsymbol{v}_{\bot }\text{e}^{-\text{i}\boldsymbol{k}\boldsymbol{\cdot }\unicode[STIX]{x1D746}}\rangle _{r}=0$
and
$\unicode[STIX]{x1D735}\tilde{B}_{\Vert }\times \boldsymbol{b}\approx \text{i}\tilde{B}_{\Vert }\boldsymbol{k}\times \boldsymbol{b}$
.
Once again the electromagnetic correction from the neoclassical response
$\bar{h}_{p}=\bar{h}_{p}(\unicode[STIX]{x1D713},\bar{E},\unicode[STIX]{x1D707},\unicode[STIX]{x1D70E})$
does not matter. The fluctuating
${\displaystyle\mathop{\boldsymbol{B}}\limits_{{\sim}}}$
term is proportional to

and is negligible since
$\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}\boldsymbol{\cdot }\boldsymbol{v}_{\bot }$
is small and
$\langle \boldsymbol{v}_{\bot }\boldsymbol{\cdot }\unicode[STIX]{x1D735}\tilde{A}_{\Vert }\rangle _{R}=0$
.
Simulations often use
$v_{\Vert }$
, rather than
$E$
or
$\bar{E}$
, as one of the velocity space variables. In such cases, using
$Mv_{\Vert }^{2}/2=\bar{E}-Ze\bar{\unicode[STIX]{x1D6F7}}-\unicode[STIX]{x1D707}B$
, the replacement

displays the mirror force. In addition, flux tube simulations use the spatial variables
$\unicode[STIX]{x1D713}$
,

and
$\unicode[STIX]{x1D717}$
or their gyrokinetic counterparts, where now

The poloidal angle
$\unicode[STIX]{x1D717}$
gives the position along the magnetic field. Notice that for this choice

as desired.
11 Gyrokinetics with flow
The treatment so far retains some flow features by keeping
$cB^{-1}\boldsymbol{b}\times \unicode[STIX]{x1D735}_{R}\bar{\unicode[STIX]{x1D6F7}}$
in
$\langle \boldsymbol{v}_{E}\rangle _{R}$
. In particular, for sonic flow
$\bar{\unicode[STIX]{x1D6F7}}$
is known to be a flux function to lowest order (Hinton & Wong Reference Hinton and Wong1985; Catto, Bernstein & Tessarotto Reference Catto, Bernstein and Tessarotto1987). The order used here is subsonic, but even for diamagnetic flows
$\bar{\unicode[STIX]{x1D6F7}}=\text{}\underline{\bar{\unicode[STIX]{x1D6F7}}}(\unicode[STIX]{x1D713})$
to lowest order (Parra et al.
Reference Parra, Barnes, Calvo and Catto2012). Consequently,
$\boldsymbol{B}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D713}=I\boldsymbol{B}-R^{2}B^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}$
can be employed to write the small electric drift

as the difference between two
$B/B_{p}$
larger flows, one parallel and the other toroidal. As a result, the left side of the gyrokinetic equation can be written as

with the toroidal rotation frequency
$\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}=\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}(\unicode[STIX]{x1D713})$
defined as

and the fluctuating electric drift

The combination

is the rotating frame time derivative, and redefining
$\unicode[STIX]{x1D6FC}=\unicode[STIX]{x1D701}-q(\unicode[STIX]{x1D713})\unicode[STIX]{x1D703}(\unicode[STIX]{x1D713},\unicode[STIX]{x1D717})-\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}(\unicode[STIX]{x1D713})t$
gives
$D\widetilde{h}/Dt=\unicode[STIX]{x2202}\widetilde{h}/\unicode[STIX]{x2202}t|_{\unicode[STIX]{x1D6FC}}$
.
The preceding suggests adding a parallel flow to the Maxwellian to make it a drifting Maxwellian by introducing

then in the toroidally rotating frame the curvature drift will include a Coriolis drift as well, since

with
$\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\boldsymbol{b}\simeq -\unicode[STIX]{x1D735}\ell nR$
, as in Parra et al. (Reference Parra, Barnes and Catto2011). Continuing to follow their treatment, but using energy rather than parallel velocity as a variable, the energy in the rotating frame is defined as

where
$M\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}^{2}(R^{2}-I^{2}B^{-2})/2T=M\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}^{2}(R^{2}B_{p}^{2})/2TB^{2}\ll 1$
.
To retain flow with a parallel drifting Maxwellian the replacement
$\bar{E}\rightarrow \bar{U}$
is made in
$\bar{f}_{\ast }$
to obtain

and

with

Defining the parallel drifting Maxwellian

where
$M\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}^{2}R^{2}B_{p}^{2}/2TB^{2}\ll 1$
is assumed, then

Observing

with
$\unicode[STIX]{x1D6FA}_{\ast }(\unicode[STIX]{x1D713}_{\ast }\rightarrow \unicode[STIX]{x1D713})=\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}$
and
$Iv_{\Vert }/B$
a slow function of space, then a new drive term due to flow shear arises when
${\dot{f}}_{\ast }^{\odot }$
is formed since to lowest order

The preceding implies that when toroidal flow is retained and the gyrokinetic equation is written in the toroidally rotating frame for
$M\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}^{2}R^{2}B_{p}^{2}/2TB^{2}\ll 1$
, the result is

with

and

The preceding form for the gyrokinetic equation requires
$1\gg \unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}R/v_{i}\gg B\unicode[STIX]{x1D6FF}/B_{p}\gg \unicode[STIX]{x1D6FF}$
to be valid.
Local gyrokinetic simulations typically solve the gyrokinetic and Maxwell system of equations on a flux tube by specifying
$p,T$
and
$\unicode[STIX]{x2202}\bar{\unicode[STIX]{x1D6F7}}/\unicode[STIX]{x2202}\unicode[STIX]{x1D713}$
and their radial derivatives. Global simulations specify more involved profiles by linking local flux tubes together. As a result, boundary conditions are more complicated for global runs.
Many more details on transforming the gyrokinetic equation to the toroidally rotating frame are given in Parra et al. (Reference Parra, Barnes and Catto2011) where the derivation is performed to higher order to treat intrinsic rotation. Here the purpose is to give some insight into how subsonic toroidal rotation due to a momentum source can be treated when it spins the plasma above the diamagnetic level. In addition, the preceding is a gyrokinetic equation consistent with the important symmetry properties proven and illustrated in Parra, Barnes & Peeters (Reference Parra, Barnes and Peeters2011) and Sugama et al. (Reference Sugama, Watanabe, Nunami and Nishimura2011). These symmetry properties are discussed in appendix A in a more heuristic manner. The consideration of sonic flows results in some simplifications (Able et al. Reference Able, Plunk, Wang, Barnes, Cowley, Dorland and Schekochihin2013), but requires a strong source of momentum input. Even when a sonic impurity species is present, a full treatment of momentum transport and profile evolution is required for the non-sonic species to obtain the correct rotation frequency.
12 When are higher-order gyrokinetic variables required?
Local and global simulations are unable to spatially and temporally evolve radial profiles self-consistently. They must be run for specified background profiles (value and derivative) of the plasma density, ion and electron temperatures and radial electric field. If it were not for the lowest flux function that needs to be determined in the electrostatic potential, evolving profiles could be done by linking multiple flux tube simulations using just the conservation of number and energy equations as only particle and heat transport would need to be determined. However, to evaluate the full electrostatic potential requires evaluating the radial flux of toroidal angular momentum in the vicinity of each flux surface and then linking and evolving these surfaces using conservation of toroidal angular momentum. To evaluate the radial flux of toroidal angular momentum and toroidal angular momentum conservation to the requisite order requires a higher-order gyrokinetic description as realized by Catto et al. (Reference Catto, Simakov, Parra and Kagan2008), demonstrated in detail by Parra & Catto (Reference Parra and Catto2008, Reference Parra and Catto2009, Reference Parra and Catto2010a ) and refined further by Parra et al. (Reference Parra, Barnes and Catto2011) and Parra et al. (Reference Parra, Barnes, Calvo and Catto2012). The goal here is the far less ambitious one of demonstrating why higher-order effects are required.
To begin the task, recall that as profiles evolve due to momentum transport, the ambipolarity condition
$\langle \boldsymbol{J}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle =0$
must be satisfied on each flux surface, as the displacement current can normally be ignored. Any error resulting in
$\langle \boldsymbol{J}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle _{\text{error}}\neq 0$
acts as a torque on the plasma that results in an unphysical momentum transport satisfying

in the steady state (Parra & Catto Reference Parra and Catto2010b
). Letting
$\unicode[STIX]{x1D6F4}M\langle R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\boldsymbol{\cdot }\int \text{d}^{3}vf\boldsymbol{v}\boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle \simeq R^{2}B_{p}\unicode[STIX]{x1D6F1}_{r\unicode[STIX]{x1D701}}$
and
$\langle \boldsymbol{J}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle _{\text{error}}\simeq RB_{p}J_{r}^{\text{error}}$
, implies that the ambipolarity error must be small enough that it and the off-diagonal stress tensor
$\unicode[STIX]{x1D6F1}_{r\unicode[STIX]{x1D701}}$
satisfy
$\unicode[STIX]{x2202}\unicode[STIX]{x1D6F1}_{r\unicode[STIX]{x1D701}}/\unicode[STIX]{x2202}r\gg c^{-1}B_{p}J_{r}^{\text{error}}$
or

Turbulent momentum transport implies

where the toroidal flow is diamagnetic with
$V_{\unicode[STIX]{x1D701}}\sim v_{i}\unicode[STIX]{x1D70C}_{pi}/a$
. As a result,

and an unphysical torque will be applied by a very small error in the stress tensor or radial current density of

In addition, a direct evaluation of the off-diagonal stress requires a very high-order evaluation of the fluctuating distribution function

Any error in its evaluation will also act as an unphysical torque (Parra & Catto Reference Parra and Catto2010c
). Such an accurate evaluation of
$\tilde{h}$
seems unrealistic since the gyrokinetic equation derived herein only gives
$\tilde{h}/f_{M}\sim Ze\widetilde{\unicode[STIX]{x1D6F7}}/T\sim 1/k_{\bot }a\sim \unicode[STIX]{x1D70C}_{i}/a$
, while to next-order gyrokinetics gives
$\tilde{h}/f_{M}\sim \unicode[STIX]{x1D70C}_{i}\unicode[STIX]{x1D70C}_{pi}/a^{2}$
. Fortunately,
$\tilde{h}/f_{M}\sim \unicode[STIX]{x1D70C}_{i}\unicode[STIX]{x1D70C}_{pi}/a^{2}$
is sufficient when moments of the exact Fokker–Planck equation are employed to pick up another order in
$\unicode[STIX]{x1D70C}_{pi}/a$
, as anticipated by Catto et al. (Reference Catto, Simakov, Parra and Kagan2008) and confirmed more carefully and rigorously by Parra & Catto (Reference Parra and Catto2008, Reference Parra and Catto2010a
). Moments of the full Fokker–Planck equation are routinely employed in neoclassical transport to avoid having to calculate distribution functions to higher order in the gyroradius expansion. And, of course, are the basis of all evaluations of short and long mean free path transport.
The simplest example is radial particle transport as mentioned in the ambipolarity discussion. It is most conveniently evaluated using the coarse grain average of the
$R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}$
component of the electron momentum conservation to find the lowest-order result

with
$C_{ei}$
the electron–ion collision operator and
$m$
the electron mass. For illustrative purposes only electrostatics need be considered. Only the fluctuating electron density
${\tilde{n}}_{e}$
and potential are required to evaluate the turbulent flux. A direct evaluation of
$\langle \int \text{d}^{3}vf_{e}\boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rangle _{cg}$
would require a more accurate
$f_{e}$
as the lowest-order neoclassical contribution to
$f_{e}$
is odd in
$v_{\Vert }$
, the leading classical part of
$f_{e}$
also depends on
$\boldsymbol{v}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}$
and thereby results in a vanishing collisional contribution, while
$\langle \tilde{f}_{e}\rangle _{cg}=0$
for the turbulent contribution implying that the turbulent correction to
$\langle f_{e}\rangle _{cg}\simeq \bar{f}_{e}$
is required. As
$\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\tilde{\unicode[STIX]{x1D6F7}}$
is higher order, the turbulent particle flux is simply

Similarly, to evaluate radial heat fluxes the
$R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\boldsymbol{\cdot }\boldsymbol{v}v^{2}/2$
moment of the full Fokker–Planck equation is also needed. In this case, the lowest-order expression is

where
$\tilde{p}_{j}=M_{j}\int \text{d}^{3}vv^{2}\tilde{f}_{j}/3=\bar{n}_{j}\tilde{T}_{j}+{\tilde{n}}_{j}\bar{T}_{j}$
and
$\langle \tilde{p}_{j}\unicode[STIX]{x2202}\tilde{T}_{j}/\unicode[STIX]{x2202}\unicode[STIX]{x1D701}\rangle _{cg}=(\bar{T}_{j}/\bar{n}_{j})\langle {\tilde{n}}_{j}\unicode[STIX]{x2202}\tilde{p}_{j}/\unicode[STIX]{x2202}\unicode[STIX]{x1D701}\rangle _{cg}$
, with
$C_{i}=C_{ii}$
for the ions and
$C_{e}=C_{ee}+C_{ei}$
for the electrons. Again, this moment of the full kinetic equation obviates the need to evaluate
$f_{j}$
to very high order.
The electron particle flux, and electron and ion heat fluxes, require only the lowest-order solutions of the
$\unicode[STIX]{x1D6FF}f=\tilde{f}$
gyrokinetic equation for the fluctuations and the lowest-order drift kinetic equation for the collisional modification to the Maxwellian background.
The lowest-order flux function
$\bar{\unicode[STIX]{x1D6F7}}\simeq \bar{\unicode[STIX]{x1D6F7}}(\unicode[STIX]{x1D713})$
is required to complete the transport description and determine the complete ion flow and flow shear at a flux surface. It enters particle and heat fluxes but must be evaluated by solving the conservation of toroidal angular momentum equation. The moment procedure becomes more involved for the evaluation of the radial flux of toroidal angular momentum both because a stress tensor is involved and higher-order gyrokinetic variables are required. Even with these complications the moment procedure is essential as it only requires the next-order modifications of gyrokinetics. At present, the next order gyrokinetic treatment combined with a moment approach seems to be the only practical and realistic way to treat momentum transport and profile evolution. Details are beyond the scope of a gyrokinetic tutorial as the mathematics is rather involved since the
$R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\boldsymbol{\cdot }\boldsymbol{v}\boldsymbol{v}$
and
$R^{2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\boldsymbol{\cdot }\boldsymbol{v}\boldsymbol{v}\boldsymbol{v}$
moments of the full Fokker–Planck equation are required (Parra & Catto Reference Parra and Catto2010b
,Reference Parra and Catto
c
), as well as the associated complexity of having to employ higher-order gyrokinetic variables. Nearly all of the required expressions were evaluated by Parra & Catto (Reference Parra and Catto2008, Reference Parra and Catto2009, Reference Parra and Catto2010b
,Reference Parra and Catto
c
); Parra et al. (Reference Parra, Barnes and Catto2011); Parra et al. (Reference Parra, Barnes, Calvo and Catto2012); Parra & Barnes (Reference Parra and Barnes2015a
,Reference Parra and Barnes
b
); and Calvo & Parra (Reference Calvo and Parra2015); however, the contribution from global features of the radial turbulence profile due to eddy length scale over unperturbed radial scale length corrections remains an active area of research (Parra & Barnes Reference Parra and Barnes2015a
,Reference Parra and Barnes
b
) because of the more complicated boundary conditions needed for such a flux tube simulation.
13 Discussion and prospects
This tutorial attempts to summarize gyrokinetics in a manner accessible to interested magnetic fusion practitioners who are not gyrokinetic experts. The orderings appropriate for gyrokinetics and benefits of taking advantage of separation of scale between the slow temporal and spatial background behaviour and the much shorter scales and more rapid time variation of the turbulent fluctuations are highlighted. Then the gyrokinetic equation is derived in its simplest electrostatic forms. The scale separation assumption allows the adiabatic response of each species to be removed from the local gyrokinetic equation for the non-adiabatic response once a lowest-order Maxwellian is shown to be valid because of the existence of pressure or flux surfaces in a tokamak. The electromagnetic and flow modifications to the gyrokinetic equation for the non-adiabatic response are also obtained, along with a brief mention of the stellarator form for the gyrokinetic equation. The discussions stress the separate intrinsic ambipolarity of turbulent and collisional particle transport in an axisymmetric tokamak. Moreover, the development demonstrates that both turbulent and collisional heat fluxes follow from lowest-order gyrokinetic treatments. The inadequacy of lowest-order gyrokinetics only becomes evident when momentum transport and profile evolution need to be treated in a tokamak to determine the unknown flux function in the electrostatic potential. The need to use great care in such cases to avoid introducing errors that torque the plasma is stressed. The prospects of solving a full hybrid fluid (using conservation of number, momentum, and energy equations) and next-order gyrokinetic equation to provide closure is stressed without delving into the challenging and complex formulation that is required. At present this seems to be the most promising path for solving this important problem and using the results to further optimize tokamak performance.
The lowest-order flux function
$\bar{\unicode[STIX]{x1D6F7}}\simeq \bar{\unicode[STIX]{x1D6F7}}(\unicode[STIX]{x1D713})$
is required to complete the transport description and determine the complete ion flow and flow shear at a flux surface. It is evaluated by solving the conservation of toroidal angular momentum equation. The moment procedure becomes more involved for the evaluation of the radial flux of toroidal angular momentum both because a stress tensor is involved and higher-order gyrokinetic variables are required. Even with these complications the moment procedure is essential as it only requires the next-order modifications of gyrokinetics. In spite of these subtleties the procedure is more tractable than a full f Hamiltonian method that requires a third-order Hamiltonian – that is, the lowest Hamiltonian plus all corrections up to and including
$(\unicode[STIX]{x1D70C}/a)^{3}$
(Parra & Calvo Reference Parra and Calvo2011; Parra et al.
Reference Parra, Barnes, Calvo and Catto2012; Calvo & Parra Reference Calvo and Parra2012). At present only the second-order Hamiltonian has been evaluated (Parra & Calvo Reference Parra and Calvo2011; Calvo & Parra Reference Calvo and Parra2015). Therefore, the next-order gyrokinetic treatment combined with a moment approach seems the only practical and realistic way to treat momentum transport and profile evolution.
Acknowledgements
P.J.C. is indebted to Professor N. Loureiro and Dr P. Bonoli (both of MIT) and Professor B. Dorland (U. Maryland) for nominating him for a tutorial talk at the 2018 APS Division of Plasma Physics meeting in Portland that served as the impetus for this manuscript; and to Professors F. Parra and M. Barnes (both of Oxford U.) for their support and help with the talk and manuscript. He also appreciates the careful reviewers of JPP whose suggestions and comments further improved the manuscript. The work was supported by the US Department of Energy grant DE-FG02-91ER-54109.
Appendix A. Symmetry properties of the gyrokinetic equation
Parra et al. (Reference Parra, Barnes and Catto2011) and Sugama et al. (Reference Sugama, Watanabe, Nunami and Nishimura2011) have proven that the gyrokinetic equation has important symmetry properties. To simplify the demonstration only an up–down mirror reflected symmetric tokamak is considered for the more heuristic treatment sketched here. For these up–down reflected symmetry considerations the
$Z$
axis is the axis of rotation and upon reflection
$Z\rightarrow -Z$
,
$\unicode[STIX]{x1D735}Z\rightarrow -\unicode[STIX]{x1D735}Z$
and
$\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}\rightarrow -\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}$
, while
$\boldsymbol{B}_{p}=\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D713}$
is unchanged. The low field side of the equatorial plane is taken to be
$\unicode[STIX]{x1D717}=0$
. Up-down symmetry means that in the absence of momentum sources or sinks, there can be no radial transport of toroidal angular momentum associated with the lowest-order gyrokinetic equation.
Using
$\boldsymbol{B}=B\boldsymbol{b}=I\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}+\unicode[STIX]{x1D735}\unicode[STIX]{x1D701}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D713}=\unicode[STIX]{x1D735}\unicode[STIX]{x1D6FC}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D713}$
with
$\unicode[STIX]{x1D713}$
,
$\unicode[STIX]{x1D6FC}=\unicode[STIX]{x1D701}-q(\unicode[STIX]{x1D713})\unicode[STIX]{x1D703}(\unicode[STIX]{x1D713},\unicode[STIX]{x1D717})$
and
$\unicode[STIX]{x1D717}$
as the three spatial variables, and considering the linearized electrostatic gyrokinetic equation at first, then the transformations
$v_{\Vert }\rightarrow -v_{\Vert }$
and
$\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}\rightarrow -\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}$
leave the parallel streaming term, as well as the time derivative and collision terms on the left side unchanged. The remaining linear terms are the magnetic drifts

As the magnetic drift is predominately down (in the
$-\unicode[STIX]{x1D735}Z$
direction), up–down symmetry (
$Z\rightarrow -Z$
,
$\unicode[STIX]{x1D717}\rightarrow -\unicode[STIX]{x1D717}$
) means that upon coordinate reflection the magnetic drift becomes predominately in the
$\unicode[STIX]{x1D735}Z$
direction with
$\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}\rightarrow -\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}$
. As a result,
$\boldsymbol{v}_{M}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}\rightarrow \boldsymbol{v}_{M}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}$
, but
$\boldsymbol{v}_{M}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\rightarrow -\boldsymbol{v}_{M}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}$
. Moreover, the toroidal drift cannot change direction so
$\boldsymbol{v}_{M}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D6FC}\rightarrow \boldsymbol{v}_{M}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D6FC}$
. Therefore, for perturbations the transformations
$\unicode[STIX]{x2202}/\unicode[STIX]{x2202}\unicode[STIX]{x1D6F9}\rightarrow -\unicode[STIX]{x2202}/\unicode[STIX]{x2202}\unicode[STIX]{x1D6F9}$
or
$k_{\unicode[STIX]{x1D713}}\rightarrow -k_{\unicode[STIX]{x1D713}}$
,
$\unicode[STIX]{x2202}/\unicode[STIX]{x2202}\unicode[STIX]{x1D6E9}\rightarrow \unicode[STIX]{x2202}/\unicode[STIX]{x2202}\unicode[STIX]{x1D6E9}$
, and
$\unicode[STIX]{x2202}/\unicode[STIX]{x2202}\unicode[STIX]{x1D6FC}\rightarrow \unicode[STIX]{x2202}/\unicode[STIX]{x2202}\unicode[STIX]{x1D6FC}$
or
$k_{\unicode[STIX]{x1D6FC}}\rightarrow k_{\unicode[STIX]{x1D6FC}}$
do not change the magnetic drift terms and thereby the linearized electrostatic gyrokinetic equation.
When the nonlinear term

is retained, additional complications arise. As
$B^{2}$
is an even function of
$\unicode[STIX]{x1D717}$
, a sign change will occur in the first nonlinear term. This observation suggests using the additional transformations
$\tilde{h}\rightarrow -\tilde{h}$
and
$\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}\rightarrow -\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}$
in all terms of the gyrokinetic equation. Then using
$\boldsymbol{B}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D6FC}\times \unicode[STIX]{x1D735}\unicode[STIX]{x1D717}=R^{-2}\unicode[STIX]{x1D735}\unicode[STIX]{x1D713}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}-(\unicode[STIX]{x2202}q/\unicode[STIX]{x2202}\unicode[STIX]{x1D713})\boldsymbol{B}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\unicode[STIX]{x1D717}$
, the second nonlinear term transforms properly as well. The third and final nonlinear term appears to be a problem; however, it is an order smaller since it can be rewritten as

where
$cIB^{-1}\unicode[STIX]{x2202}\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}/\unicode[STIX]{x2202}\unicode[STIX]{x1D6F9}\ll |v_{\Vert }|$
implies that the first term is an order smaller than the parallel streaming term, while in the second term the parallel electric field is small compared to the perpendicular electric field.
The preceding symmetry properties mean that if
$\tilde{h}(k_{\unicode[STIX]{x1D713}},k_{\unicode[STIX]{x1D6FC}},\unicode[STIX]{x1D717},\bar{E},\unicode[STIX]{x1D707},\unicode[STIX]{x1D70E},t)$
and
$\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}(k_{\unicode[STIX]{x1D713}},k_{\unicode[STIX]{x1D6FC}},\unicode[STIX]{x1D717},t)$
with
$\unicode[STIX]{x1D70E}=v_{\Vert }/|v_{\Vert }|$
are solutions to the nonlinear electrostatic gyrokinetic equation then so are
$-\tilde{h}(-k_{\unicode[STIX]{x1D713}},k_{\unicode[STIX]{x1D6FC}},\unicode[STIX]{x1D717},\bar{E},\unicode[STIX]{x1D707},-\unicode[STIX]{x1D70E},t)$
and
$-\langle \tilde{\unicode[STIX]{x1D6F7}}\rangle _{R}(-k_{\unicode[STIX]{x1D713}},k_{\unicode[STIX]{x1D6FC}},\unicode[STIX]{x1D717},t)$
. These symmetry properties are then easily extended to the nonlinear electromagnetic gyrokinetic equation by noting that when the solutions
$\langle \boldsymbol{A}_{\Vert }\rangle _{R}(k_{\unicode[STIX]{x1D713}},k_{\unicode[STIX]{x1D6FC}},\unicode[STIX]{x1D717},t)$
and
$\langle \tilde{B}_{\Vert }\rangle _{R}(k_{\unicode[STIX]{x1D713}},k_{\unicode[STIX]{x1D6FC}},\unicode[STIX]{x1D717},t)$
are retained,
$\langle \boldsymbol{A}_{\Vert }\rangle _{R}(-k_{\unicode[STIX]{x1D713}},k_{\unicode[STIX]{x1D6FC}},\unicode[STIX]{x1D717},t)$
and
$-\langle \tilde{B}_{\Vert }\rangle _{R}(-k_{\unicode[STIX]{x1D713}},k_{\unicode[STIX]{x1D6FC}},\unicode[STIX]{x1D717},t)$
are the corresponding solutions. Finally, when subsonic rotation is much stronger than the diamagnetic level, the additional symmetry properties in the rotating frame of
$\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}\rightarrow -\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}$
and
$\unicode[STIX]{x2202}\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}/\unicode[STIX]{x2202}\unicode[STIX]{x1D713}\rightarrow -\unicode[STIX]{x2202}\unicode[STIX]{x1D6FA}_{\unicode[STIX]{x1D701}}/\unicode[STIX]{x2202}\unicode[STIX]{x1D713}$
enter, where
$v_{\Vert }$
is replaced by
$w_{\Vert }$
.
The direction of the toroidal magnetic field is unimportant for the preceding symmetry argument as the full gyrokinetic equation posses another simpler symmetry property of just
$v_{\Vert }\rightarrow -v_{\Vert }$
(or
$w_{\Vert }\rightarrow -w_{\Vert }$
) and
$I\rightarrow -I$
.