Grid-based calculation of the Lagrangian mean

Hossein A. Kafiabad

doi:10.1017/jfm.2022.233

Grid-based calculation of the Lagrangian mean

Published online by Cambridge University Press: 08 April 2022

Hossein A. Kafiabad

Show author details

Hossein A. Kafiabad*: Affiliation:
School of Mathematics and Maxwell Institute for Mathematical Sciences, University of Edinburgh, EdinburghEH9 3FD, UK
*: †Email address for correspondence: h.kafiabad@ed.ac.uk

Article contents

Abstract
Introduction
GBLA
Validation
Application: wave-averaged geostrophic balance
Discussion
References

Rights & Permissions

Abstract

Lagrangian averaging has been shown to be more effective than the Eulerian mean in separating waves from slow dynamics in two time scale flows. It also appears in many reduced models that capture the wave feedback on the slow flow. Its calculation, however, requires tracking particles in time, which imposes several difficulties in grid-based numerical simulations or estimation from fixed-point measurements. To circumvent these difficulties, we propose a grid-based iterative method to calculate the Lagrangian mean without tracking particles in time, which also reduces computation, memory footprint and communication between processors in parallelised numerical models. To assess the accuracy of this method several examples are examined and discussed. We also explore an application of this method in the context of shallow-water equations by quantifying the validity of wave-averaged geostrophic balance – a modified form of geostrophic balance accounting for the effect of strong waves on slow dynamics.

JFM classification

Geophysical and Geological Flows: Waves in rotating fluids Mathematical Foundations: Computational methods Geophysical and Geological Flows: Ocean circulation

Type: JFM Papers
Information: Journal of Fluid Mechanics , Volume 940 , 10 June 2022 , A21

DOI: https://doi.org/10.1017/jfm.2022.233 [Opens in a new window]
Copyright: © The Author(s), 2022. Published by Cambridge University Press

1. Introduction

Large-scale geophysical flows are characterised by fast and slow time scales; the fast dynamics consists of inertia-gravity waves, and the slow dynamics is usually described by geostrophic and hydrostatic balance (Vanneste Reference Vanneste2013; Vallis Reference Vallis2017). There is a particular interest in decomposing the flow into these components and analysing them separately. Focusing on the slow dynamics reduces the governing equations to a simpler set that is easier to interpret and solve (Charney Reference Charney1971; Warn Reference Warn1997; McIntyre & Norton Reference McIntyre and Norton2000; Mohebalhojeh & Dritschel Reference Mohebalhojeh and Dritschel2001). Likewise, mathematical tools specific to wave dynamics may be employed to analyse the fast component (Bühler Reference Bühler2009). In addition to bringing physical insights, such decompositions are used to parameterise the small-scale dynamics that are not resolved in ocean and weather models (Sutherland et al. Reference Sutherland, Achatz, Colm-cille and Klymak2019).

The most straightforward decomposition is averaging the variables over the fast time scale at fixed spatial points to derive the Eulerian mean. Although easy to implement, this is not the most effective way of extracting wave signal from the mean flow. For instance, the Doppler shifting of the wave frequency by strong mean flows can eclipse the inherent time scale separation between them. To circumvent this issue, one can derive the Lagrangian mean (LM) instead by averaging the flow variables along particle trajectories and assigning the mean to a fixed particle label. Recent studies have shown a range of oceanic phenomena are better understood with Lagrangian averaging, including spontaneous wave generation (Nagai et al. Reference Nagai, Tandon, Kunze and Mahadevan2015; Shakespeare & Hogg Reference Shakespeare and Hogg2017, Reference Shakespeare and Hogg2018), internal tides (Shakespeare & Hogg Reference Shakespeare and Hogg2019) and their effect on the ecology of the coastal and reef zones (Bachman et al. Reference Bachman, Shakespeare, Kleypas, Castruccio and Curchitser2020).

Another motivation for calculating the LM comes from its natural appearance in reduced models that are derived by wave averaging (Bretherton Reference Bretherton1971; Grimshaw Reference Grimshaw1975; Andrews & McIntyre Reference Andrews and McIntyre1978; Bühler Reference Bühler2009; Gilbert & Vanneste Reference Gilbert and Vanneste2018). A striking prediction, highlighting the fundamental role of the LM, is that the hydrostatic and geostrophic balance continues to hold in the presence of strong waves for LM quantities (Kafiabad, Vanneste & Young (Reference Kafiabad, Vanneste and Young2021), and references therein). Similarly, models of near-inertial waves interacting with the mean flow naturally involve LM variables (Xie & Vanneste Reference Xie and Vanneste2015; Wagner & Young Reference Wagner and Young2015, Reference Wagner and Young2016; Asselin & Young Reference Asselin and Young2020). In general, these studies show that the slow dynamics with the inclusion of wave feedback on the flow is more accurately represented by LM quantities. To examine the validity of these models and utilise them in operational and research models there is a growing need to compute the LM.

Computing the LM, however, requires more effort than computing the Eulerian mean. Most numerical models are discretised using spatial grid points and the majority of measurements are taken at fixed locations in space. To derive the LM, in these cases passive particles are seeded in the flow and advected (forward or backward in time) based on the interpolated velocities at particle positions (e.g. Nagai et al. Reference Nagai, Tandon, Kunze and Mahadevan2015; Shakespeare & Hogg Reference Shakespeare and Hogg2017, Reference Shakespeare and Hogg2018, Reference Shakespeare and Hogg2019). This requires saving the time series of several fields, which are not normally saved in numerical models, at high spatial resolution. The other issue with particle tracking comes up in parallelised solvers, where the computational domain is divided into many subdomains assigned to independent processors. Even uniformly seeded particles may leave their initial subdomain, causing additional communication between the processors and disturbing the computational load balance among them.

We propose a grid-based Lagrangian averaging (GBLA) method that does not require particle tracking. Instead, the LM is updated on grid points using the mean associated with the trajectories that pass through the grid points in the previous time step. As a result, there is no need for saving any additional time series. This method can be integrated with numerical models to efficiently calculate the LM ‘on the fly’. It also maintains a load balance between the processors in parallel implementations and minimises the communication between them. We first describe this method and its mathematical justification in § 2. We then investigate its validity in five examples in § 3. An application of GBLA is considered in § 4, where we study the wave-averaged geostrophy for the rotating shallow-water equations. The paper concludes with a discussion in § 5.

2. GBLA

2.1. Algorithm

Let us consider the LM of the field $\phi (\boldsymbol {x},t)$ from $t_0$ to $t_0+T$. Before explaining the steps of GBLA, we clarify some notations and definitions. We denote the Eulerian mean of $\phi (\boldsymbol {x},t)$ over the period of $t_0$ to $t$ with

(2.1)

\begin{equation} \bar{\phi}(\boldsymbol{x},t_0,t) = \frac{1}{t-t_0} \int_{t_0}^{t} \phi(\boldsymbol{x},\tau) \,{\rm d} \tau. \end{equation}

The LM of $\phi$ is the time average of this quantity along the trajectory, that is, at fixed particle label instead of fixed spatial position. There are different ways of labelling particles; for instance, the particle position at the beginning or the end of the averaging interval may be selected as labels. Denoting this label by $\boldsymbol {x}$, we can map it to the particle position at time $t$ (denoted by $\boldsymbol {X}$) using the particle displacement field $\boldsymbol {{\xi }}(\boldsymbol {x},t)$,

(2.2)

\begin{equation} \boldsymbol{X} = \boldsymbol{{\Xi}}(\boldsymbol{x},t) = \boldsymbol{x}+\boldsymbol{{\xi}}(\boldsymbol{x},t). \end{equation}

Using (2.1) and (2.2), the LM operator is mathematically described as

(2.3)

\begin{equation} \overline{\phi}^{{\hspace{0.5mm}\small L}} (\boldsymbol{x},t_0,t) = \frac{1}{t-t_0} \int_{t_0}^{t} \phi(\boldsymbol{x}+\boldsymbol{{\xi}}(\boldsymbol{x},\tau),\tau) \,{\rm d} \tau. \end{equation}

Different choices of $\boldsymbol {x}$ lead to different displacement fields $\boldsymbol {{\xi }}$. Andrews & McIntyre (Reference Andrews and McIntyre1978) defined the ‘generalised Lagrangian mean (GLM)’ by requiring

(2.4)

\begin{equation} \bar{\boldsymbol{{\xi}}}(\boldsymbol{x},t) = 0. \end{equation}

In other words, $\boldsymbol {x}$ is now the mean position of the particle from $t_0$ to $t$. If the displacement field is small (compared with the changes in mean position), (2.4) gives the LM some Eulerian-like character as $\boldsymbol {x}$ is more than just a label and represents the spatial location of the trajectory for which the mean is calculated. As it will be shown in the examples of § 3.1, this choice of label is more effective at removing the fast time scale in the LM estimate provided that the mapping (2.2) does not have a fast time dependence. If other labels such as instantaneous position at the end or middle of the averaging interval are chosen, the filtered signal may still contain some fast oscillations or get distorted (see the numerical example of § 3.1). Moreover, the GLM definition holds properties that lead to significant analytical simplifications in modelling two time scale geophysical flows (Andrews & McIntyre Reference Andrews and McIntyre1978; Bühler & McIntyre Reference Bühler and McIntyre1998; Xie & Vanneste Reference Xie and Vanneste2015; Gilbert & Vanneste Reference Gilbert and Vanneste2018). Throughout this paper, we use (2.3) in conjunction with (2.4) to express the LM in terms of the mean position. For $t<(t_0+T)$, (2.1) and (2.3) represent partial means, and for $t = t_0+T$ the complete mean for the intended averaging interval. Considering that in this section we focus only on one averaging interval with a fixed starting point, we drop $t_0$ in the arguments of the mean operators to lighten the notation.

The core idea behind GBLA is to calculate partial LMs recursively in time. For instance, we assume that the partial LM from $t_0$ to $t_k$ ($t_0< t_k< t_0+T$) for the particles that reach the grid points at time $t_k$ is already computed. Through the steps laid out below, the LM is extended to $t_{k+1}$ and calculated for a new set of particles that reach the grid points at time $t_{k+1}$ (using the partial averages of the previous step). In this approach, at each time step a different set of trajectories is considered (the set that ends up at grid points at that time step), which is in contrast with particle tracking methods where one trajectory is followed all along over the entire averaging interval. The incremental extension of partial LMs continues to the end of the averaging interval $t_0+T$. The completed LM field is then assigned to time $t_0+T/2$. Note that in our demonstration we focus only on one averaging interval, but this process can be repeated for new intervals, and averaging procedure can be carried out simultaneously. For example, the next interval may start from $t_m$ and end at $t_{m+N}$ and so on. If the beginning of a later interval exceeds $t_0+T$, the memory allocated to the first interval (from $t_0$ to $t_0+T$) can be freed in a systematic way and used for the new intervals.

Equipped with the above definitions, we present the steps to compute $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {X}_i,t_{k+1})$, $t_{k+1})$ from $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\{\boldsymbol {X}_{\alpha }\},t_k), t_k)$, where $\boldsymbol {X}_i$ is an arbitrary grid point in space and $\{\boldsymbol {X}_{\alpha }\}$ is a set consisting of $\boldsymbol {X}_i$ and its neighbours. In other words, our steps show how to calculate the partial LM (from $t_0$ to $t_{k+1}$) for the particle that passes through $\boldsymbol {X}_i$ at $t_{k+1}$ by knowing the LM (from $t_0$ to $t_k$) for the particles that pass through $\boldsymbol {X}_i$ and its neighbouring grid points at $t_{k}$. To simplify the notation we use one spatial index $i$, which can be viewed as a multielement vector or replaced by several indices in higher than one dimension. Considering that $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {X}_i,t_k),t_k)$ is the partial LM for the particle that reaches $X_i$ at $t_{k}$ and $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {X}_i,t_{k+1}),t_{k+1})$ is the partial LM for the particle that reaches $X_i$ at $t_{k+1}$, these two LMs belong to two separate trajectories (marked in red and blue in figure 1).

Step 1 Calculate $\phi (\boldsymbol {X}_i, t_{k+1})$ and the velocity $\boldsymbol {u}(\boldsymbol {X}_i, t_{k+1})$ by integrating the governing equations from $t_k$ to $t_{k+1}$. Note that a finer time step may be used for the time integration of the governing equations than that used for averaging; there might be several simulation time steps between $t_k$ and $t_{k+1}$.
Step 2 Consider the particle that goes through $\boldsymbol {X}_i$ at time $t_{k+1}$ (red trajectory in figure 1). Approximate the position of this particle at time $t_k$ by advecting it backward in time,
(2.5)\begin{equation} \boldsymbol{Y}_i = \boldsymbol{X}_i - (t_{k+1}-t_k) \boldsymbol{u}(\boldsymbol{X}_i, t_{k+1}). \end{equation}
This point is marked in green in figure 1 and splits the red trajectory into two parts.
Step 3 Denote the partial LM of $\phi$ between $t_0$ and $t_k$ for the red trajectory by $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {Y}_i,t_k),t_k)$. Then, find its value by interpolation using the LM for the trajectories that go through the neighbouring grid points (marked by blue curves). For example, $\boldsymbol {Y}_i$ lies between $\{\boldsymbol {X}_{\alpha }\}$ for ${\alpha } = i, j, l$ and $p$ in figure 1. Therefore, the four values of $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\{\boldsymbol {X}_{\alpha }\},t_k),t_k)$ may be used to interpolate $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {Y}_i,t_k),t_k)$. Note that $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {Y}_i,t_k),t_k)$ and $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {X}_i,t_k),t_k)$ are both averages over $[t_0,\ t_k]$ but for different trajectories (highlighted in orange and light blue, respectively).
Step 4 Selecting the mean position of the red trajectory from $t_0$ to $t_{k+1}$ as a label for the particle that passes through $\boldsymbol {X}_i$ at $t_{k+1}$ and denoting it by $\boldsymbol {x}_i$, use the definition (2.3) to compute
(2.6)\begin{align} &\overline{\phi}^{{\hspace{0.5mm}\small L}} (\boldsymbol{{\Xi}}^{{-}1}(\boldsymbol{X}_i,t_{k+1}),t_{k+1}) = \frac{1}{t_{k+1}-t_0} \int_{t_0}^{t_{k+1}} \phi(\boldsymbol{x}_i + \boldsymbol{{\xi}}(\boldsymbol{x}_i,\eta),\eta) \,{\rm d} \eta\nonumber\\ & = \frac{t_k-t_0}{t_{k+1} - t_0} \ \frac{1}{t_{k} - t_0} \int_{t_0}^{t_k} \phi(\boldsymbol{x}_i + \boldsymbol{{\xi}}(\boldsymbol{x}_i,\eta),\eta) \,{\rm d} \eta+ \frac{1}{t_{k+1} - t_0} \int_{t_k}^{t_{k+1}} \phi(\boldsymbol{x}_i + \boldsymbol{{\xi}}(\boldsymbol{x}_i,\eta),\eta) \,{\rm d} \eta\nonumber\\ & = \frac{t_k-t_0}{t_{k+1} - t_0}\ \overline{\phi}^{{\hspace{0.5mm}\small L}} (\boldsymbol{{\Xi}}^{{-}1}(\boldsymbol{Y}_i,t_k),t_k) +\frac{1}{t_{k+1} - t_0} \int_{t_k}^{t_{k+1}} \phi(\boldsymbol{x}_i + \boldsymbol{{\xi}}(\boldsymbol{x}_i,\eta),\eta) \,{\rm d} \eta. \end{align}
The first term on the right-hand side is already computed in the previous step. The second term, which is the weighted mean of the red trajectory from $t_k$ to $t_{k+1}$, can be approximated by
(2.7)\begin{equation} \int_{t_k}^{t_{k+1}} \phi(\boldsymbol{x}_i + \boldsymbol{{\xi}}(\boldsymbol{x}_i,\eta),\eta) \,{\rm d} \eta \approx \phi(\boldsymbol{X}_i,t_{k+1})\ (t_{k+1}-t_k). \end{equation}
A better approximation may be derived by the trapezoidal rule
(2.8)\begin{equation} \int_{t_k}^{t_{k+1}} \phi(\boldsymbol{x}_i + \boldsymbol{{\xi}}(\boldsymbol{x}_i,\eta),\eta) \,{\rm d} \eta \approx \tfrac{1}{2}\left[ \phi(\boldsymbol{Y}_i,t_{k}) +\phi(\boldsymbol{X}_i,t_{k+1}) \right] (t_{k+1}-t_k). \end{equation}
To calculate $\phi (\boldsymbol {Y}_i,t_{k})$, another interpolation is required. Hence, depending on the desired accuracy and computational cost, (2.8) may or may not be preferred to (2.7).

Figure 1. Schematic view of particle trajectories that pass through the grid points $\boldsymbol {X}_i$, $\boldsymbol {X}_j$, $\boldsymbol {X}_p$ and $\boldsymbol {X}_l$ that are marked by black. The trajectories that reach these grid points at $t_k$ are marked in blue and the trajectory that reaches the grid point at $t_{k+1}$ in red. For $\boldsymbol {X}_i$, the times associated with the particle position of red and blue trajectories are shown as well.

So far the calculation of the LM field in (2.6) is complete, but it is given for the end position of each particle $\boldsymbol {X}_i$. If the definition of GLM with the condition (2.4) is desired, the LM should be assigned to the mean position of particles. In other words, we need to calculate $\boldsymbol {x}_i = \boldsymbol {{\Xi }}^{-1}(\boldsymbol {X}_i,t_{k+1})$, which maps the particle's end position $\boldsymbol {X}_i$ to its mean from $t_0$ to $t_{k+1}$. To calculate the mean position, we update it in a similar manner to (2.6) at each time step via step 5.
Step 5 Assuming $\boldsymbol {{\Xi }}^{-1}(\boldsymbol {X},t_{k})$ is known for all grid points from the previous time step, compute the mean position of the red trajectory from $t_0$ to $t_{k+1}$,
(2.9)\begin{align} \boldsymbol{{\Xi}}^{{-}1}(\boldsymbol{X}_i,t_{k+1}) &= \frac{1}{t_{k+1}-t_0} \int_{t_0}^{t_{k+1}} \boldsymbol{x}_i + \boldsymbol{{\xi}}(\boldsymbol{x}_i,\eta) \,{\rm d} \eta\nonumber\\ & = \frac{t_k-t_0}{t_{k+1} - t_0}\ \boldsymbol{{\Xi}}^{{-}1}(\boldsymbol{Y}_i,t_{k}) +\frac{1}{t_{k+1} - t_0} \frac{1}{2}\left( \boldsymbol{Y}_i + \boldsymbol{X}_i \right) (t_{k+1}-t_k), \end{align}
where $\boldsymbol {{\Xi }}^{-1}(\boldsymbol {Y}_i,t_{k})$ is interpolated using the neighbouring grid points. Note that (2.9) is simply a particular case of (2.6) in conjunction with (2.8) for $\phi (\boldsymbol {x},t) = \boldsymbol {x}$.

The above steps are carried out for all the grid points and iteratively in time until $t_{k+1} = t_0+T$. The completed LM is now given on scattered mean positions $\boldsymbol {{\Xi }}^{-1}(\boldsymbol {X}_i,t_{N})$ and can be interpolated back on the computational grid points.

2.2. Parameter set-up, convergence and numerical error

The GBLA method may be implemented on temporally and spatially decimated fields. To distinguish between the two, we denote the (iterative) averaging time step with $\Delta t = t_{k+1} - t_k$ and the simulation time step (used to solve the governing equations) with $\delta t$ throughout this paper. Similarly, $\Delta \boldsymbol {x}$ denotes the spatial resolution of GBLA, which could be coarser than the one used in simulations (i.e. weather or climate models). It is important to keep $\Delta t$ and $\Delta \boldsymbol {x}$ smaller than the characteristic time and length scales of the fast phenomenon that is going to be filtered. To guarantee the stability of backward advection in (2.5) the Courant–Friedrichs–Lewy condition $\boldsymbol {u} \Delta t < \Delta \boldsymbol {x}$ should be satisfied.

Just like any numerical algorithm, GBLA is susceptible to numerical errors. Denoting the local error at each iteration (from $t_k$ to $t_{k+1}$) by $e_k$, there are three main contributors to $e_k$.

(i) Finite differentiation used to advect particles backward in (2.5). Considering the Taylor expansion of particle position on a trajectory, the difference between the exact and approximated particle position at time $t_k$ is $O(\Delta t^2)$.
(ii) Interpolation of LM at time $t_k$. In the step 3 of the algorithm, we interpolate $\overline {\phi }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {Y}_i,t_k),t_k)$, leading to an error that depends on the selected interpolation scheme. Considering polynomial interpolation of degree $n$ on a grid with spacing of $\Delta x$, one can show that the interpolation error is $O((\Delta x)^{n+1})$. For example, the linear interpolation error is $O((\Delta x)^2)$ and the cubic interpolation error $O((\Delta x)^3)$. Note that the order of error is independent of spatial dimension.
(iii) Approximation of the last integral in (2.6) either by (2.7) or (2.8). The error of the former is $O(\Delta t)$ and the latter $O(\Delta t^3 (\Delta x)^{n+1})$ considering the interpolation required to calculate $\phi (\boldsymbol {Y}_i,t_{k})$.

Considering that there are linear leading terms in polynomial interpolation, the error in (i) directly translates into the total local error $e_k$. As a result, we can write

(2.10)

\begin{equation} e_k = O(\Delta t^2) + O((\Delta x)^{n+1}) + O(\Delta t^s), \end{equation}

where $s =1$ or $3$ if (2.7) or (2.8) are used, respectively. Hence, depending on the value of $s$, the order of local error in $\Delta t$ may be determined by (i) or (ii). The calculation of global error, which is the accumulation of local errors, is more complicated. It cannot be derived simply by multiplying $e_k$ with the number of time steps $N = T/\Delta t$, because in the interpolation of the step 3 the neighbouring points contain the accumulated interpolation errors from the previous iterations. As a result, different sources of error become intertwined. Comparatively, the particle tracking approach does not have this type of error accumulation, because it uses the instantaneous values at neighbouring points for each time step. Hence, GBLA might be more affected by interpolation errors than the particle tracking methods. We leave the analytical expression of global error for a future study, but in § 3.1 and § 3.2.1 we calculate it numerically to investigate the convergence of GBLA.

3. Validation

3.1. Illustrative one-dimensional flows

We consider the following example with a velocity that consists of a fast and slow component:

(3.1a–c)

\begin{equation} u = u_{f} + u_{s}, \quad u_{f} = \epsilon \sin(20x - 20t), \quad u_{s} = \mathrm{e}^{{-}a(x-{\rm \pi})^2}, \end{equation}

where $a = 15$ for $x<{\rm \pi}$ and $a = 1$ for $x\geq {\rm \pi}$. Figure 2(a) displays the LM velocity of this flow regarded as a function of mean position using GBLA with cubic interpolation (shown in magenta) and particle tracking (shown in black). For both methods, $\Delta x = 4.2\times 10^{-3}$, $\Delta t = 2.6\times 10^{-3}$ and the averaging period $T = 2{\rm \pi} /3$, which is longer than the period of the fast signal $2{\rm \pi} /20$. The GBLA and particle tracking results (magenta and black curves) perfectly lie on top of each other, which confirms the validity of GBLA. To better quantify the accuracy and convergence of GBLA, in figure 2(b) we coarsen $\Delta x$ and calculate the following error norm (representing the global error for the averaging period of $T$):

(3.2)

\begin{equation} \Vert e \Vert=\sqrt{\frac{1}{N} \sum (\overline{u}^{{\hspace{0.5mm}\small L}} _{{GBLA}} - \overline{u}^{{\hspace{0.5mm}\small L}} _{{exact}})^2}, \end{equation}

where the summation is carried out over all spatial grid points, and $N$ is the number of spatial grid points. Here $\overline {u}^{{\hspace {0.5mm}\small L}} _{{GBLA}}$ is the LM calculated using GBLA and $\overline {u}^{{\hspace {0.5mm}\small L}} _{{exact}}$ is the exact value of LM which is approximated by particle tracking at much finer spatial and temporal resolutions of $\Delta x = 2 \times 10^{-3}$ and $\Delta t = 2.6 \times 10^{-3}$. Two different interpolation schemes are used in computing GBLA for figure 2(b): linear (black dots) and cubic (red dots). The overall message of this figure is that GBLA with different interpolation schemes converges to the exact solution if $\Delta x$ is lowered. At smaller $\Delta x$, GBLA's error with cubic interpolation has a slope close to 2, and with linear interpolation a slope slightly shallower than 1. At larger $\Delta x$ other sources of error contribute to $\Vert e \Vert$ leading to shallower slopes.

Figure 2. (a) The LM velocity of (3.1a–c) calculated for $\epsilon = 0.06$ from $t=0$ to $t=2{\rm \pi} /3$ for $\Delta x = 4.2\times 10^{-3}$ and $\Delta t = 2.6\times 10^{-3}$ using: GBLA (with cubic interpolation) as a function of mean position (magenta); particle tracking as a function of mean position (black); particle tracking as a function of position at the middle of the averaging interval (blue); and particle tracking as a function of position at the end of the averaging interval (green). (b) The error norm $\Vert e \Vert$ of the GBLA scheme using linear interpolation (black dots) and cubic interpolation (red dots). The slope of blue guide line is 1 and that of red line is 2.

As mentioned in the previous section, we choose the definition of GLM theory by requiring (2.4) to hold and using mean position as a label for LM. We use the above example to clarify the importance of this choice. In figure 2(a), in addition to expressing the LM as a function of mean position, we show the LM as a function of particle position at the middle of the averaging interval (blue curve) and as a function of particle position at the end of the averaging interval (green curve). We have used particle tracking methods to calculate these results. The black and blue curves preserve the overall shape of the slow signal, but the green curve fails to do so, which shows that the LM in terms of end position may distort the mean-flow signal. The black and blue curves have differences as well, which are more visible at the steep velocity gradient before $x={\rm \pi}$. To understand the reason behind this discrepancy, consider a particle that travels through the relatively flat part of velocity profile in the first half of the averaging interval and through the steep velocity jump in the second half. As a result, the average speed of this particle is small in the first half and large in the second half, leading to a relatively small change of position in the first half compared with the second half. Therefore, the mean position of this particle differs from its position at the middle of the averaging interval (which is closer to the position at the beginning of the interval).

In figure 3 we increased the wave amplitude to $\epsilon = 0.15$ to illustrate how the GLM definition of LM can be more effective in removing the fast time scale (all other parameters are the same as those used for figure 2a). The LM at different times is shown in terms of the mean position calculated by GBLA (magenta curve) and in terms of the position at the middle of the averaging interval calculated by particle tracking (blue curve). There is still a noticeable trace of fast signal in the blue curves, which propagates rightward in time. This is because the particle position at the middle of the averaging period oscillates with the fast time scale, whereas the mean position is less affected by the these oscillations. This example is deliberately chosen to amplify the difference for various choices of spatial labels. For smaller-amplitude waves and milder background velocity gradients the magenta and blue curves converge to each other. Applying Lagrangian filtering to oceanic flows, many authors have used the final position as a label for LM (Shakespeare & Hogg Reference Shakespeare and Hogg2017, Reference Shakespeare and Hogg2019) or the position at the middle of the averaging interval (Shakespeare et al. Reference Shakespeare, Gibson, Hogg, Keating, Bachman and Velzeboer2021). If a more accurate removal of fast time scale is desired, expressing LM in terms of mean position should be considered instead.

Figure 3. The LM velocity of (3.1a–c) over the period of $2{\rm \pi} /3$, calculated for $\epsilon = 0.15$ at different times using: GBLA in terms of mean position (magenta) and particle tracking in terms of position at the middle of the averaging interval (blue).

3.2. Using a materially conserved quantity to investigate the validity of GBLA

In this section, we consider the materially conserved quantities – which are constant on each trajectory – to study the accuracy of GBLA. For these quantities, the LM expressed in terms of final position should be equal to their instantaneous values at the end of the averaging interval. We employ this fact to see how reliable GBLA is. We remind the reader that this is merely for studying the validity of GBLA; as discussed before using the GLM definition and expressing the LM in terms of mean position instead of final position is preferred when it comes to the applications of Lagrangian averaging. As a result, step 5 of § 2 is skipped in the calculations of this part.

3.2.1. Conservation of vorticity in two-dimensional incompressible flow

For two-dimensional incompressible inviscid flows, vorticity is conserved on particle trajectories,

(3.3a,b)

\begin{equation} \frac{\partial \zeta}{\partial t} + u \frac{\partial \zeta}{\partial x} + v \frac{\partial v}{\partial y} = 0, \quad \zeta = \frac{\partial v}{\partial x} - \frac{\partial u}{\partial y}. \end{equation}

As a result, $\zeta (\boldsymbol {X},t) = \overline {\zeta }^{{\hspace {0.5mm}\small L}} (\boldsymbol {{\Xi }}^{-1}(\boldsymbol {X},t),t)$, where $\boldsymbol {X} = (x,y)$. Therefore, to examine GBLA we derive the LM vorticity (expressed in terms of the final particle position) and compare it with the instantaneous vorticity field at the end of the averaging interval. We initialise the numerical simulation with two like-signed vortices,

(3.4)

$$\begin{gather} \zeta(x,y,t=0) = \exp({-(x-{\rm \pi}+0.1)^2 -(y-{\rm \pi}+{\rm \pi}/3)^2 })\nonumber\\ + \exp({-(x-{\rm \pi}-0.1)^2 - (y-{\rm \pi}-{\rm \pi}/3)^2}). \end{gather}$$

We employ a standard pseudo-spectral code to solve (3.3a,b) on a $2{\rm \pi} \times 2{\rm \pi}$ domain with periodic boundary conditions using $256 \times 256$ grids points. The time step for implementing the averaging process is 10 times coarser than the simulation time step, $\delta t = 0.002$. Figure 4 portrays the LM vorticity calculated by different methods next to the instantaneous vorticity at the end of the averaging interval for two different averaging periods. For the shorter averaging interval, all methods lead to very similar results. However, for the longer interval, GBLA with linear interpolation is less accurate than the others. The comparison between panels (c) and (h), and (d) and (i), shows that the use (2.8) instead of (2.7) has not made any discernible difference, because the interpolation error exceeds other types of error due to the relatively small time step.

Figure 4. Instantaneous and LM vorticity field for two like-signed vortices. Panels (a–e) correspond to fields at $t=10$ and ( f–j) to $t=30$. (a, f) Instantaneous vorticity; (b,g) GBLA using linear interpolation and (2.7); (c,h) GBLA using cubic interpolation and (2.7); (d,i) GBLA using cubic interpolation and (2.8); (c,h) LM by particle tracking and using cubic interpolation.

To have a more quantitative comparison between different methods, we calculate the following error norm (a two-dimensional version of the norm used in (3.2)):

(3.5)

\begin{equation} \Vert e \Vert=\sqrt{\frac{1}{N_x N_y} \sum_i (\overline{\zeta}^{{\hspace{0.5mm}\small L}} (\boldsymbol{{\Xi}}^{{-}1}(\boldsymbol{X}_i,T),T)-\zeta(\boldsymbol{X}_i,T))^2 }, \end{equation}

where the summation is carried out over all spatial grid points, and $N_x$ and $N_y$ are the number of grid points in each dimension. Here $\zeta (\boldsymbol {X}_i,T)$ is an approximation for the exact LM, which is obtained by numerically solving (3.3a,b) at a high resolution of $384 \times 384$ and small time step of $\delta t = 0.002$. Here $\overline {\zeta }^{{\hspace {0.5mm}\small L}}$ is calculated using different Lagrangian averaging methods with temporal and spatial distancing that are lowered to investigate their convergence and accuracy.

In figure 5(a) we set the averaging time step to $\Delta t = 0.02 = 10\, \delta t$, which is small enough to highlight the interpolation error (see § 2.2 for discussion on different sources of error). The GBLA method with linear interpolation and using (2.7) (red dots) underperforms GBLA and particle tracking with cubic interpolation (black squares and blue circles, respectively). However, after using (2.8) even GBLA with linear interpolation (magenta triangles) yields accurate results within the same ‘ballpark’ as cubic interpolation and particle tracking approaches. With cubic interpolation, the convergence of GBLA is $O((\Delta x)^2)$, and that of particle tracking is $O(\Delta x)$. This is similar to the spatial convergence of error observed in figure 2(b) for the one-dimensional synthetic flow. For large $\Delta x$, GBLA with cubic interpolation has higher errors than its particle tracking counterpart, but by lowering $\Delta x$ both methods converge to the same result.

Figure 5. (a) Global error using different spatial resolution. (b) Global error using different averaging time steps. Error norm is calculated for different schemes namely: GBLA with linear interpolation and using (2.7) (red dots); GBLA with cubic interpolation and using (2.7) (black squares); particle tracking with cubic interpolation (blue circles); and GBLA with linear interpolation and using (2.8) (magenta triangles). The blue dashed lines have the slope of 1, and the black line has the slope of 2.

In figure 5(b) the convergence of different methods with respect to $\Delta t$ is investigated. For all the calculations of this plot $\Delta x = \Delta y = 2{\rm \pi} /384$. Similar to the figure 5(a), the error of GBLA with linear interpolation and using (2.7) (red dots) is almost two orders of magnitude higher than the other methods. Use of (2.8) reduces the error of linear methods by more than one order of magnitude (compare the red dots with magenta triangles). At higher $\Delta t$, all methods seem to have a slope close to one. However, at small $\Delta t$, the methods with linear interpolation plateau as they get dominated by interpolation errors.

Considering that GBLA is an iterative algorithm, different sources of error are intertwined, and tracking their effects separately is complicated. However, from the numerical investigation of global errors in figure 5, we can draw several insightful conclusions. Different variations of GBLA converge robustly by lowering $\Delta x$ and $\Delta t$. The GBLA method is more affected by the interpolation error than the particle tracking, which can be mitigated by using cubic interpolation (or other higher-order schemes). If in some applications linear interpolations are selected to lighten the computations, the choice of (2.8) over (2.7) substantially improves the result.

3.2.2. Conservation of potential vorticity in shallow water

To put our method to a more challenging test, we consider a fully developed turbulent flow evolving under the rotating shallow-water equations on flat topography,

(3.6a)

$$\begin{gather} \frac{\partial \boldsymbol{u}}{\partial t} + \boldsymbol{u} \boldsymbol{\cdot} \boldsymbol{\nabla} \boldsymbol{u} + \boldsymbol{f} \times \boldsymbol{u} ={-}g \boldsymbol{\nabla} h, \end{gather}$$

(3.6b)

$$\begin{gather}\frac{\partial \boldsymbol{u}}{\partial t} + \boldsymbol{\nabla} \boldsymbol{\cdot} (h \boldsymbol{u}) = 0, \end{gather}$$

where $h$ the height field, $g$ the gravitational acceleration and $\boldsymbol {f} = f \hat {\boldsymbol {z}}$ the Coriolis parameter. The existence of vortices of different sizes and sharp fronts in turbulent shallow water systems allow us to better examine the strengths and weaknesses of our method. For this flow, the PV,

(3.7)

\begin{equation} q = \frac{\boldsymbol{\nabla} \times \boldsymbol{u} + f}{h}, \end{equation}

is conserved on each particle trajectory. This material conservation can be used in a similar way to the previous section to assess the accuracy of Lagrangian averaging; the LM PV expressed in terms of final position should be equal to the instantaneous PV at the end of the averaging interval. Selecting $f = 0.1$, $g = 0.1$ and the mean height $H = 1$, we solve the nonlinear shallow water equation using a similar pseudo-spectral method with the same computational resolution, domain size and boundary condition as in § 3.2.1. A viscous damping of the form $\nu \nabla ^2$ with $\nu = 8 \times 10^{-7}$ is included for numerical stability. We initialise the simulations with a fully developed turbulent flow which is initially in a geostrophic balance (see the initial PV in figure 6b). The time step of the simulation is $\delta t = 5 \times 10^3$, and that of GBLA $\Delta t = 2 \times 10^3$. The instantaneous PV at $t=120$ in figure 6(a) is remarkably close to the LM PV over the period of $[0,\ 120]$ in figure 6(c), where small-scale features are also captured by GBLA (see figure 6d for the difference between the two). The small disagreement is partly caused by the viscous damping that makes the conservation of PV less accurate. This is particularly the case for the sharp fronts in figure 6(c) which are dampened in figure 6(a).

Figure 6. (a) The potential vorticity (PV) at the end of the averaging interval $t=120$. (b) The PV at $t=0$. (c) The LM PV from $t=0$ to $t=120$ using GBLA with a cubic interpolation scheme. (d) Absolute value of the difference between (a,c). Panels (a–c) share the same colour bar, which is different from that of panel (d).

3.3. Stokes terms for standing wave in shallow water

The difference between the Lagrangian and Eulerian mean is termed Stokes correction (see e.g. Andrews & McIntyre Reference Andrews and McIntyre1978; Bühler Reference Bühler2009) and can be written as

(3.8)

\begin{equation} \overline{\phi}^{ {\hspace{0.5mm}\small S}} = \overline{\phi}^{{\hspace{0.5mm}\small L}} - \bar{\phi} = \overline{(\boldsymbol{{\xi}} \boldsymbol{\cdot} \boldsymbol{\nabla}) \phi'} + O(\epsilon^2), \quad \phi' = \phi(\boldsymbol{x},t) - \bar{\phi}, \end{equation}

where the first term in the right-hand side is the leading-order approximation for this quantity. Here $\epsilon$ is the amplitude of the wave term $\phi '$ after proper scaling, where we assumed that $\epsilon \ll 1$. We consider a monochromatic standing wave of the form

(3.9)

\begin{align} h'(x,y,t) = A \cos(\omega t) \hat{h}(x,y), \quad \hat{h}(x,y) = \cos(kx) \cos(ly), \end{align}

(3.10)

\begin{align} \boldsymbol{u}'(x,y,t) = \frac{g}{\omega^2 - f^2}( - \omega \sin(\omega t) \boldsymbol{\nabla} \hat{h} + \cos(\omega t) \boldsymbol{f} \times \boldsymbol{\nabla} \hat{h}), \end{align}

where $A$ is a small amplitude, is a solution of linear rotating shallow water equations provided that $\omega$ satisfies the dispersion relation

(3.11)

\begin{equation} \omega = \sqrt{f^2+g H (l^2 + k^2)}. \end{equation}

Thomas, Bühler & Smith (Reference Thomas, Bühler and Smith2018) showed this standing wave induces a mean flow characterised by the Stokes terms

(3.12a,b)

\begin{align} \overline{h}^{ {\hspace{0.5mm}\small S}} = \frac{1}{2} \frac{g}{\omega^2 - f^2} \left| \boldsymbol{\nabla} \hat{h} \right|^2, \quad \overline{\boldsymbol{u}}^{ {\hspace{0.5mm}\small S}} ={-}\left( \frac{g}{\omega^2 - f^2} \right)^2 \boldsymbol{f} \times \boldsymbol{\nabla} \left( \frac{1}{2} \left| \boldsymbol{\nabla} \hat{h} \right|^2 + \frac{\omega^2 - f^2}{4 g H} \hat{h}^2 \right). \end{align}

The above analytical expressions provide a good test case for GBLA. We perform a numerical simulation for

(3.13a–c)

\begin{equation} A = 0.001, \quad f=g=H=1, \quad k = l =1, \end{equation}

and compare $\overline {h}^{{\hspace {0.5mm}\small L}} -\bar {h}$ and $\overline {\boldsymbol {u}}^{{\hspace {0.5mm}\small L}} -\bar {\boldsymbol {u}}$ calculated by the grid-based averaging method with the analytical expressions in (3.12a,b). There is an excellent agreement between these two sets of quantities in figure 7.

Figure 7. Stokes terms computed by GBLA (a–c) and from analytical expression in (3.12a,b) (d–f): (a,d) $\overline {u}^{ {\hspace {0.5mm}\small S}}$; (b,e) $\overline {v}^{ {\hspace {0.5mm}\small S}}$; (c, f) $\overline {h}^{ {\hspace {0.5mm}\small S}}$. The averaging interval is set to $2{\rm \pi} /\omega$.

4. Application: wave-averaged geostrophic balance

Geostrophic balance reliably describes the slow dynamics of large-scale geophysical flows when the waves are absent or relatively small. However, in the presence of waves that are at the same order or stronger than the mean flow, this balance breaks down even after averaging over the fast time scale at fixed points. Generalised Lagrangian-mean theories have predicted that a modified version of geostrophic balance can still hold with coexisting strong waves if it is applied to LM quantities (Moore Reference Moore1970; Bühler & McIntyre Reference Bühler and McIntyre1998; Wagner & Young Reference Wagner and Young2015; Xie & Vanneste Reference Xie and Vanneste2015; Thomas et al. Reference Thomas, Bühler and Smith2018). Referring to this as wave-averaged balance, Kafiabad et al. (Reference Kafiabad, Vanneste and Young2021) numerically illustrated its validity for a single barotropic vortex interacting with strong inertial waves in the context of Boussinesq equations. Instead of directly calculating the LM, they used explicit expressions of Stokes terms derived by Rocha, Wagner & Young (Reference Rocha, Wagner and Young2018) in the limit of near-inertial waves. This approach circumvented the complications of particle tracking in parallelised Boussinesq simulation. However, for baroclinic flows and waves with $\omega \gg f$, explicit expressions for the Stokes terms are not available. Therefore, the LM has to be computed directly, and GBLA makes this computation straightforward. In this study, we touch upon this potential application of GBLA by investigating wave-averaged geostrophy in the context of rotating shallow water.

We first present a brief derivation of wave-averaged balance for the momentum equations in a form that is slightly different than the existing expressions in the literature (a more detailed derivation for shallow water can be found in Thomas et al. (Reference Thomas, Bühler and Smith2018) and for the Boussinesq equations in Wagner & Young (Reference Wagner and Young2015)). We then quantify to what extent this balance holds for waves with different parameters and compare it with Eulerian-averaged geostrophic balance. Similar to preceding studies, the underlying assumption in our derivation is that the flow consists of fast waves with small amplitude $\epsilon \ll 1$, interacting with a slow flow of the order $\epsilon ^2$. This relative scaling of waves and slow flow is more a matter of convenience than necessity as the effect of waves on the mean flow appears at lower orders. For waves at the same order as the slow flow, similar results can be obtained with more tedious algebra and expansions to higher orders. Considering these assumptions, we expand flow variables as

(4.1)

\begin{equation} \boldsymbol{u} = \epsilon \boldsymbol{u} _1 + \epsilon^2 \boldsymbol{u} _2 + O(\epsilon^{3}), \end{equation}

where $\boldsymbol {u} _1$ denotes the wave terms with $\overline {\boldsymbol {u} _1} = 0$, and the mean flow at the leading order is $\bar {\boldsymbol {u} } = \epsilon ^2 \overline {\boldsymbol {u} _2} + O(\epsilon ^{3})$ (a similar expansion is carried out for $h$). Additionally, the mean flow is assumed to vary on the slow time scale of quasigeostrophic dynamics, leading to $\partial \overline {\boldsymbol {u} _2}/ \partial t = O(\epsilon ^2)$. At order $\epsilon$, the expansion of (3.6a) leads to

(4.2)

\begin{equation} \frac{\partial \boldsymbol{u} _1}{\partial t} + \boldsymbol{f} \times \boldsymbol{u} _1 ={-}g \boldsymbol{\nabla} h_1. \end{equation}

The next-order equation after averaging is

(4.3)

\begin{equation} \overline{\boldsymbol{u} _1 \boldsymbol{\cdot} \boldsymbol{\nabla} \boldsymbol{u} _1} + \boldsymbol{f} \times \overline{\boldsymbol{u} _2} ={-}g \overline{\boldsymbol{\nabla} h_2}. \end{equation}

After taking the time derivative of (2.2) and some algebraic manipulation, one can show that $\boldsymbol {u} _1 = \partial \boldsymbol {{\xi }} / \partial t + O(\epsilon )$ (see Andrews & McIntyre (Reference Andrews and McIntyre1978) for the detailed derivation). As a result,

(4.4)

\begin{equation} \overline{\boldsymbol{u} _1 \boldsymbol{\cdot} \boldsymbol{\nabla} \boldsymbol{u} _1} ={-} \overline{\boldsymbol{{\xi}} \boldsymbol{\cdot} \boldsymbol{\nabla} \frac{\partial \boldsymbol{u} _1}{\partial t}}. \end{equation}

Substituting $\partial \boldsymbol {u} _1 / \partial t$ from (4.2) in the above equation and using (3.8), we derive

(4.5)

\begin{equation} \overline{\boldsymbol{u} _1 \boldsymbol{\cdot} \boldsymbol{\nabla} \boldsymbol{u} _1} = \boldsymbol{f} \times \overline{\boldsymbol{{\xi}} \boldsymbol{\cdot} \boldsymbol{\nabla} \boldsymbol{u} _1} + g \overline{\boldsymbol{{\xi}} \boldsymbol{\cdot} \boldsymbol{\nabla} (\boldsymbol{\nabla} h_1)} = \boldsymbol{f} \times \overline{\boldsymbol{u}}^{ {\hspace{0.5mm}\small S}} + g \overline{\boldsymbol{\nabla} h}^{ {\hspace{0.5mm}\small S}} , \end{equation}

(4.5) and (4.3) lead to

(4.6)

\begin{equation} \boldsymbol{f} \times \left( \bar{\boldsymbol{u}} + \overline{\boldsymbol{u}}^{ {\hspace{0.5mm}\small S}} \right)={-}g \left( \overline{\boldsymbol{\nabla} h} + \overline{\boldsymbol{\nabla} h}^{ {\hspace{0.5mm}\small S}} \right), \end{equation}

where we replaced $\overline {\boldsymbol {u} _2}$ with $\bar {\boldsymbol {u}}$ (and removed the book-keeping parameter $\epsilon$). Rewriting (4.6) in terms of LM variables gives to the final form of wave-averaged geostrophy

(4.7)

\begin{equation} \boldsymbol{f} \times \overline{\boldsymbol{u}}^{{\hspace{0.5mm}\small L}} ={-}g \overline{\boldsymbol{\nabla} h}^{{\hspace{0.5mm}\small L}} . \end{equation}

We emphasise that $\overline {\boldsymbol {\nabla } h}^{{\hspace {0.5mm}\small L}}$ is the LM of the height gradient which is different from the gradient of the LM height $\boldsymbol {\nabla } \overline {h}^{{\hspace {0.5mm}\small L}}$. The right-hand side of (4.7) can also be written as $-g \boldsymbol {\nabla } ( \bar {h} + \overline {h}^{ {\hspace {0.5mm}\small S}} /2)$ (see e.g. Thomas et al. (Reference Thomas, Bühler and Smith2018) or Wagner & Young (Reference Wagner and Young2015) for the counterpart pressure gradient in the Boussinesq equations). We prefer to keep the current form of (4.7) as expressions in terms of $\overline {h}^{ {\hspace {0.5mm}\small S}} /2$ unnecessarily complicate the computation and derivation.

To verify (4.7), we study two examples of slow flow interacting with strong waves. In the first example, we consider an initial condition that consists of a Gaussian anticyclone similar to what was studied by Kafiabad et al. (Reference Kafiabad, Vanneste and Young2021). The vertical vorticity of this vortex is defined as

(4.8)

\begin{equation} \zeta(r, t = 0) ={-} \zeta_{{max}} \, \mathrm{e}^{{-}r^2/a^2}, \end{equation}

where $r$ is the radial coordinate, $a$ the vortex radius and $\zeta _{{max}}$ the initial maximum vorticity. The velocity fields associated with this vorticity are set to zero far from the vortex, and the associated height field is defined such that the geostrophic balance is initially maintained. This vortex is superimposed with a Poincaré wave of the form

(4.9a–c)

\begin{equation} u' = A \cos x, \quad v' = \frac{f}{\omega} A \sin x, \quad h' = H + \frac{H}{\omega} A \cos x, \end{equation}

where $\omega$ satisfies (3.11) (for $k=1$ and $l = 0$). Selecting $a = 0.4$, $f= 20$, $g = H = 1$, $\zeta _{{max}} = 0.5$ and $A = 3$, we numerically solve the nonlinear shallow-water equations for three periods of the initial wave and time average the variables over this interval ($3 \times 2{\rm \pi} /\omega$). The time step – which is the same for both the simulation and averaging – is set to $10^{-4}$, and all other parameters are kept the same as those in § 3.2.2.

Figure 8 portrays the vectors of Lagrangian- and Eulerian-mean Coriolis forces and height gradients (scaled by $g$) using GBLA. It is clear that the geostrophic balance holds very well for the LM values as described by (4.7), whereas the Eulerian-mean version has broken down. In the background $\boldsymbol {\nabla } \times \overline {\boldsymbol {u}}^{{\hspace {0.5mm}\small L}}$ and $\boldsymbol {\nabla } \times \bar {\boldsymbol {u}}$ are shown, which are slightly different fields. Unlike the initial vortex, $\boldsymbol {\nabla } \times \bar {\boldsymbol {u}}$ (and to a lesser extent $\boldsymbol {\nabla } \times \overline {\boldsymbol {u}}^{{\hspace {0.5mm}\small L}}$) are not completely radially symmetric due to the wave feedback on the mean flow.

Figure 8. The vectors of Lagrangian- and Eulerian-mean Coriolis forces and height gradients (scaled by $g$) using GBLA: black arrows, $\boldsymbol {f} \times \overline {\boldsymbol {u}}^{{\hspace {0.5mm}\small L}}$ (a) and $\boldsymbol {f} \times \bar {\boldsymbol {u}}$ (b); Magenta arrows, $g \overline {\boldsymbol {\nabla } h}^{{\hspace {0.5mm}\small L}}$ (a) and $g {\boldsymbol {\nabla } \bar {h}}$ (b). The background colour shows $\boldsymbol {\nabla } \times \overline {\boldsymbol {u}}^{{\hspace {0.5mm}\small L}}$ (a) and $\boldsymbol {\nabla } \times \bar {\boldsymbol {u}}$ (b). Only a part of resolved domain is shown.

To better investigate the accuracy of (4.7), we take a cross-section of this flow at $y=0$ in figure 9 and show the profiles of Coriolis force and (scaled) height gradient. The most outstanding difference in this figure is between $-f \bar {u}$ and $g \overline {h_y}$ (the dashed lines in figure 9b), which is expected from the form of the initial wave. Let us assume a hypothetical scenario that the linear wave in (4.9a–c) does not interact with the vortex. The following Stokes expressions can be derived for this linear wave:

(4.10a)

$$\begin{gather} f\, \overline{ \boldsymbol{{\xi}} \boldsymbol{\cdot} \boldsymbol{\nabla} u'} = f\ \overline{\frac{-A}{\omega} \sin (x-\omega t) \frac{\partial}{\partial x} A\cos(x-\omega t)} = f \frac{A^2}{2 \omega} \approx 4.5, \end{gather}$$

(4.10b)

$$\begin{gather}f\, \overline{ \boldsymbol{{\xi}} \boldsymbol{\cdot} \boldsymbol{\nabla} v'} = g\ \overline{ \boldsymbol{{\xi}} \boldsymbol{\cdot} \boldsymbol{\nabla} h_x} = g\ \overline{ \boldsymbol{{\xi}} \boldsymbol{\cdot} \boldsymbol{\nabla} h_y} = 0, \end{gather}$$

where the averaging is carried over one or several wave periods. In the presence of a vortex, the fast component of the flow changes in time leading to more complicated Stokes expressions than those in (4.10). Nevertheless, these values can provide a ‘ballpark’ estimate for the difference between the Eulerian and Lagrangian velocities. This explains the significant difference between $-f \bar {u}$ and $g \overline {h_y}$ as we expect $\overline {u}^{ {\hspace {0.5mm}\small S}}$ to be the largest Stokes term from (4.10). In fact, several other numerical simulations with different wave amplitudes have been performed to verify that this difference proportionally changes with $f A^2 / (2 \omega )$. Leaving this major difference aside, $f \overline {v}^{{\hspace {0.5mm}\small L}}$ and $g \overline {h_x}^{{\hspace {0.5mm}\small L}}$ are also closer to each other than their Eulerian counterparts. Even after removing the constant shift between $-f \bar {u}$ and $g \overline {h_y}$, $- f \overline {u}^{{\hspace {0.5mm}\small L}}$ more closely follows $g \overline {h_y}^{{\hspace {0.5mm}\small L}}$. These observations illustrate the merits of wave-averaged balance over classical geostrophic balance when strong waves are present.

Figure 9. The profiles of different components of Coriolis force and (scaled) height gradient at $y=0$. All the quantities in (a) are Lagrangian averaged, and in (b) Eulerian averaged.

Figure 10 displays the scatter plots of Lagrangian and Eulerian means for $g h_x$ and $g h_y$ as functions of $f v$ and $-f u$, respectively (using all the computational grid points). For perfect geostrophic balance, one expects the data points to lie on $y=x$ (shown in black in figure 10). We consider three simulations with $g=H=1$, $g=H=6$ and $g=H=10$ leading to wave frequencies of $\omega = 20.03$, $20.88$ and $22.36$, respectively. It is clear that the geostrophic balance holds much better for the LM quantities as the red markers are closer to $y=x$ than the green ones for all three cases. Considering these different wave frequencies implies that (4.7) is a good approximation in the presence of non-inertial waves as well as inertial waves. Unfortunately, we could not investigate much higher wave frequencies due to the formation of shocks at the high wave-amplitude, which is considered in our simulations.

Figure 10. The scaled height gradient as a function of the Coriolis force after Lagrangian averaging (in red) and Eulerian averaging (in green). The black line is $y=x$, which represents the geostrophic balance. Every marker represents the mean values for a computation grid point in space: (a,b) $g=H=1$; (c,d) $g=H=6$; (e, f) $g=H=10$. All other parameters are kept the same as those in figures 8 and 9.

In the next example, we initialise the slow flow with two opposite-sign vortices,

(4.11a–c)

\begin{equation} u ={-}\cos y, \quad v = \cos x, \quad h ={-} \frac{f}{g} (\sin x+\sin y). \end{equation}

A similar form of wave to (4.9a–c) is superimposed with these vortices. We set the flow and wave parameters as $f= 100$, $g = 100$, $H =4$ and $A = 20$. All other parameters and the numerical set-up are kept the same as the previous example. In figure 11, we observe that the vectors of $\boldsymbol {f} \times \overline {\boldsymbol {u}}^{{\hspace {0.5mm}\small L}}$ and $g \overline {\boldsymbol {\nabla } h}^{{\hspace {0.5mm}\small L}}$ balance each other very well (figure 11a), whereas the balance for their Eulerian counterparts breaks down (figure 11b). The background fields indicate that $\boldsymbol {\nabla } \times \bar {\boldsymbol {u}}$ is more affected by the wave signal than $\boldsymbol {\nabla } \times \overline {\boldsymbol {u}}^{{\hspace {0.5mm}\small L}}$ is. Further assurance comes from figure 12 where the LM Coriolis force components much more closely follow the LM scaled height-gradient components. Figure 13 also verifies that the Eulerian-mean quantities are farther from $y=x$ and wider spread.

Figure 11. Similar figure to figure 8 for two opposite-sign vortices described in (4.11a–c). The entire computational domain is shown in this figure.

Figure 12. Similar figure to figure 9 for two opposite-sign vortices described in (4.11a–c).

Figure 13. Similar figure to figure 10 for two opposite-sign vortices described in (4.11a–c).

5. Discussion

Through several examples, we find that GBLA reliably calculates the LM of flow variables and has several advantages over its particle tracking counterparts. Particle-tracking methods are performed either forward or backward in time to calculate LM fields. The forward-time tracking cannot guarantee that the particles at a later time are uniformly distributed. In fact, in regions of confluence the particles get cluttered in a small part of domain. The backward particle tracking does not suffer from this issue but requires storing times series of several fields: the instantaneous value of the desired field, its interpolated values at particle positions, particle positions along the trajectories and velocity components. The GBLA method does not need to store any time series; it only stores a few fields such as partial means at two time steps. As a result, it can significantly reduce the memory consumption if a large number of grid points or time steps are employed. For the LM of any field other than velocity, GBLA does not require interpolation of velocities, whereas the particle tracking methods do. For instance, assume that the LM temperature needs to be calculated. For particle tracking, in addition to the temperature, the velocity field needs to be interpolated at the particles’ instantaneous positions to advect them backward or forward. However, for GBLA there is no need for interpolating velocities, and only one set of interpolation for temperature partial means is needed; as a result, the amount of interpolation in GBLA is at least half of those in particle tracking. Depending on the spatial dimension, it could be even less than half. For example in three-dimensional flows, it is only a quarter of the interpolations required by particle tracking.

All the advantages said, we caution that the GBLA results could be affected by interpolation errors for long averaging periods or coarse spatial resolutions; better interpolation schemes and finer time steps can significantly reduce these errors. Our numerical investigations establish a robust spatial and temporal convergence of GBLA even for linear interpolation. Hence, by using finer grids one obtains reliable LM fields.

In this study, we explained the averaging process only for a single interval, but in some applications the evolution of mean flow or wave field in time is desired. This can be achieved by a moving time-window that frees up the memory from completed intervals and allocates it to the new intervals. Note that GBLA does not require the discretisation in time or space to be uniform. Hence, it can be improved by adaptive time stepping or a more advanced interpolation scheme. Another direction of improvement is by using more advanced filtering schemes. Many studies in geophysical fluids employ low-pass filtering in Fourier space (see e.g. Shakespeare & Hogg Reference Shakespeare and Hogg2017; Shakespeare et al. Reference Shakespeare, Gibson, Hogg, Keating, Bachman and Velzeboer2021), which can be more effective than simple averaging used in (2.3). This filtering (as well as other advanced filtering schemes) can be achieved by using GBLA, if an appropriate kernel is included in the definition of mean in (2.3). Considering other definitions of mean/perturbation than the one used in (2.3) and (2.4) (Soward & Roberts Reference Soward and Roberts2010; Gilbert & Vanneste Reference Gilbert and Vanneste2018), it is an open question to the author whether a similar approach to GBLA can be used to derive these forms of mean.

We mainly focused on the description and validation of our method and touched upon one application of GBLA. We leave other applications of GBLA for future work. Recent studies such as Shakespeare & Hogg (Reference Shakespeare and Hogg2017, Reference Shakespeare and Hogg2018, Reference Shakespeare and Hogg2019) have demonstrated that Lagrangian filtering is more effective in extracting wave signals from the flow. The GBLA method can make Lagrangian filtering more accessible by making the parallel implementation easier and reducing the memory footprint. Another application of GBLA is the study of the reduced models described for LM quantities (e.g. Xie & Vanneste Reference Xie and Vanneste2015; Wagner & Young Reference Wagner and Young2015), which was briefly considered in this paper. Studying these models in the context of Boussinesq equations is often challenging due to complications of particle tracking in high-resolution parallelised solvers. The GBLA method can also be applied to observations or more realistic global circulation models to gain more insight about the ocean and climate dynamics.

Ultimately, GBLA can improve the parameterisation schemes used in ocean and weather models that need to dissipate energy at much larger scales than viscous scales. Such dissipation – which is required for numerical stability and closing the energy budget – reduces the effective resolution of these models (see e.g. Skamarock Reference Skamarock2004). Several studies have shown that the unbalanced and fast processes such as waves dynamics initiate the downscale flux of energy toward dissipation range in the ocean and atmosphere (Waite & Snyder Reference Waite and Snyder2009; Barkan, Winters & McWilliams Reference Barkan, Winters and McWilliams2017; Kafiabad & Bartello Reference Kafiabad and Bartello2018; Taylor & Straub Reference Taylor and Straub2020; Xie Reference Xie2020). Hence, one can argue that the dissipation of the fast component after removing the LM preserves the slow dynamics better than the less selective dissipation of all fields or their Eulerian high-pass part.

Acknowledgements

Professor J.Vanneste is thanked for insightful discussions and suggestions that greatly improved the manuscript.

Declaration of interests

The author reports no conflict of interest.

References

REFERENCES

Andrews, D.G. & McIntyre, M.E. 1978 An exact theory of nonlinear waves on a Lagrangian-mean flow. J. Fluid Mech. 89, 609–646.CrossRef Google Scholar

Asselin, O. & Young, W.R. 2020 Penetration of wind-generated near-inertial waves into a turbulent ocean. J. Phys. Oceanogr. 50 (6), 1699–1716.CrossRef Google Scholar

Bachman, S.D., Shakespeare, C.J., Kleypas, J., Castruccio, F.S. & Curchitser, E. 2020 Particle-based lagrangian filtering for locating wave-generated thermal refugia for coral reefs. J. Geophys. Res.: Oceans 125 (7), e2020JC016106.CrossRef Google Scholar

Barkan, R., Winters, K.B. & McWilliams, J.C. 2017 Stimulated imbalance and the enhancement of eddy kinetic energy dissipation by internal waves. J. Phys. Oceanogr. 47 (1), 181–198.CrossRef Google Scholar

Bretherton, F.P. 1971 The general linearized theory of wave propagation. In Mathematical Problems in the Geophysical Sciences (ed. W.H. Reid), Lectures in Applied Mathematics, vol. 13, pp. 61–102. American Mathematical Society.Google Scholar

Bühler, O. 2009 Waves and Mean Flows. Cambridge University Press.CrossRef Google Scholar

Bühler, O. & McIntyre, M.E. 1998 On non-dissipative wave–mean interactions in the atmosphere or oceans. J. Fluid Mech. 354, 301–343.CrossRef Google Scholar

Charney, J.G. 1971 Geostrophic turbulence. J. Atmos. Sci. 28 (6), 1087–1095.2.0.CO;2>CrossRef Google Scholar

Gilbert, A.D. & Vanneste, J. 2018 Geometric generalised Lagrangian-mean theories. J. Fluid Mech. 839, 95–134.CrossRef Google Scholar

Grimshaw, R. 1975 Nonlinear internal gravity waves in a rotating fluid. J. Fluid Mech. 71 (3), 497–512.CrossRef Google Scholar

Kafiabad, H.A. & Bartello, P. 2018 Spontaneous imbalance in the non-hydrostatic Boussinesq equations. J. Fluid Mech. 847, 614–643.CrossRef Google Scholar

Kafiabad, H.A., Vanneste, J. & Young, W.R. 2021 Wave-averaged balance: a simple example. J. Fluid Mech. 911, R1.CrossRef Google Scholar

McIntyre, M.E. & Norton, W.A. 2000 Potential vorticity inversion on a hemisphere. J. Atmos. Sci. 57, 1214–1235, corrigendum: 58, 2001, 949–949.2.0.CO;2>CrossRef Google Scholar

Mohebalhojeh, A.R. & Dritschel, D.G. 2001 Hierarchies of balance conditions for the

$f$-plane shallow water equations. J. Atmos. Sci. 58, 2411–2426.2.0.CO;2>CrossRef Google Scholar

Moore, D. 1970 The mass transport velocity induced by free oscillations at a single frequency. Geophys. Fluid Dyn. 1, 237–247.CrossRef Google Scholar

Nagai, T., Tandon, A., Kunze, E. & Mahadevan, A. 2015 Spontaneous generation of near-inertial waves by the Kuroshio front. J. Phys. Oceanogr. 45 (9), 2381–2406.CrossRef Google Scholar

Rocha, C.B., Wagner, G.L. & Young, W.R. 2018 Stimulated generation: extraction of energy from balanced flow by near-inertial waves. J. Fluid Mech. 847, 417–451.CrossRef Google Scholar

Shakespeare, C.J., Gibson, A.H., Hogg, A.M., Keating, S.R., Bachman, S.D. & Velzeboer, N. 2021 Lagrangian filtering: a novel method for separating internal waves from non-wave flows in high-resolution simulations. In Earth and Space Science Open Archive, https://doi.org/10.1002/essoar.10507016.1.CrossRef Google Scholar

Shakespeare, C.J. & Hogg, A.M. 2017 Spontaneous surface generation and interior amplification of internal waves in a regional-scale ocean model. J. Phys. Oceanogr. 47 (4), 811–826.CrossRef Google Scholar

Shakespeare, C.J. & Hogg, A.M. 2018 The life cycle of spontaneously generated internal waves. J. Phys. Oceanogr. 48 (2), 343–359.CrossRef Google Scholar

Shakespeare, C.J. & Hogg, A.M. 2019 On the momentum flux of internal tides. J. Phys. Oceanogr. 49 (4), 993–1013.CrossRef Google Scholar

Skamarock, W.C. 2004 Evaluating mesoscale NWP models using kinetic energy spectra. Mon. Weath. Rev. 132 (12), 3019–3032.CrossRef Google Scholar

Soward, A.M. & Roberts, P.H. 2010 The hybrid Euler–Lagrange procedure using an extension of Moffatt's method. J. Fluid Mech. 661, 45–72.CrossRef Google Scholar

Sutherland, B.R., Achatz, U., Colm-cille, P.C. & Klymak, J.M. 2019 Recent progress in modeling imbalance in the atmosphere and ocean. Phys. Rev. Fluids 4 (1), 010501.CrossRef Google Scholar

Taylor, S. & Straub, D. 2020 Effects of adding forced near-inertial motion to a wind-driven channel flow. J. Phys. Oceanogr. 50 (10), 2983–2996.CrossRef Google Scholar

Thomas, J., Bühler, O. & Smith, K.S. 2018 Wave-induced mean flows in rotating shallow water with uniform potential vorticity. J. Fluid Mech. 839, 408–429.CrossRef Google Scholar

Vallis, G.K. 2017 Atmospheric and Oceanic Fluid Dynamics. Cambridge University Press.CrossRef Google Scholar

Vanneste, J. 2013 Balance and spontaneous wave generation in geophysical flows. Annu. Rev. Fluid Mech. 45, 147–172.CrossRef Google Scholar

Wagner, G.L. & Young, W.R. 2015 Available potential vorticity and wave-averaged quasi-geostrophic flow. J. Fluid Mech. 785, 401–424.CrossRef Google Scholar

Wagner, G.L. & Young, W.R. 2016 A three-component model for the coupled evolution of near-inertial waves, quasi-geostrophic flow and the near-inertial second harmonic. J. Fluid Mech. 802, 806–837.CrossRef Google Scholar

Waite, M.L. & Snyder, C. 2009 The mesoscale kinetic energy spectrum of a baroclinic life cycle. J. Atmos. Sci. 66 (4), 883–901.CrossRef Google Scholar

Warn, T. 1997 Nonlinear balance and quasi-geostrophic sets. Atmos. Ocean 35 (2), 135–145.CrossRef Google Scholar

Xie, J.-H. 2020 Downscale transfer of quasigeostrophic energy catalyzed by near-inertial waves. J. Fluid Mech. 904, A40.CrossRef Google Scholar

Xie, J.-H. & Vanneste, J. 2015 A generalised-Lagrangian-mean model of the interactions between near-inertial waves and mean flow. J. Fluid Mech. 774, 143–169.CrossRef Google Scholar

Figure 1. Schematic view of particle trajectories that pass through the grid points $\boldsymbol {X}_i$, $\boldsymbol {X}_j$, $\boldsymbol {X}_p$ and $\boldsymbol {X}_l$ that are marked by black. The trajectories that reach these grid points at $t_k$ are marked in blue and the trajectory that reaches the grid point at $t_{k+1}$ in red. For $\boldsymbol {X}_i$, the times associated with the particle position of red and blue trajectories are shown as well.

Figure 9. The profiles of different components of Coriolis force and (scaled) height gradient at $y=0$. All the quantities in (a) are Lagrangian averaged, and in (b) Eulerian averaged.