Hostname: page-component-745bb68f8f-b6zl4 Total loading time: 0 Render date: 2025-02-12T04:30:36.399Z Has data issue: false hasContentIssue false

Particle dispersion by nonlinearly damped random waves

Published online by Cambridge University Press:  02 December 2015

Oliver Bühler*
Affiliation:
Center for Atmosphere Ocean Science at the Courant Institute of Mathematical Sciences, New York University, New York, NY 10012, USA
Yuan Guo
Affiliation:
Center for Atmosphere Ocean Science at the Courant Institute of Mathematical Sciences, New York University, New York, NY 10012, USA
*
Email address for correspondence: obuhler@cims.nyu.edu

Abstract

We present a theoretical study of the dispersion of particles along quasi-horizontal stratification surfaces induced by small-amplitude internal gravity waves that are forced by white noise and dissipated by linear or nonlinear damping. This extends previous studies in which only linear damping was considered. The damping itself is a toy model for the nonlinear processes that would attenuate a wave mode in a broad spectrum of internal waves such as the Garrett–Munk spectrum for ocean internal waves. We compute the velocity covariance using an eigenfunction expansion of the Kolmogorov backward equation and investigate how the degree of nonlinearity affects the scaling of diffusivity with wave amplitude. We find a simple new expression for the diffusivity that is valid in both the linear and nonlinear cases, and we consider the likely quantitative importance of these process in the context of data from field experiments on small-scale ocean tracer dispersion.

Type
Papers
Copyright
© 2015 Cambridge University Press 

1. Introduction

We are interested in the horizontal spreading of particles along stratification surfaces due to the advection by random small-amplitude gravity waves. On the one hand this process poses a fundamental problem in geophysical fluid dynamics in its own right (e.g. Herterich & Hasselmann Reference Herterich and Hasselmann1982; Sanderson & Okubo Reference Sanderson and Okubo1988; Weichman & Glazman Reference Weichman and Glazman2000; Balk, Falkovich & Stepanov Reference Balk, Falkovich and Stepanov2004; Balk Reference Balk2006), and on the other hand it might also be relevant to explain part of the actual tracer diffusion observed at very small horizontal scales in the ocean (e.g. Ledwell, Watson & Law Reference Ledwell, Watson and Law1998; Garrett Reference Garrett2006). The simplest quantity that can quantify this process is the one-particle diffusivity $D$ of Taylor (Reference Taylor1921), which in a horizontally isotropic situation can be computed based on a single Cartesian velocity component $U(t)$ , say, which the Lagrangian velocity following a fluid particle. If $U(t)$ is a stationary zero-mean random function with covariance function

(1.1) $$\begin{eqnarray}C(s)=C(-s)=\mathbb{E}[U(t)U(t+s)]\quad \text{then }D=\int _{0}^{\infty }C(s)\,\text{d}s.\end{eqnarray}$$

Here $\mathbb{E}$ denotes probabilistic expectation and (1.1) makes obvious how the problem hinges on the statistics of the Lagrangian velocity field $U(t)$ .

In a recent paper, Bühler, Grisouard & Holmes-Cerfon (Reference Bühler, Grisouard and Holmes-Cerfon2013) (hereafter BGHC13), it was pointed out that the scaling of $D$ with wave amplitude $a\ll 1$ depends crucially on the presence of explicit wave damping. Specifically, in the unforced and undamped case studied in rotating shallow water in Bühler & Holmes-Cerfon (Reference Bühler and Holmes-Cerfon2009) and in a three-dimensional rotating Boussinesq system in Holmes-Cerfon, Bühler & Ferrari (Reference Holmes-Cerfon, Bühler and Ferrari2011), the diffusivity scaling was $D=O(a^{4})$ , whilst in the forced–dissipative nonlinear numerical simulations presented in BGHC13 it was clearly found that $D=O(a^{2})$ . This significant difference could be satisfactorily explained by considering toy models for the forced–dissipative wave dynamics based on stochastic differential equations (SDEs). In these toy models a single wave mode with frequency ${\it\omega}$ was forced by white noise and dissipated by linear damping. Such linear models can be solved exactly and this allowed the computation of $D$ directly from the theory, and good agreement with the numerical simulations was found.

However, as a model for the real ocean internal wave field, say, white-noise forcing and linear damping are of course poor models for the actual physical processes that underpin the spectral energy cascade through a broad spectrum of internal waves (e.g McComas & Bretherton Reference McComas and Bretherton1977; Munk Reference Munk, Warren and Wunsch1981). Hence the present paper investigates the robustness of the simple linear models by considering nonlinear variants of the damping function associated with a single wave mode. Crudely, such a damping function seeks to capture the nonlinear energy transfer from the given mode to all other modes of the wave spectrum, and there is no particularly good reason to assume that such a transfer should be linear in wave amplitude.

Our aim here is to investigate this issue by considering a family of nonlinear stochastic wave models (see (3.1) in § 3 below) that have an adjustable degree of nonlinearity indicated by a parameter $p\geqslant 0$ . The technically difficult part is that nonlinear SDEs cannot be solved directly anymore, so the covariance function $C(t)$ must be found by other means in order to compute $D$ . In this paper we investigate both an eigenfunction expansion based on the Kolmogorov backward equation (KBE) in § 2.1, which allows $C(t)$ to be computed exactly, and also a simple (but surprisingly accurate) approximation for $C(t)$ in § 2.2.

Overall, we find that the role of the nonlinearity strongly affects the scaling of $D$ with wave amplitude $a\gg 1$ , but that its explicit impact on $D$ can be removed if one chooses the expected energy flux and the expected kinetic wave energy as variables to describe the wave mode statistics. In fact, rather unexpectedly, in this way the linear theory can be generalized to nonlinear damping functions. This leads to a simple new expression for $D$ (see (3.7) and (3.10) in § 3 below) that generalizes the earlier results in BGHC13 and indicates their robustness. This simple new formula also allows the oceanic situation to be looked at in quantitative detail, as will be shown in § 4. Concluding remarks are offered in § 5.

2. Analysis of a nonlinear Ornstein–Uhlenbeck process

To introduce the analysis tools we first consider a nonlinear variant of the classical Ornstein–Uhlenbeck (OU) process in one and two dimensions and thereafter add the important physical feature of wave oscillations to the two-dimensional equations in § 3. The one-dimensional OU-process for a Lagrangian velocity component $U(t)$ is defined by the SDE

(2.1) $$\begin{eqnarray}\text{d}U=-{\it\gamma}U|U|^{p}\,\text{d}t+{\it\beta}\,\text{d}W.\end{eqnarray}$$

Here $W(t)$ is a standard Wiener process (Brownian motion) and the parameters ${\it\gamma}>0$ and ${\it\beta}\neq 0$ measure the strength of the damping and of the random forcing. The exponent $p\geqslant 0$ describes the form of the nonlinear damping term and for $p=0$ the process (2.1) reduces to the linear OU-process studied in BGHC13. For $p>0$ the instantaneous damping rate varies as ${\it\gamma}|U|^{p}$ and consequently the dimensions of ${\it\gamma}$ depend on $p$ (see (2.8ad ) below). We will be most interested in the cases $p=1$ (quadratic nonlinearity) and $p=2$ (quadratic instantaneous damping rate). The former is suggested by the elementary quadratic nonlinearity of the fluid equations, whilst the latter is often invoked in models for a spectrum of inertia–gravity waves in the ocean (e.g. McComas & Bretherton Reference McComas and Bretherton1977; Munk Reference Munk, Warren and Wunsch1981).

In the linear case $p=0$ a closed-form solution for $U(t)$ is available, but this does not extend to the nonlinear case $p>0$ . Hence indirect methods are required in order to compute the covariance $C(t)$ and the diffusivity $D$ . First, for all values of $p\geqslant 0$ a stationary state of (2.1) can be readily computed (see § A.1), which is described by the probability density function (p.d.f.)

(2.2) $$\begin{eqnarray}{\it\rho}(u)=\frac{1}{Z}\exp \left(-\frac{2}{2+p}\frac{{\it\gamma}}{{\it\beta}^{2}}|u|^{2+p}\right)\!\quad \text{with }Z=\left(\!\frac{{\it\beta}^{2}}{{\it\gamma}}\right)^{1/(2+p)}{\it\Gamma}\!\left(\frac{1}{2+p}\!\right)\!\left(\frac{2}{2+p}\!\right)^{(1+p)/(2+p)}.\end{eqnarray}$$

Here ${\it\Gamma}(\cdot )$ is the gamma function (see (A 3)). Compared to the linear case $p=0$ , the p.d.f. for $p>0$ is flattened near the origin and then decreases more sharply for larger values of $u$ , which indicates that the intermittency of $U$ depends on $p$ . The stationary moments of $U$ are

(2.3) $$\begin{eqnarray}\mathbb{E}[|U|^{m}]=\frac{{\it\Gamma}\left({\displaystyle \frac{1+m}{2+p}}\right)}{{\it\Gamma}\left({\displaystyle \frac{1}{2+p}}\right)}\left(\frac{2+p}{2}\frac{{\it\beta}^{2}}{{\it\gamma}}\right)^{m/(2+p)}\quad \text{for }m\geqslant 0.\end{eqnarray}$$

In particular, the variance

(2.4) $$\begin{eqnarray}C(0)=\mathbb{E}[U^{2}]=\frac{{\it\Gamma}\left({\displaystyle \frac{3}{2+p}}\right)}{{\it\Gamma}\left({\displaystyle \frac{1}{2+p}}\right)}\left(\frac{2+p}{2}\frac{{\it\beta}^{2}}{{\it\gamma}}\right)^{2/(2+p)}.\end{eqnarray}$$

Notably, if $m=2+p$ then the identity ${\it\Gamma}(x+1)=x{\it\Gamma}(x)$ yields the simple result

(2.5) $$\begin{eqnarray}\mathbb{E}[|U|^{2+p}]=\frac{{\it\beta}^{2}}{2{\it\gamma}}.\end{eqnarray}$$

This is the fluctuation–dissipation relation belonging to (2.1) and it can be derived directly from the SDE by multiplying it by $U$ to form an equation for the energy budget. It is peculiar to stochastic calculus that in this derivation one must retain quadratic terms such as $\text{d}U\,\text{d}U$ in order to get the correct answer (e.g. Gardiner Reference Gardiner1997). This yields

(2.6) $$\begin{eqnarray}U\,\text{d}U=\text{d}\left(\frac{U^{2}}{2}\right)-\frac{\text{d}U\,\text{d}U}{2}=\text{d}\left(\frac{U^{2}}{2}\right)-\frac{{\it\beta}^{2}}{2}\,\text{d}t=-{\it\gamma}|U|^{2+p}\,\text{d}t+{\it\beta}U\,\text{d}W\end{eqnarray}$$

after using (2.1) and Itô’s formula in the form $\text{d}W\,\text{d}W=\text{d}t$ . The stationary expectation of (2.6) then yields (2.5) because $\mathbb{E}[U\,\text{d}W]=0$ . This makes it obvious that ${\it\beta}^{2}/2$ is the expected generation rate of energy per unit time, whilst ${\it\gamma}\mathbb{E}[|U|^{2+p}]$ is the expected energy dissipation rate per unit time, which balances the generation rate in a stationary state. The conventional notation for the energy dissipation rate is ${\it\epsilon}$ , so

(2.7) $$\begin{eqnarray}{\it\epsilon}=\frac{{\it\beta}^{2}}{2}={\it\gamma}\mathbb{E}[|U|^{2+p}]={\it\gamma}O(a^{2+p}).\end{eqnarray}$$

This provides a simple and accurate formula that relates the amplitude of $U$ monotonically to the parameter ${\it\beta}^{2}/{\it\gamma}$ .

Before moving on we note how dimensional analysis can be used to define length and time scales $(L,T)$ from $({\it\beta},{\it\gamma})$ . Because $U$ is a velocity, this yields

(2.8ad ) $$\begin{eqnarray}L={\it\beta}^{(2-2p)/(2+p)}{\it\gamma}^{-3/(2+p)},\quad T=({\it\beta}^{p}{\it\gamma})^{-2/(2+p)}\quad \text{and}\quad {\it\beta}=\frac{L}{T^{3/2}},\quad {\it\gamma}=\frac{T^{p-1}}{L^{\,p}}.\end{eqnarray}$$

The intrinsic velocity and diffusivity scales then follow as

(2.9a,b ) $$\begin{eqnarray}\frac{L}{T}=\left(\frac{{\it\beta}^{2}}{{\it\gamma}}\right)^{1/(2+p)}\quad \text{and}\quad D\propto \frac{L^{2}}{T}=\left(\frac{{\it\beta}^{2-p}}{{\it\gamma}^{2}}\right)^{2/(2+p)}=\left(\frac{L}{T}\right)^{2-p}\frac{1}{{\it\gamma}}=O(a^{2-p})\frac{1}{{\it\gamma}}.\end{eqnarray}$$

For fixed variance and hence fixed $L/T=O(a)$ the diffusivity $D$ is simply proportional to $1/{\it\gamma}$ , which extends to all $p\geqslant 0$ , a result quoted for the linear OU-process in BGHC13. Interestingly, in the case of quadratic damping rate $p=2$ , the diffusivity becomes independent of the velocity amplitude. However, the nonlinear OU-process of course lacks any wave-like oscillations and hence not much relevance to oceanic waves can be read into these scaling relations.

2.1. Eigenfunction expansion for the covariance function

The exact covariance function can be determined via an eigenfunction expansion of the generator of the SDE (2.1), which is the differential operator

(2.10) $$\begin{eqnarray}L=-{\it\gamma}u|u|^{p}\frac{\partial }{\partial u}+\frac{{\it\beta}^{2}}{2}\frac{\partial ^{2}}{\partial u^{2}}.\end{eqnarray}$$

The generator $L$ is less frequently used in physical applications than its adjoint $L^{\dagger }$ , which appears in the Fokker–Planck equation (a.k.a. the Kolmogorov forward equation)

(2.11) $$\begin{eqnarray}\frac{\partial {\it\rho}}{\partial t}=L^{\dagger }{\it\rho}=\frac{\partial }{\partial u}\left({\it\gamma}u|u|^{p}{\it\rho}+\frac{\partial }{\partial u}\left(\frac{{\it\beta}^{2}}{2}{\it\rho}\right)\right)\end{eqnarray}$$

for the p.d.f. ${\it\rho}(u,t)$ of the process. The invariant p.d.f. ${\it\rho}(u)$ in (2.2) is in the null-space of the adjoint: $L^{\dagger }{\it\rho}=0$ . From a mathematical perspective the generator $L$ is fundamental and has a number of attractive properties (e.g. Oksendal Reference Oksendal2013), the most important of which for the present purpose is its role in the KBE. For an autonomous SDE such as (2.1) the KBE takes the following form: consider a function $f(u,t)$ defined for $t\geqslant 0$ by the initial-value problem

(2.12a,b ) $$\begin{eqnarray}\frac{\partial f}{\partial t}=L\,f\quad \text{and}\quad f(u,0)=g(u).\end{eqnarray}$$

It then follows from probability theory that

(2.13) $$\begin{eqnarray}f(u,t)=\mathbb{E}[g(U(t))\mid U(0)=u].\end{eqnarray}$$

In other words, if $f(u,t)$ solves the KBE problem in (2.12) then it equals the expected value of $g(U(t))$ at time $t\geqslant 0$ conditioned on $U(0)=u$ at the initial time $t=0$ . Here $g(\cdot )$ is an arbitrary function of $u$ . Hence the KBE allows time-dependent expected values to be computed directly, i.e. without finding the time-dependent p.d.f. of the process first, which is a significant simplification.

This can be used to compute $C(t)$ as follows. We first use the KBE to compute the expected value of $U(t)$ conditioned on starting at a specific location $U(0)=u$ . Thereafter we aggregate this result (times $u$ ) over all values of $u$ according to the invariant measure. In equations, for $t\geqslant 0$ this means

(2.14) $$\begin{eqnarray}C(t)=\mathbb{E}[U(0)U(t)]=\int _{-\infty }^{+\infty }u\mathbb{E}[U(t)\mid U(0)=u]{\it\rho}\,\text{d}u=\int _{-\infty }^{+\infty }u\,f(u,t){\it\rho}\,\text{d}u\end{eqnarray}$$

where $f(u,t)$ solves (2.12) with $g(u)=u$ . The actual computation of $f$ utilizes the eigenfunctions $y_{k}(u)$ and eigenvalues ${\it\lambda}_{k}$ of $L$ as defined by

(2.15) $$\begin{eqnarray}Ly_{k}=-{\it\lambda}_{k}y_{k}.\end{eqnarray}$$

Multiplication by ${\it\rho}(u)$ and using $L^{\dagger }{\it\rho}=0$ puts (2.15) into self-adjoint form:

(2.16) $$\begin{eqnarray}\frac{\text{d}}{\text{d}u}\left(\frac{{\it\beta}^{2}}{2}{\it\rho}\,\frac{\text{d}y_{k}}{\text{d}u}\right)+{\it\lambda}_{k}{\it\rho}\,y_{k}=0.\end{eqnarray}$$

This is a standard Sturm–Liouville problem with weight ${\it\rho}(u)$ , so the eigenfunctions are orthogonal under the inner product with that weight and can be normalized such that

(2.17) $$\begin{eqnarray}\int _{-\infty }^{+\infty }y_{k}y_{l}\,{\it\rho}\,\text{d}u={\it\delta}_{kl}.\end{eqnarray}$$

From here it is straightforward to find $(y_{k},{\it\lambda}_{k})$ numerically by using a suitable software package such as MATSLISE for matlab (Ledoux, Daele & Berghe Reference Ledoux, Daele and Berghe2005). Because ${\it\rho}(u)$ decays rapidly at large $|u|$ this can be done using a finite computational domain.

Each eigenfunction generates a separable solution $y_{k}(u)\exp (-{\it\lambda}_{k}t)$ of the KBE and therefore $f(u,t)$ can be expanded as

(2.18) $$\begin{eqnarray}f(u,t)=\mathop{\sum }_{k}a_{k}y_{k}(u)\text{e}^{-{\it\lambda}_{k}t}\quad \text{where }a_{k}=\int _{-\infty }^{+\infty }uy_{k}{\it\rho}\,\text{d}u.\end{eqnarray}$$

From the second part of (2.14) the covariance and the diffusion are then given by

(2.19a,b ) $$\begin{eqnarray}t\geqslant 0:\quad C(t)=\mathop{\sum }_{k}a_{k}^{2}\text{e}^{-{\it\lambda}_{k}t}\quad \text{and}\quad D=\mathop{\sum }_{k}\frac{a_{k}^{2}}{{\it\lambda}_{k}}.\end{eqnarray}$$

The main result (2.19) shows that the covariance is always a sum of exponential functions, albeit with different decay rates given by the eigenvalues ${\it\lambda}_{k}$ . Numerical results of this eigenfunction expansion for $p=1$ and $p=2$ are given in table 1 and illustrated in figure 1. The sum for $C(t)$ in (2.19) converges rapidly and this suggests a simple approximation based on a single exponential function $C_{\ast }(t)$ , say. This is pursued in the next section.

Figure 1. (a) Non-dimensional exact $C(t)$ computed in § 2.1 (solid line) and approximate $C_{\ast }(t)$ computed in § 2.2 (dot-dashed line) for the process (2.1) with $p=2$ . The approximation is very accurate except for a small underestimate at large lag times. (b) Illustration of the first four non-dimensional eigenfunctions $y_{k}$ for $k\in \{0,1,2,3\}$ for the same case. The weighted functions $y_{k}\sqrt{{\it\rho}}$ are plotted, which are orthogonal under the standard inner product.

Table 1. The eigenfunction expansion for $p=1$ and $p=2$ , and ${\it\beta}={\it\gamma}=1$ , which produces non-dimensional eigenvalues and other parameters. Dimensional values are obtained by multiplying the eigenvalues ${\it\lambda}_{k}$ by $1/T$ and so on. Eigenvalues with even $k$ are neglected because their corresponding eigenfunctions are orthogonal to the initial condition $f(u,0)=u$ and hence make no contribution to $C(t)$ . For $p=1$ the approximate $D_{\ast }=0.479$ and for $p=2$ it is $D_{\ast }=0.457$ , indicating the remarkable accuracy of the exponential approximation (2.24a,b ).

2.2. Exponential approximation for the covariance function

A natural idea for an approximate covariance function would be to truncate the sum in (2.19) after just one term. However, this would have little practical advantage because one would still need the eigenfunction expansion to compute the parameters $a_{1}$ and ${\it\lambda}_{1}$ . Also, such an approximation would not capture the exact variance $C(0)$ of the process, which is the sum over all $a_{k}^{2}$ . It is therefore a better idea to pose an exponential approximation for $C(t)$ at the outset and fit its parameters directly from the behaviour of the exact $C(t)$ near $t=0$ . This side-steps the need for an eigenfunction expansion altogether and turns out to be surprisingly accurate as well. We therefore consider

(2.20a,b ) $$\begin{eqnarray}C_{\ast }(t)=A\text{e}^{-B|t|}\quad \text{and}\quad D_{\ast }=\frac{A}{B}\end{eqnarray}$$

with suitable constants $(A,B)$ determined from the behaviour of $C(t)$ near $t=0$ . Specifically, we require

(2.21a,b ) $$\begin{eqnarray}C_{\ast }(0)=C(0)\quad \text{and}\quad C_{\ast }^{\prime }(0)=C^{\prime }(0),\end{eqnarray}$$

where the slope in the second equation is evaluated as $t\rightarrow 0+$ , say. This is necessary because $C^{\prime }$ jumps at $t=0$ . The exact slope at the origin can be computed by combining a Taylor expansion for small $\text{d}t>0$ with the fluctuation–dissipation relation (2.7) as follows:

(2.22) $$\begin{eqnarray}\displaystyle & C(\text{d}t)=C(0)+\text{d}tC^{\prime }(0)=\mathbb{E}[U(t)U(t+\text{d}t)]=\mathbb{E}[U^{2}]+\mathbb{E}[U\,\text{d}U] & \displaystyle\end{eqnarray}$$
(2.23) $$\begin{eqnarray}\displaystyle & \displaystyle \quad \Rightarrow \quad C^{\prime }(0)=\frac{\mathbb{E}[U\,\text{d}U]}{\text{d}t}=-{\it\gamma}\mathbb{E}[|U|^{2+p}]=-\frac{{\it\beta}^{2}}{2}=-{\it\epsilon}. & \displaystyle\end{eqnarray}$$
Hence $A=\mathbb{E}[U^{2}]$ and $B={\it\epsilon}/\mathbb{E}[U^{2}]$ , which finally yields
(2.24a,b ) $$\begin{eqnarray}C_{\ast }(t)=\mathbb{E}[U^{2}]\exp \left(-\frac{{\it\epsilon}|t|}{\mathbb{E}[U^{2}]}\right)\quad \text{and}\quad D_{\ast }=\frac{(\mathbb{E}[U^{2}])^{2}}{{\it\epsilon}}.\end{eqnarray}$$

The corresponding approximation for the auto-correlation time is

(2.25) $$\begin{eqnarray}{\it\tau}_{\ast }=\frac{D_{\ast }}{\mathbb{E}[U^{2}]}=\frac{\mathbb{E}[U^{2}]}{{\it\epsilon}}.\end{eqnarray}$$

Of course, the functional expressions for $D_{\ast }$ and ${\it\tau}_{\ast }$ are merely as one would expect from dimensional analysis based on the kinetic energy $\mathbb{E}[U^{2}]/2$ and on the energy dissipation ${\it\epsilon}$ , but the interesting point is that these formulae yield the correct diffusivity values to within a few percent, as was seen in § 2.1. Notably, there are no pre-factors involving the nonlinearity parameter $p\geqslant 0$ , so in this approximation two processes with different values of $p$ but equal values of $\mathbb{E}[U^{2}]$ and ${\it\epsilon}$ will yield the same diffusivity. This is in fact a robust finding even for wave-like processes, as we shall see.

The comparison of $C(t)$ and $C_{\ast }(t)$ in figure 1 shows that the latter matches the former very well, with a small underestimate for large values of $t$ . Overall we find that the exponential approximation (2.24a,b ) is surprisingly accurate, with errors less than 5 % or so. This unexpected accuracy of the exponential approximation $C_{\ast }(t)$ allows us to simplify a great number of relations below, but we did check that using the exact $C(t)$ would yield essentially the same results in all cases we considered.

2.3. Two-dimensional nonlinear OU-process

A natural two-dimensional isotropic generalization of the process (2.1) is

(2.26a ) $$\begin{eqnarray}\text{d}U=-{\it\gamma}U(U^{2}+V^{2})^{p/2}\,\text{d}t+{\it\beta}\text{d}W_{1},\end{eqnarray}$$
(2.26b ) $$\begin{eqnarray}\text{d}V=-{\it\gamma}V(U^{2}+V^{2})^{p/2}\,\text{d}t+{\it\beta}\text{d}W_{2}.\end{eqnarray}$$

Here $\boldsymbol{U}=(U,V)$ is the two-dimensional horizontal velocity vector and $(W_{1},W_{2})$ are two independent Wiener processes. No new dimensional parameters have been introduced so the dimensional analysis leading to (2.8ad ) and below remains unchanged. The nonlinear drift is a gradient flow with isotropic potential

(2.27) $$\begin{eqnarray}{\it\Phi}(U,V)=-\frac{{\it\gamma}}{2+p}(U^{2}+V^{2})^{(2+p)/2},\end{eqnarray}$$

and hence the two-dimensional invariant p.d.f. has the form (e.g Gardiner Reference Gardiner1997)

(2.28) $$\begin{eqnarray}{\it\rho}(u,v)=\frac{1}{Z}\exp \left(\frac{2}{{\it\beta}^{2}}{\it\Phi}(u,v)\right)\quad \text{with }Z={\rm\pi}\left(\frac{{\it\beta}^{2}}{{\it\gamma}}\right)^{2/(2+p)}{\it\Gamma}\left(\frac{2}{2+p}\right)\left(\frac{2}{2+p}\right)^{p/(2+p)}.\end{eqnarray}$$

Clearly, ${\it\rho}(u,v)$ depends only on the velocity magnitude $r=\sqrt{u^{2}+v^{2}}$ , but it does not factorize and hence $U$ and $V$ are not independent, except in the linear case $p=0$ . For example, the conditional expectations

(2.29a,b ) $$\begin{eqnarray}\mathbb{E}[V\mid U=u]=0\quad \text{but }\mathbb{E}[V^{2}\mid U=u]\approx \frac{{\it\beta}^{2}}{2{\it\gamma}|u|^{p}}\end{eqnarray}$$

for large $|u|$ . The remaining analysis works much as it did for the one-dimensional model. First, the stationary moments of $U$ are (see § A.1)

(2.30) $$\begin{eqnarray}\mathbb{E}[|U|^{m}]=\frac{1}{\sqrt{{\rm\pi}}}\frac{{\it\Gamma}\left({\displaystyle \frac{1+m}{2}}\right){\it\Gamma}\left({\displaystyle \frac{2+m}{2+p}}\right)}{{\it\Gamma}\left({\displaystyle \frac{2+m}{2}}\right){\it\Gamma}\left({\displaystyle \frac{2}{2+p}}\right)}\left(\frac{2+p}{2}\frac{{\it\beta}^{2}}{{\it\gamma}}\right)^{m/(2+p)}\quad \text{for }m\geqslant 0.\end{eqnarray}$$

The variance is

(2.31) $$\begin{eqnarray}C(0)=\mathbb{E}[U^{2}]=\mathbb{E}[V^{2}]=\frac{1}{2}\frac{{\it\Gamma}\left({\displaystyle \frac{4}{2+p}}\right)}{{\it\Gamma}\left({\displaystyle \frac{2}{2+p}}\right)}\left(\frac{2+p}{2}\frac{{\it\beta}^{2}}{{\it\gamma}}\right)^{2/(2+p)}.\end{eqnarray}$$

Second, the fluctuation–dissipation relation is

(2.32) $$\begin{eqnarray}\mathbb{E}[{\it\gamma}U^{2}(U^{2}+V^{2})^{p/2}]=\mathbb{E}[{\it\gamma}V^{2}(U^{2}+V^{2})^{p/2}]=\frac{{\it\beta}^{2}}{2}={\it\epsilon}={\it\gamma}O(a^{2+p}).\end{eqnarray}$$

Here ${\it\epsilon}$ is the energy dissipation per kinetic energy component $U^{2}/2$ or $V^{2}/2$ , so the total energy dissipation associated with $(U^{2}+V^{2})/2$ is twice that, or ${\it\beta}^{2}=2{\it\epsilon}$ . Third, the approximate covariance $C_{\ast }$ and diffusion $D_{\ast }$ take the same functional forms as in (2.24a,b ). This is because (2.22) adjusts to

(2.33) $$\begin{eqnarray}C^{\prime }(0)=\frac{\mathbb{E}[U\,\text{d}U]}{\text{d}t}=-{\it\gamma}\mathbb{E}[U^{2}(U^{2}+V^{2})^{\,p/2}]=-\frac{{\it\beta}^{2}}{2}=-{\it\epsilon},\end{eqnarray}$$

using the fluctuation–dissipation relation once more. Finally, the KBE (2.12) now applies to functions $f(\boldsymbol{u},t)$ in an obvious fashion:

(2.34a,b ) $$\begin{eqnarray}f(\boldsymbol{u},0)=g(\boldsymbol{u})\quad \text{and}\quad \frac{\partial f}{\partial t}=Lf\quad \Rightarrow \quad f(\boldsymbol{u},t)=\mathbb{E}[g(\boldsymbol{U}(t))\mid \boldsymbol{U}(0)=\boldsymbol{u}].\end{eqnarray}$$

The eigenfunction expansion is now based on the two-dimensional generator

(2.35) $$\begin{eqnarray}L=-{\it\gamma}(u^{2}+v^{2})^{p/2}\left(u\frac{\partial }{\partial u}+v\frac{\partial }{\partial v}\right)+\frac{{\it\beta}^{2}}{2}\left(\frac{\partial ^{2}}{\partial u^{2}}+\frac{\partial ^{2}}{\partial v^{2}}\right),\end{eqnarray}$$

which in polar coordinates takes the form

(2.36) $$\begin{eqnarray}L=-{\it\gamma}\,r^{\,p+1}\frac{\partial }{\partial r}+\frac{{\it\beta}^{2}}{2}\left(\frac{1}{r}\frac{\partial }{\partial r}r\frac{\partial }{\partial r}+\frac{1}{r^{2}}\frac{\partial ^{2}}{\partial {\it\theta}^{2}}\right).\end{eqnarray}$$

The eigenfunctions of $L$ are separable in $(r,{\it\theta})$ so we can consider functions of the form $y_{k}(r)\exp (\text{i}n{\it\theta})$ with integer $n$ , which yields

(2.37) $$\begin{eqnarray}-{\it\gamma}r^{\,p+1}\frac{\text{d}}{\text{d}r}+\frac{{\it\beta}^{2}}{2}\left(\frac{1}{r}\frac{\text{d}}{\text{d}r}r\frac{\text{d}}{\text{d}r}-\frac{n^{2}}{r^{2}}\right)y_{k}=-{\it\lambda}_{k}y_{k}.\end{eqnarray}$$

Again, multiplication with the weight $r{\it\rho}(r)$ puts this equation into self-adjoint form:

(2.38) $$\begin{eqnarray}\frac{\text{d}}{\text{d}r}\left(\frac{{\it\beta}^{2}}{2}r{\it\rho}\frac{\text{d}y_{k}}{\text{d}r}\right)+\left({\it\lambda}_{k}-\frac{n^{2}{\it\beta}^{2}}{2r^{2}}\right)r{\it\rho}y_{k}=0.\end{eqnarray}$$

The boundary condition at $r=0$ is

(2.39) $$\begin{eqnarray}\frac{\text{d}y_{k}}{\text{d}r}=0\quad \text{if }n=0\text{ and }y_{k}=0\text{ otherwise}.\end{eqnarray}$$

For any given $n$ the radial eigenfunctions are orthogonal and normalized such that

(2.40) $$\begin{eqnarray}\int _{0}^{\infty }r{\it\rho}y_{k}y_{l}\,\text{d}r={\it\delta}_{kl}.\end{eqnarray}$$

In order to compute the expected value of $U$ we consider

(2.41) $$\begin{eqnarray}f(r,{\it\theta},0)=u=r\cos {\it\theta},\end{eqnarray}$$

which is just the $n=1$ mode. Hence

(2.42) $$\begin{eqnarray}f(r,{\it\theta},t)=\cos {\it\theta}\mathop{\sum }_{k}a_{k}y_{k}(r)\text{e}^{-{\it\lambda}_{k}t}\quad \text{where }a_{k}=\int _{0}^{\infty }r^{2}{\it\rho}y_{k}\,\text{d}r\end{eqnarray}$$

and the $y_{k}$ are the eigenfunctions of (2.38) with $n=1$ . These are again easily found numerically. The covariance function sought is

(2.43) $$\begin{eqnarray}\displaystyle & C(t)=\mathbb{E}[U(0)U(t)]=\displaystyle \int _{0}^{\infty }\int _{0}^{2{\rm\pi}}r\cos {\it\theta}\,f\,{\it\rho}r\,\text{d}r\,\text{d}{\it\theta} & \displaystyle\end{eqnarray}$$
(2.44) $$\begin{eqnarray}\displaystyle & \displaystyle =\int _{0}^{2{\rm\pi}}\cos ^{2}{\it\theta}\,\text{d}{\it\theta}\int _{0}^{\infty }\mathop{\sum }_{k}a_{k}y_{k}(r)\text{e}^{-{\it\lambda}_{k}t}{\it\rho}r^{2}\,\text{d}r={\rm\pi}\mathop{\sum }_{k}a_{k}^{2}\text{e}^{-{\it\lambda}_{k}t} & \displaystyle\end{eqnarray}$$
and the two-dimensional diffusion coefficient is
(2.45) $$\begin{eqnarray}D={\rm\pi}\mathop{\sum }_{k}\frac{a_{k}^{2}}{{\it\lambda}_{k}}.\end{eqnarray}$$

We have computed the eigenfunction expansion for $p=1$ and $p=2$ but found the same result as in the one-dimensional case: the series for the diffusivity in (2.45) converges rapidly, the exponential approximation to it is accurate to within 5 %, and the exponential covariance $C_{\ast }(t)$ slightly underestimates the true $C(t)$ for large values of $t$ .

The summary of this investigation is that the exponential approximation gives an excellent prediction of the true covariance in both the one-dimensional and two-dimensional cases and that the value of the nonlinearity parameter $p\geqslant 0$ does not enter the expressions for $C_{\ast }(t)$ and $D_{\ast }$ when these quantities are expressed in terms of $\mathbb{E}[U^{2}]$ and ${\it\epsilon}$ . Notably, ${\it\epsilon}$ is the energy dissipation rate per energy component $U^{2}/2$ , so in the two-dimensional case the total dissipation range is $2{\it\epsilon}$ .

3. Two-dimensional nonlinear process with wave oscillation

We now augment the two-dimensional nonlinear process (2.26a ) with a simple model for a wave-like oscillation with frequency ${\it\omega}\geqslant 0$ :

(3.1) $$\begin{eqnarray}\left.\begin{array}{@{}c@{}}\text{d}U=-{\it\gamma}U(U^{2}+V^{2})^{p/2}\,\text{d}t+{\it\omega}V\,\text{d}t+{\it\beta}\text{d}W_{1},\\ \text{d}V=-{\it\gamma}V(U^{2}+V^{2})^{p/2}\,\text{d}t-{\it\omega}U\,\text{d}t+{\it\beta}\text{d}W_{2}.\end{array}\right\}\end{eqnarray}$$

In the context of inertia–gravity waves this model can be viewed either as a model for a clockwise inertial oscillation based on a Coriolis parameter $f={\it\omega}$ , or as a generic evolution equation for a wave mode in spectral space with frequency ${\it\omega}$ , say. The former is directly relevant to storm-forced oscillations in the upper ocean whilst the latter is relevant to nonlinear forced–dissipative inertia–gravity wave models as in § 4 of BGHC13. Either way, in terms of the intrinsic scales $(L,T)$ based on $({\it\beta},{\it\gamma})$ in (2.8ad ), the new frequency ${\it\omega}$ introduces a non-dimensional parameter ${\it\omega}T$ , which significantly weakens the conclusions that can be drawn from dimensional analysis alone.

3.1. Covariance reduction to the non-oscillatory case

The linear case $(p=0)$ was considered in BGHC13 and there the outcome was that the covariance of $U$ for ${\it\omega}\neq 0$ was simply $\cos ({\it\omega}t)$ times the covariance of $U$ for ${\it\omega}=0$ . However, this result was derived using the exact solution for the linear system, which does not extend to the nonlinear case. Hence we need to check carefully how the nonlinearity affects the situation. First of all, for all values of ${\it\omega}$ the system (3.1) has the same invariant measure (2.28) because the oscillatory drift $({\it\omega}V,-{\it\omega}U)$ is along lines of constant potential ${\it\Phi}$ in (2.27). Hence although the Fokker–Planck equation for (3.1) contains new terms, these terms are zero if evaluated on the invariant measure (2.28). For further analysis it is convenient to replace $\boldsymbol{U}$ by the back-rotated complex variable

(3.2) $$\begin{eqnarray}A=A_{r}+\text{i}A_{i}=(U+\text{i}V)\,\text{e}^{+\text{i}{\it\omega}t}\quad \text{such that }(U,V)=(\text{Re}\,,\text{Im})A\,\text{e}^{-\text{i}{\it\omega}t}.\end{eqnarray}$$

The system (3.1) then takes the form

(3.3) $$\begin{eqnarray}\text{d}A+{\it\gamma}A(A_{r}^{2}+A_{i}^{2})^{p/2}\,\text{d}t=\text{e}^{+\text{i}{\it\omega}t}{\it\beta}(\text{d}W_{1}+\text{i}\text{d}W_{2})={\it\beta}(\text{d}W_{1}+\text{i}\text{d}W_{2}).\end{eqnarray}$$

Here the second equality holds in the sense that the unitary factor $\exp (\text{i}{\it\omega}t)$ can be absorbed without loss of generality in the definition of the complex-valued white noise. This is because the generator of the respective diffusions is identical (see § A.2). But this means that $(A_{r},A_{i})$ satisfy exactly the ${\it\omega}=0$ nonlinear equations (2.26a ), for which we know how to solve for the covariance structure. Hence we can write

(3.4) $$\begin{eqnarray}\displaystyle C_{{\it\omega}}(t)=\mathbb{E}_{{\it\omega}}[U(0)U(t)] & = & \displaystyle \mathbb{E}_{{\it\omega}}[(\text{Re}\,A(0))(\text{Re}\,A(t)\,\text{e}^{-\text{i}{\it\omega}t})]\nonumber\\ \displaystyle & = & \displaystyle \cos {\it\omega}t\mathbb{E}_{0}[A_{r}(0)A_{r}(t)]+\sin {\it\omega}t\mathbb{E}_{0}[A_{r}(0)A_{i}(t)]\nonumber\\ \displaystyle & = & \displaystyle \cos {\it\omega}t\mathbb{E}_{0}[U(0)U(t)]+\sin {\it\omega}t\mathbb{E}_{0}[U(0)V(t)],\end{eqnarray}$$

where the subscripts indicate whether the expectation is taken with respect to the dynamics with ${\it\omega}\neq 0$ or with ${\it\omega}=0$ . The key question is whether $\mathbb{E}_{0}[U(0)V(t)]=0$ . In the linear case studied in BGHC13 this term was zero because if ${\it\omega}=0$ then $U$ and $V$ were independent zero-mean functions. In the nonlinear case $p>0$ they are not independent, but it is still true that their cross-covariance is zero at all time lags. This can either be seen by inspection of the symmetries of the ${\it\omega}=0$ equations, or it can be derived formally from the KBE (2.34). In the latter case we would integrate a solution starting from $f(r,{\it\theta},0)=v=r\sin {\it\theta}$ against a term $u=r\cos {\it\theta}$ in (2.43), which vanishes after integration over ${\it\theta}$ . The upshot is that

(3.5) $$\begin{eqnarray}C_{{\it\omega}}(t)=\cos {\it\omega}t\,C_{0}(t)\end{eqnarray}$$

holds exactly for all values of $p$ . From (2.43) we find that

(3.6) $$\begin{eqnarray}t\geqslant 0:\quad C_{{\it\omega}}(t)=\cos {\it\omega}t\mathop{\sum }_{k}{\rm\pi}a_{k}^{2}\text{e}^{-{\it\lambda}_{k}t}\quad \Rightarrow \quad D=\mathop{\sum }_{k}{\rm\pi}a_{k}^{2}\frac{{\it\lambda}_{k}}{{\it\lambda}_{k}^{2}+{\it\omega}^{2}}.\end{eqnarray}$$

This infinite sum for $D$ generalizes the single-term expression (3.10) derived in BGHC13 for the linear case. The corresponding approximate versions are

(3.7a,b ) $$\begin{eqnarray}C_{\ast }(t)=\mathbb{E}[U^{2}]\,\cos {\it\omega}t\,\text{e}^{-|t|/{\it\tau}^{E}}\quad \text{and}\quad D_{\ast }=\frac{\mathbb{E}[U^{2}]{\it\tau}^{E}}{1+({\it\omega}{\it\tau}^{E})^{2}}\quad \text{where }{\it\tau}^{E}=\frac{\mathbb{E}[U^{2}]}{{\it\epsilon}}\end{eqnarray}$$

is the energy damping time scale and ${\it\epsilon}={\it\beta}^{2}/2$ . As before, the dependence on the nonlinearity parameter $p$ has disappeared when the model parameters $({\it\beta},{\it\gamma})$ are eliminated in favour of $\mathbb{E}[U^{2}]$ and ${\it\epsilon}$ . Notably, if ${\it\omega}\neq 0$ then the auto-correlation time

(3.8) $$\begin{eqnarray}{\it\tau}_{\ast }=\frac{D_{\ast }}{\mathbb{E}[U^{2}]}=\frac{{\it\tau}^{E}}{1+({\it\omega}{\it\tau}^{E})^{2}}\end{eqnarray}$$

is always shorter than the energy damping time scale ${\it\tau}^{E}$ .

3.2. High-frequency waves

The practically relevant regime in the ocean is that of high-frequency waves in the sense that the wave period is short compared with the energy decay time scale of the wave field. This corresponds to

(3.9) $$\begin{eqnarray}({\it\omega}{\it\tau}^{E})^{2}\gg 1\end{eqnarray}$$

and then we obtain the limiting forms

(3.10a,b ) $$\begin{eqnarray}{\it\tau}_{\ast }\approx \frac{1}{{\it\tau}^{E}}\frac{1}{{\it\omega}^{2}}=\frac{{\it\epsilon}}{{\it\omega}^{2}\mathbb{E}[U^{2}]}\quad \text{and}\quad D_{\ast }\approx \frac{{\it\epsilon}}{{\it\omega}^{2}}.\end{eqnarray}$$

Finally, the fluctuation–dissipation relation (2.32) applied to (3.10) yields the simple scalings

(3.11ac ) $$\begin{eqnarray}\displaystyle {\it\tau}^{E}=O(a^{-p}),\quad {\it\tau}_{\ast }=O(a^{p})\quad \text{and}\quad D_{\ast }=O(a^{2+p}), & & \displaystyle\end{eqnarray}$$

with respect to the $O(a)$ wave amplitude. This also recovers the linear results of BGHC13 if $p=0$ .

4. Ocean internal waves

We now investigate how these simple stochastic models for wave-induced diffusion might apply to the kind of small-scale horizontal diffusion in the ocean that was first observed in the NATRE experiment (Ledwell et al. Reference Ledwell, Watson and Law1998) and subsequently confirmed in a number of additional observational campaigns (e.g. Sundermeyer & Ledwell Reference Sundermeyer and Ledwell2001; Shcherbina et al. Reference Shcherbina, Sundermeyer, Kunze, D’Asaro, Badin, Birch, Anne-Marie, Callies, Kuebel, Brandy and Claret2014). A suitable reference point is the Garrett–Munk (GM) spectrum for the observed background ocean spectrum of internal waves (e.g. Munk Reference Munk, Warren and Wunsch1981).

Three salient elements are important here: first, there is a systematic nonlinear energy flux ${\it\epsilon}_{GM}$ , say, that flows through the modal components of the GM-spectrum from small to large vertical wavenumbers. There are several different physical mechanisms for this flux, such as resonant triad interactions or the differential advection of small-scale waves by large-scale waves, but the upshot is that the energy flux ${\it\epsilon}_{GM}$ eventually leads to instability of the highest vertical wavenumber waves and therefore to wave breaking and wave dissipation. The small vertical wavenumber waves do not break, but their oscillatory dynamics is of course affected by the energy flux ${\it\epsilon}_{GM}$ flowing through them. This is the kind of nonlinear energy flux through the spectrum that is modelled by the forced–dissipative balance in our model.

Second, all evidence suggests that ${\it\epsilon}_{GM}$ is quadratic in the wave energy density $E_{GM}$ , say, of the GM-spectrum. In other words, the instantaneous damping rate of the wave field is roughly proportional to its energy. In the context of our model this selects the value $p=2$ for the nonlinearity parameter in (3.1) and consequently $D=O(a^{4})$ according to (3.11c ). This is the same scaling of $D$ with wave amplitude as in the unforced inviscid weakly nonlinear study in Holmes-Cerfon et al. (Reference Holmes-Cerfon, Bühler and Ferrari2011), but the pre-factor is very different, as we shall see.

Third, typical values for $E_{GM}$ and ${\it\epsilon}_{GM}$ at equilibrium in the main ocean thermocline are (e.g. Munk Reference Munk, Warren and Wunsch1981; Watanabe & Hibiya Reference Watanabe and Hibiya2005)

(4.1a,b ) $$\begin{eqnarray}E_{GM}=0.002~\text{m}^{2}~\text{s}^{-2}\quad \text{and}\quad {\it\epsilon}_{GM}=5\times 10^{-9}~\text{m}^{2}~\text{s}^{-3}\quad \Rightarrow \quad {\it\tau}_{GM}^{E}=\frac{E_{GM}}{{\it\epsilon}_{GM}}\approx 5~\text{days}.\end{eqnarray}$$

This, together with ${\it\omega}\geqslant f$ , shows that the high-frequency regime (3.9) is clearly relevant here, because the energy decay time is of the order of a week, which is long compared to the wave periods.

Combing these three elements we assume that there is a common ${\it\epsilon}\propto {\it\epsilon}_{GM}$ for the modal components of the wave spectrum, that $p=2$ , and that (3.10) together with (4.1) suggests that the wave-induced particle diffusion is dominated by the near-inertial waves ${\it\omega}\approx f$ . Notably, in the GM-spectrum the near-inertial waves are also the most energetic, but the present argument is based on the constant spectral energy flux ${\it\epsilon}$ rather than the variable spectral energy density. Let us consider only the diffusion induced by waves with ${\it\omega}\approx f$ . To use (3.10) for the diffusion we need to relate ${\it\epsilon}$ to ${\it\epsilon}_{GM}$ . For near-inertial waves the wave energy is dominated by the horizontal kinetic energy and therefore ${\it\epsilon}_{GM}$ is the energy flux associated with $U^{2}/2+V^{2}/2$ . Our ${\it\epsilon}$ is the energy flux associated with $U^{2}/2$ alone, so by isotropy we can argue that

(4.2) $$\begin{eqnarray}{\it\epsilon}=\frac{{\it\epsilon}_{GM}}{2}\quad \Rightarrow \quad D=\frac{{\it\epsilon}}{{\it\omega}^{2}}\approx \frac{{\it\epsilon}_{GM}}{2f^{2}}=0.25~\text{m}^{2}~\text{s}^{-1}\end{eqnarray}$$

using $f=0.0001~\text{s}^{-1}$ and (4.1). On the one hand, this is larger by a factor of ten or so than the $O(a^{4})$ inviscid estimate in Holmes-Cerfon et al. (Reference Holmes-Cerfon, Bühler and Ferrari2011) (or to the similarly small estimate based on shear dispersion, see Young, Rhines & Garrett Reference Young, Rhines and Garrett1982). On the other, it is still somewhat modest when compared to the observed small-scale diffusivity value of $1{-}2~\text{m}^{2}~\text{s}^{-1}$ in the NATRE experiment (Ledwell et al. Reference Ledwell, Watson and Law1998).

An alternative formula for $D$ that links it directly to the diapycnal diffusivity ${\it\kappa}$ can also be derived, although this runs the risk of making quantities look related that physically have little to do with each other. This is based on the often-used empirical formula

(4.3) $$\begin{eqnarray}{\it\kappa}={\it\Gamma}_{m}\frac{{\it\epsilon}_{GM}}{N^{2}}\quad \Rightarrow \quad D=\frac{{\it\kappa}}{2{\it\Gamma}_{m}}\frac{N^{2}}{f^{2}}\end{eqnarray}$$

where ${\it\Gamma}_{m}$ is the mixing efficiency of stratified turbulence and $N$ is the buoyancy frequency. Using the typical values ${\it\kappa}=10^{-5}~\text{m}^{2}~\text{s}^{-1}$ , ${\it\Gamma}_{m}=0.2$ , and $N=0.01~\text{s}^{-1}$ then recovers the numerical estimate for $D$ in (4.2).

As noted above, this makes it look as if there was a close link between shear dispersion and the dispersion process considered here, because modulo the pre-factor (4.3) turns out to be identical to the deep-ocean version for shear dispersion quoted in § 4d of Young et al. (Reference Young, Rhines and Garrett1982). But we believe this is not the case. For example, one could imagine a situation where wave energy sloshes back and forth between different wave modes in a conservative fashion, say between the members of a resonant internal wave triad. In this situation there would be zero shear dispersion because there is no vertical diapycnal diffusion to begin with. However, the process described in this paper would lead to a non-zero horizontal dispersion because the wave–wave interactions upset the exquisite reversibility of the linear wave motion, with consequent particle dispersion in the horizontal.

Finally, it is tempting to speculate on what would happen to these estimates if one were to take the diffusivity due to waves with other frequencies than ${\it\omega}=f$ into account. For example, if this were a purely additive process then one could add up contributions of the type ${\it\epsilon}/{\it\omega}^{2}$ for each octave in frequency between ${\it\omega}=f$ and ${\it\omega}=N$ . However, the rapid decay of $1/{\it\omega}^{2}$ with increasing ${\it\omega}$ suggests that this would not make a material difference to the estimate in (4.2); numerically we found that it might increase the estimate by some 30 % or so, which is not a big difference.

Overall it seems that internal-wave-induced dispersion can make a noticeable but modest contribution to the observed small-scale horizontal diffusivity in the ocean, leaving the bulk of the phenomenon to be attributed to other processes such as the small-scale tail of the potential-vorticity-controlled balanced flow (e.g. Polzin & Ferrari Reference Polzin and Ferrari2004). Interestingly, as suggested by a referee, the stochastic models studied here with frequency ${\it\omega}=0$ might be a useful modelling tool for the diffusion achieved by such balanced flows.

5. Concluding remarks

In hindsight it is easy to see the connection between the $D=O(a^{2})$ scaling of BGHC13 and the $D=O(a^{4})$ scaling obtained here: in the high-frequency regime the former estimate for $D$ was multiplied by the linear damping rate and divided by the wave frequency squared. The present estimate allows for a nonlinear amplitude-dependent damping rate of size $1/{\it\tau}^{E}=O(a^{p})$ (cf. (3.11a )), and hence for $p=2$ the scaling $D=O(a^{4})$ is consistent with the BGHC13 result. As noted before, this is the same scaling as in the inviscid theory in Holmes-Cerfon et al. (Reference Holmes-Cerfon, Bühler and Ferrari2011), but with a much larger pre-factor.

The main restriction of the present toy model is of course that it considers only a single wave mode in isolation, with the presence of all other modes crudely represented by white-noise forcing and nonlinear damping. It would be attractive to have a firm theoretical approach at hand to improve on this, but applications of many-mode theories such as weak wave turbulence theory have been difficult to carry out successfully for internal waves, especially in the presence of the strong Coriolis forces that are inevitable in oceanic applications. This remains an open theoretical question.

We have experimented with direct nonlinear numerical simulations of the rotating shallow-water equations under the constraint of uniform potential vorticity in order to discern the relevant statistics for a single wave mode in the spectrum, but this has not led to clear results so far.

An important theoretical point in this connection is that the wave–wave interactions that are crudely modelled by the nonlinear damping term in (3.1) do not impact on the potential vorticity of the fluid at all. This is in stark contrast to viscous wave dissipation or nonlinear wave breaking, both of which of course do change the potential vorticity locally (e.g. Bühler Reference Bühler2000). A consistent theoretical many-mode model that encompasses both wave and vortex modes would therefore only allow for actual wave dissipation on the smallest scales to impact on the potential-vorticity-controlled vortex flow. In other words, the bulk of the wave energy cascade through a spectrum of internal waves is invisible to the potential vorticity even though there is effective forcing and damping of each wave mode in the cascade. Clearly, it would be highly desirable to have a simple ocean spectral wave and vortex model that is functioning under these constraints. This would be useful not just for studies of diffusivity, but also for the exchange of energy between waves and vortices in general.

Acknowledgements

We thank M. Holmes-Cerfon for useful discussions and the anonymous referees for several helpful suggestions. Financial support under the United States National Science Foundation grant DMS-1312159 is gratefully acknowledged.

Appendix A

A.1. Computations involving invariant measures

The Fokker–Planck equation for the p.d.f. ${\it\rho}(u,t)$ belonging to (2.1) is (e.g. Gardiner Reference Gardiner1997)

(A 1) $$\begin{eqnarray}\frac{\partial {\it\rho}}{\partial t}=L^{\dagger }{\it\rho}=\frac{\partial }{\partial u}\left({\it\gamma}u(u^{2})^{\,p/2}{\it\rho}+\frac{\partial }{\partial u}\left(\frac{{\it\beta}^{2}}{2}{\it\rho}\right)\right)\end{eqnarray}$$

and it follows that (2.2) is its time-independent equilibrium solution. The value of $Z$ follows from the integral identity

(A 2) $$\begin{eqnarray}\int _{0}^{\infty }x^{c}\,\text{e}^{-ax^{b}}\,\text{d}x=\frac{1}{b}{\it\Gamma}\left(\frac{1+c}{b}\right)\left(\frac{1}{a}\right)^{(1+c)/b},\end{eqnarray}$$

which is valid for $a,b>0$ and $c>-1$ . Here ${\it\Gamma}(\cdot )$ is the gamma function defined by

(A 3) $$\begin{eqnarray}{\it\Gamma}(1+t)=\int _{0}^{\infty }x^{t}\,\text{e}^{-x}\,\text{d}x.\end{eqnarray}$$

To derive (2.30) for the two-dimensional system (2.26a ) we start with polar coordinates such that $u=r\cos {\it\theta}$ and hence

(A 4) $$\begin{eqnarray}\mathbb{E}[|U|^{m}]=\frac{1}{Z}\int _{0}^{2{\rm\pi}}\int _{0}^{\infty }|r\cos {\it\theta}|^{m}\text{e}^{-(2{\it\gamma}/(2+p){\it\beta}^{2})r^{2+p}}r\,\text{d}r\,\text{d}{\it\theta}.\end{eqnarray}$$

The azimuthal integral is

(A 5) $$\begin{eqnarray}\int _{0}^{2{\rm\pi}}|\cos {\it\theta}|^{m}\,\text{d}{\it\theta}=4\int _{0}^{{\rm\pi}/2}(\cos {\it\theta})^{m}\,\text{d}{\it\theta}=2\sqrt{{\rm\pi}}\frac{{\it\Gamma}\left({\displaystyle \frac{1+m}{2}}\right)}{{\it\Gamma}\left({\displaystyle \frac{2+m}{2}}\right)}\end{eqnarray}$$

and the radial part is an instance of (A 2). Combining with the second equation in (2.28) then yields (2.30).

A.2. Equivalent diffusions in (3.3)

We ignore the identical drift terms. If $n$ random variables are forced by $m$ noise terms via

(A 6) $$\begin{eqnarray}\text{d}X_{i}=\mathop{\sum }_{k=1}^{m}b_{ik}\,\text{d}W_{k}\quad \text{where }X\in R^{n},b\in R^{n\times m},W\in R^{m}\end{eqnarray}$$

then the generator is given by (e.g. Oksendal Reference Oksendal2013)

(A 7) $$\begin{eqnarray}L=\mathop{\sum }_{i=1}^{n}\mathop{\sum }_{j=1}^{n}\frac{c_{ij}}{2}\frac{\partial ^{2}}{\partial x_{i}\partial x_{j}}\quad \text{where }c=bb^{\text{T}}\in R^{n\times n}.\end{eqnarray}$$

In the present case $m=n=2$ , $(X_{1},X_{2})=(A_{r},A_{i})$ , and hence (3.3) holds because

(A 8) $$\begin{eqnarray}b={\it\beta}\left(\begin{array}{@{}cc@{}}\cos f\,t & -\text{sin}f\,t\\ \sin f\,t & \cos f\,t\end{array}\right)\quad \Rightarrow \quad c=bb^{\text{T}}={\it\beta}^{2}\left(\begin{array}{@{}cc@{}}1 & 0\\ 0 & 1\end{array}\right).\end{eqnarray}$$

References

Balk, A. M. 2006 Wave turbulent diffusion due to the Doppler shift. J. Stat. Mech. P08018.Google Scholar
Balk, A. M., Falkovich, G. & Stepanov, M. G. 2004 Growth of density inhomogeneities in a flow of wave turbulence. Phys. Rev. Lett. 92, 244504.CrossRefGoogle Scholar
Bühler, O. 2000 On the vorticity transport due to dissipating or breaking waves in shallow-water flow. J. Fluid Mech. 407, 235263.Google Scholar
Bühler, O., Grisouard, N. & Holmes-Cerfon, M. 2013 Strong particle dispersion by weakly dissipative random internal waves. J. Fluid Mech. 719, R4.Google Scholar
Bühler, O. & Holmes-Cerfon, M. 2009 Particle dispersion by random waves in rotating shallow water. J. Fluid Mech. 638, 526.Google Scholar
Gardiner, C. W. 1997 Handbook of Stochastic Methods. Springer.Google Scholar
Garrett, C. 2006 Turbulent dispersion in the ocean. Prog. Oceanogr. 70 (2), 113125.Google Scholar
Herterich, K. & Hasselmann, K. 1982 The horizontal diffusion of tracers by surface waves. J. Phys. Oceanogr. 12, 704712.2.0.CO;2>CrossRefGoogle Scholar
Holmes-Cerfon, M., Bühler, O. & Ferrari, R. 2011 Particle dispersion by random waves in the rotating Boussinesq system. J. Fluid Mech. 670, 150175.Google Scholar
Ledoux, V., Daele, M. V. & Berghe, G. V. 2005 Matslise: a matlab package for the numerical solution of Sturm–Liouville and Schrödinger equations. ACM Trans. Math. Softw. (TOMS) 31 (4), 532554.CrossRefGoogle Scholar
Ledwell, J., Watson, A. & Law, C. 1998 Mixing of a tracer in the pycnocline. J. Geophys. Res. 103 (C10), 2149921529.CrossRefGoogle Scholar
McComas, C. H. & Bretherton, F. P. 1977 Resonant interaction of oceanic internal waves. J. Geophys. Res. 82 (9), 13971412.Google Scholar
Munk, W. 1981 Internal waves and small-scale processes. In Evolution of Physical Oceanography (ed. Warren, B. A. & Wunsch, C.), pp. 264291. MIT Press.Google Scholar
Oksendal, B. 2013 Stochastic Differential Equations: An Introduction with Applications. Springer Science & Business Media.Google Scholar
Polzin, K. & Ferrari, R. 2004 Isopycnal dispersion in NATRE. J. Phys. Oceanogr. 34, 247257.2.0.CO;2>CrossRefGoogle Scholar
Sanderson, B. G. & Okubo, A. 1988 Diffusion by internal waves. J. Geophys. Res. 93, 35703582.CrossRefGoogle Scholar
Shcherbina, A. Y., Sundermeyer, M. A., Kunze, E., D’Asaro, E., Badin, G., Birch, D., Brunner-Suzuki, Anne-Marie, E. G., Callies, J., Kuebel, C., Brandy, T., Claret, M. et al. 2014 The latmix summer campaign: Submesoscale stirring in the upper ocean. A part of ams special collection latmix: studies of submesoscale stirring and mixing. Bull. Am. Meteorol. Soc. 96, 12571279.Google Scholar
Sundermeyer, M. A. & Ledwell, J. R. 2001 Lateral dispersion over the continental shelf: analysis of dye release experiments. J. Geophys. Res.: Oceans 106 (C5), 96039621.CrossRefGoogle Scholar
Taylor, G. I. 1921 Diffusion by continuous movements. Proc. Lond. Math. Soc. 20, 196212.Google Scholar
Watanabe, M. & Hibiya, T. 2005 Estimates of energy dissipation rates in the three-dimensional deep ocean internal wave field. J. Oceanogr. 61 (1), 123127.Google Scholar
Weichman, P. & Glazman, R. 2000 Passive scalar transport by travelling wave fields. J. Fluid Mech. 420, 147200.Google Scholar
Young, W. R., Rhines, P. B. & Garrett, C. J. 1982 Shear-flow dispersion, internal waves and horizontal mixing in the ocean. J. Phys. Oceanogr. 12, 515527.Google Scholar
Figure 0

Figure 1. (a) Non-dimensional exact $C(t)$ computed in § 2.1 (solid line) and approximate $C_{\ast }(t)$ computed in § 2.2 (dot-dashed line) for the process (2.1) with $p=2$. The approximation is very accurate except for a small underestimate at large lag times. (b) Illustration of the first four non-dimensional eigenfunctions $y_{k}$ for $k\in \{0,1,2,3\}$ for the same case. The weighted functions $y_{k}\sqrt{{\it\rho}}$ are plotted, which are orthogonal under the standard inner product.

Figure 1

Table 1. The eigenfunction expansion for $p=1$ and $p=2$, and ${\it\beta}={\it\gamma}=1$, which produces non-dimensional eigenvalues and other parameters. Dimensional values are obtained by multiplying the eigenvalues ${\it\lambda}_{k}$ by $1/T$ and so on. Eigenvalues with even $k$ are neglected because their corresponding eigenfunctions are orthogonal to the initial condition $f(u,0)=u$ and hence make no contribution to $C(t)$. For $p=1$ the approximate $D_{\ast }=0.479$ and for $p=2$ it is $D_{\ast }=0.457$, indicating the remarkable accuracy of the exponential approximation (2.24a,b).