1. INTRODUCTION
A Remotely Operated Vehicle (ROV) is an important tool for marine resource exploitation and marine scientific research with the advantages of good economy, high flexibility and strong adaptability to the environment, among others (Chu et al., Reference Chu, Zhu and Yang2016a; Avila et al., Reference Avila, Donha and Adamowski2013). The ROV control of horizontal position, orientation and depth is realised by operators relying on the real-time images transmitted by camera when an ROV works underwater (Wang et al., Reference Wang, Ge and Zhu2006). The precise synchronisation operation for various degrees of freedom is difficult to achieve, so the ROV controller is essentially an open-loop control system. The operations of ROV are complex and difficult (Gao et al., Reference Gao, Yang, Liu, Shi and Xu2015c), but if automatic depth control can be achieved during the operation, then manual operation is only for the motion in the horizontal plane and underwater operations will become relatively easy (Bessa et al., Reference Bessa, Dutra and Kreuzer2008; Gao et al., Reference Gao., Proctor and Bradley2015a). Therefore the study of depth tracking control for ROV has a certain practical significance.
In the study of underwater vehicle location tracking control, many methods have been proposed (Pan and Xin, Reference Pan and Xin2012; Feng and Allen, Reference Feng and Allen2004; Sun et al., Reference Sun, Zhu and Yang2014; Gao et al., Reference Gao, Proctor, Shi and Bradley2015b). Due to the complex uncertainties of underwater vehicle systems such as dynamics modelling error and external disturbances (Avila et al., Reference Avila, Adamowski, Maruyama and Takase2012), the conventional controllers such as a Proportional-Integral-Derivative (PID) controller are unable to cope with these uncertainties and will cause poor performance. Hence the adaptive control methods have been researched extensively, such as adaptive sliding mode control (Zhang and Chu, Reference Zhang and Chu2012; Zhang et al., Reference Zhang, Liu, Yin and Liu2015), adaptive neural network control (Zhu et al., Reference Zhu, Hua and Sun2014), adaptive Proportional and Derivative (PD) control (Hoang and Kreuzer, Reference Hoang and Kreuzer2007), etc. In most of these methods, the output of the controller is usually the force or moment acting on each degree of freedom. Since the actual thrust is unmeasurable, the desired thrust of each thruster needs to be determined by thruster distribution matrix first, and then the control signals which should be loaded on the thrusters can be calculated by the desired thrust according to the thrust model (Gan et al., Reference Gan, Wang, Liu and Xu2004). But a complex mapping relationship between the thrust and the control signal is required in this method (Kim and Chung, Reference Kim and Chung2006). It is difficult to obtain accurate control signals according to desired thrust, which may affect the quality of the tracking control system. Moreover, most of these control methods did not take the dead-zone nonlinearity and saturation nonlinearity of the thruster into consideration in the design process. For most thrusters, these two problems are encountered. The stability analysis of the control system shows that the stability of the closed-loop system may not be guaranteed by adaptive learning law when the thruster output is in a saturation zone or dead-zone (Chu et al., Reference Chu, Zhu and Yang2016b; Wu et al., Reference Wu, Gao, Mei and Zhou2012; Liu and Zhou, Reference Liu and Zhou2010; Tong et al., Reference Tong, Sui and Li2013). At present, the solutions to dead-zone nonlinearity of ROVs' tracking control are still rare. For the problem of saturation nonlinearity, the method based on the adjustment of command input signal is usually used (Gan et al., Reference Gan, Wang, Liu and Xu2004). However, this method relies on an accurate thrust model and it is also very complex. Therefore, for the nonlinear problems of thrusters, this research focuses on how to design an adaptive controller to obtain thruster control signals directly and ensure the stability of the closed-loop control system.
Motivated by the aforementioned observations, an adaptive sliding mode control method based on three-layer feed-forward neural network is proposed for ROV depth trajectory tracking control. The Lyapunov stability theory is used for stability analysis to prove that the proposed controller can guarantee trajectory tracking error is uniformly ultimately bounded. The main contributions of this paper can be summarised as three points. First, the problem of dead-zone nonlinearity and saturation nonlinearity of thruster is considered in controller design. Second, the control signal loaded on the thruster can be obtained directly by the designed controller without a thrust model. The third contribution is that a three-layer feed-forward neural network is introduced for online learning for the unknown motion model of an ROV depth tracking system.
The paper is organised as follows. In Section 2, the dead-zone nonlinearity and saturation nonlinearity of thruster is analysed, and a depth trajectory tracking system model which uses thruster control signals as system input is established. In Section 3, an adaptive sliding mode controller for depth trajectory tracking based on neural network is given. In Section 4, Lyapunov stability theory is used to analyse the stability of the closed-loop system. Then in Section 5, the effectiveness of the proposed approach is verified by simulation experiments. Finally, a brief conclusion is included in Section 6 of this paper.
2. PROBLEM FORMULATION
In underwater environments, an ROV is affected by hydrodynamic forces, cable drag forces and other factors, and its motion has strong nonlinearity and cross-coupling (Cao and Zhu, Reference Cao and Zhu2015). However, the distances between the centres of buoyancy and the centres of gravity of most ROVs are relatively far, which make the change of pitch and roll small, so that the influence of other degrees of freedom for vertical motion can be ignored (Bessa et al., Reference Bessa, Dutra and Kreuzer2010). Thus, the ROV vertical motion model can be described as:
where, z, $\dot{z}$ , $\ddot{z}$ denote vertical depth, vertical velocity and vertical acceleration, respectively, M contains ROV mass and added mass, D( $\dot{z}$ ) is hydrodynamic drag, g(z) is the difference between the gravity and buoyancy of the ROV, τ d is external disturbance force, such as cable drag and current interference force, etc., τ(u) is the force acting on the vertical degree of freedom, u is control signal loaded on the thruster and it is usually expressed as a voltage value.
Remark 1: The assumption is that only one thruster is arranged in the vertical degree of freedom, and τ(u) is the thrust. But in the case that multiple identical thrusters are arranged uniformly in the vertical degree of freedom, the control signal of each thruster can be made the same and then τ(u) = nτ i (u i ). Where n is the number of thrusters, τ i (u i ) is the thrust of each thruster. Therefore, nτ i (u i ) can be used instead of τ(u) in Equation (1) in the controller design.
The desired trajectory of ROV depth motion is set as $[z_d, \dot{z}_d ]$ T. Then the trajectory tracking error vector ξ = [ξ 1, ξ 2] T can be defined as:
where z d , $\dot{z}_d $ are the desired vertical depth and the desired vertical velocity, respectively.
Remark 2: In order to realise the desired vertical depth trajectory, the desired vertical velocity $\dot{z}_d $ should be in the range of ( $\dot{z}_{\min} $ , $\dot{z}_{\max} $ ), where $\dot{z}_{\min} $ , $\dot{z}_{\max} $ are the maximum diving velocity and the maximum rising velocity of the ROV, respectively.
Substituting Equation (1) into Equation (2), leads to:
where $f\left( {z,\dot{z}} \right) = - M^{ - 1} \left[ {D\left( {\dot{z}} \right)\dot{z} + g\left( z \right) + \tau _d} \right] - \ddot{z}_d $ are the un-modelled dynamics.
For most thrusters, the thrust model can be described approximately as τ(u) = a 1 u|u| − a 2 |u|v a , where a 1, a 2 are unknown positive parameters and v a is the advance speed of the propeller (Alessandri et al., Reference Alessandri, Caccia and Veruggio1999). However, the actual tested thrust curve of a Model 520 thruster is shown in Figure 1(a), which shows that the dead-zone nonlinearity problem exists. According to the test results, the theoretical thrust curve of the thruster can be inferred as shown in Figure 1(b). In Figure 1(b), when the thruster control signal is within the range of [u l , ur ], there is no thrust output. The function shown in Equation (4) can be used to describe the theoretical thrust curve of thruster in Figure 1(b).
From Figure 1(b) and Equation (4), the dead-zone of the thruster has the following characteristics:
-
Characteristic 1: τ(u) is unmeasurable.
-
Characteristic 2: The dead-zone parameters both u r and u l are unknown bounded constants.
-
Characteristic 3: τ +(u) and τ −(u) are smooth, and there exist positive constants k r , kl , $\dot k_r $ and $\dot k_l $ , which make:
(5) $$0 \lt k_r \les \tau _ + \left( u \right),\,0 \lt k_l \les \tau _ - \left( u \right),\,\dot k_r \ges \dot \tau _ + \left( u \right) \gt 0,\,\dot k_l \les \dot \tau _ - \left( u \right) \lt 0$$ -
Characteristic 4: There is known positive constant β, so that β ⩽ min(k r , kl ).
-
Characteristic 5: When u < u r , then τ +(u) = τ +(u r ) and when u > u l , τ −(u) = τ −(u l ). There exists an unknown positive constant ρ which makes |d(t)| ⩽ ρ.
According to the above characteristics, τ(u) can be re-expressed as:
where T (t) = [τ +(u), τ −(u)], ϕ (t) = [ϕ r , ϕl ] T , and:
Substituting Equation (6) into Equation (3), the depth trajectory tracking control system equation of the ROV can be expressed as:
As can be seen from Equation (8), the control system is described in the form of affine nonlinear system which uses thruster control signal u as system input. This benefits the subsequent controller design to obtain thruster control signal u directly without a thrust model, which is quite different from most existing ROV control methods.
Because of the finite thrust and the large ROV inertia in a practical thruster, the saturation nonlinearity of the thruster easily appears when the trajectory tracking error is large. So it must be considered during the process of controller design. When the output signal of the controller is in the saturation area, the control law as shown in Equation (9) is usually adopted (Chen et al., Reference Chen, Ge and Ren2011).
where v is the output of control law, v max and v min are the upper limit and lower limit of thruster control signal, respectively.
Remark 3: For an actual ROV system, the following must be satisfied: (1) if u = v max, then $\ddot{z}$ > 0, and $\dot{z}$ will increase until $\dot{z}{\rm =} \dot{z}_{{\rm max}} $ ; (2) if u = v min, then $\ddot{z}$ < 0, and $\dot{z}$ will decrease until $\dot{z}{\rm =} \dot{z}_{{\rm min}} $ .
In summary, the main objective of this paper is to design the depth trajectory tracking control law v with thruster nonlinearity and un-modelled dynamics, which will finally make the depth trajectory tracking error uniformly ultimately bounded.
3. CONTROLLER DESIGN
Since there are un-modelled dynamics, an adaptive sliding mode controller based on neural network will be given for ROV depth trajectory tracking in this Section. Without considering the thruster nonlinearity, many adaptive sliding mode control methods have been proposed (Zhang and Chu, Reference Zhang and Chu2012; Zhu et al., Reference Zhu, Hua and Sun2014; Hoang and Kreuzer, Reference Hoang and Kreuzer2007). However, thruster nonlinearity is inevitable in an actual ROV system. That is, in these methods, when the control law reaches the saturated zone, the adaptive parameters are still adjusted although the actual control law does not change. If the control law stays in the saturated zone for a long time and then returns to a feasible region, control system instability might occur due to the adjusted parameters. In this paper, the proposed adaptive sliding mode controller will only adjust the adaptive parameters when the control law is in the feasible region, and the Lyapunov stability theory is used to prove that the depth trajectory tracking error is uniformly ultimately bounded. In this section, the sliding mode dynamics will be analysed first. Then, the three-layer feed-forward neural network will be given and the estimation error of the neural network will be analysed. Finally, the theorem will be given.
According to the principle of sliding mode control, a sliding surface s is defined firstly as (Hussain and Po, Reference Hussain and Ho2004):
where l is a positive constant.
Differentiating s and introducing Equation (8), we obtain:
where $F(t) = M\left( {{\bi T}\left( t \right){\bi \varphi} \left( t \right)} \right)^{ - 1} \left( {f\left( {z,\dot{z}} \right) + l\dot \xi _1} \right)$ .
Considering the saturation nonlinearity of the thruster, the control signal is loaded on the thruster according to Equation (9). When the control law output v is beyond the range of [v min, v max], v is different from thruster control signal u. Therefore, the variable δ shown in Equation (12) is defined to indicate the difference between v and u.
From Equation (9), when v min ⩽ v ⩽ v max, then δ = 0. When v > v max, then δ > 0. When v < v min, then δ < 0.
Substituting Equation (12) into Equation (11) leads to:
In Equation (13), F(t) contains information such as hydrodynamic force, external disturbance force and dead-zone nonlinearity, and it is usually difficult to obtain accurately by dynamic modelling. Therefore, the method which uses neural network to learn the unknown items in the model online is widely adopted, such as radial basis function neural network (Gao et al., Reference Gao, Wang and Zhang2014) and fuzzy neural network (Zhang et al., Reference Zhang, Qi and Pang2009), etc. A radial basis function neural network has the advantage of fast learning speed, but on the other hand it usually needs to select the radial basis function centre and the bandwidth offline according to the experimental data. Fuzzy neural network has good nonlinear identification ability, but the fuzzy field division depends on the designer's experience and the network structure is relatively complex. Based on the above consideration, three-layer feed-forward neural network is used for online identification in this paper. The weights from hidden layer to output layer and the weights from input layer to hidden layer in the network are obtained through online learning. The structure of the three-layer feed-forward neural network is shown in Figure 2.
According to the nonlinear mapping ability of the neural network, there are the optimal network weights W and V (Peng and Duba, Reference Peng and Duba2012), which make:
where W is the weight vector from hidden layer to output layer, V is the weight matrix from input layer to hidden layer, x = [u, z, $\dot{z}$ ] T is the input vector of neural network, ε is the approximation error of neural network, σ (•) is a sigmoid function.
According to Equation (14), the estimated output of the neural network can be expressed as:
where ${\hat{{\bi W}}}$ and ${\hat{{\bi V}}}$ are the estimates of W and V , respectively.
The estimation error of neural network identification should be analysed first. From Equations (14) and (15), the estimation error of neural network can be obtained as:
where ${\tilde{{\bi W}}} = {\bi W} - {\hat{{\bi W}}}$ and ${\tilde{{\bi V}}} = {\bi V} - {\hat{{\bi V}}}$ are the estimation errors of network weights.
For the adaptive law design of weights, we need to obtain the explicit expressions of ${ \tilde{{\bi W}}}$ and ${\tilde{{\bi V}}}$ . Therefore, the Taylor expansion of σ ( Vx ) about ${\hat{\bi Vx}}$ gives:
where ${\bi o}({\hat{{\bi V}}}{\bi x})^2 $ is the higher order item of Taylor expansion.
Introducing Equation (17) into Equation (16), the estimation error of the neural network can be rewritten as:
where $\omega = { {\tilde{{\bi W}}}{\bi \sigma^{\prime}}}({\hat{{\bi V}}}{\bi x}){\tilde{{\bi V}}}{\bi x} + {\bi Wo}({\hat{{\bi V}}}{\bi x})^2 + \varepsilon $ .
It is assumed that the approximation error of the neural network is unknown and bounded, and the upper bound is known (Gao et al., Reference Gao, Yang, Liu, Shi and Xu2015c; Yang and Wang, Reference Yang and Wang2007), thus the stability of the closed-loop system can be ensured by using the sliding mode strategy to compensate. Although this hypothesis is tenable, the suitable upper bound value is difficult to select. A larger upper bound estimation can easily cause system chattering. To relax the qualifications, only $\tilde{F}\left( t \right)$ is to be assumed bounded and its upper bound is obtained by adaptive learning of an estimation item in this paper. Through introducing an adaptive estimation item into control law, the system chattering caused by the sliding mode strategy item can be avoided.
Assumption 1 (Zhang et al., Reference Zhang, Qi and Pang2009):
where C is an unknown positive constant, ψ( x ) is a known function about x .
After the analysis of the sliding mode dynamics and estimation error of the neural network, the main conclusion of this paper will be given as follows:
Theorem 1: Consider the vertical dynamics of an ROV expressed as Equation (1) with thruster nonlinearity and un-modelled dynamics. If the control law in Equation (20) is introduced with the thruster control signal in Equation (9), and the control parameters in Equation (25), and the parameters' adaptive laws in Equations (21)–(24), the depth trajectory tracking error ξ 1 will be uniformly ultimately bounded.
where $\hat{C}$ is the estimated value of C, $\hat{\rho} $ is the estimated value of ρ. Γ W , Γ V , Γ C , Γ ρ , λW, λV, λC, λρ are known positive constants, W o , V o , C o , ρo are offline estimates of W, V , C, ρ, respectively.
Remark 4: For the control law Equation (20), there are several parameters which should be adjusted in the process of control, so a large amount of calculation will be needed. But this is not a problem for the control system of an ROV, because the control cycle is usually set as 0·1 s to 1 s (Ma and Zeng, Reference Ma and Zeng2015) which makes enough time to calculate these parameters.
4. STABILITY ANALYSIS
In the stability analysis of most sliding mode controllers, the Lyapunov function usually contains s 2, and $s\dot s \lt 0$ will be acquired ultimately to prove that the control system is stable. However, since the saturation nonlinearity is considered, δ may take different values as different controller output signal v. When δs ⩾ 0, –δs can be ignored by the stability theory. When δs < 0, its impact on the stability of the system must be analysed. Therefore, the situation aiming at different δ and s will be discussed separately.
Situation 1: δ = 0 or (δ ⩾ 0 and s ⩾ 0) or (δ ⩽ 0 and s ⩽ 0)
Consider the Lyapunov function:
where $\tilde{C} = C - \hat{C}$ , $\tilde \rho = \rho - \hat{\rho} $ .
Differentiating V 1 and substituting Equation (13) into it, we have:
According to the definition and characteristics Equations (4) ~ (7), we can obtain:
According to Equation (28) and substituting control law Equations (20) and (26) into Equation (27), we obtain:
Since δs ⩾ 0, Equation (29) can be expressed as:
Introducing adaptive law Equations (21) and (22) into Equation (30), we obtain:
Introducing adaptive law Equations (23) into (31), and K 2 = 0, we obtain:
Introducing adaptive law Equations (24) into (32), we obtain:
Since:
Then:
Because of:
Introducing Equations (39) and (40) and K 1 into Equation (32), we obtain:
where χ 1 = min{2γk r M −1, 2γk l M −1, Γ ρ λρ, ΓCλC, ΓWλW /2, Γ V λV /2} which is a positive constant, λ 1 is shown in Equation (42).
From Equation (41), we can obtain:
where V 1(0) is the initial value of V 1 at t = 0.
From Equation (43), when t → ∞, V 1 → λ 1/χ 1, namely that s, ${ \tilde{{\bi V}}}$ , $\tilde{C}$ , $\tilde \rho $ are uniformly ultimately bounded, which means that trajectory tracking error ξ 1 is uniformly ultimately bounded.
Situation 2: δ < 0 and s > 0
When δ < 0, then v < v min and u = v min. For this situation, the adaptive laws Equations (21)-(24) are set as zero, so only the convergence of trajectory tracking error ξ 1 will be analysed.
According to the definition of s, when s > 0, then ξ 1 > 0 and ξ 2 < 0, or ξ 1 < 0 and ξ 2 > 0, or ξ 1 > 0 and ξ 2 > 0. Obviously, ξ 1 will converge to zero for the previous two cases, so that the control system is stable. Therefore, only the situations of ξ 1 > 0 and ξ 2 > 0 need to be considered.
When u = v min, $\ddot{z}$ will be less than zero and $\dot{z}$ will decrease until $\dot{z}{\rm =} \dot{z}_{\min} $ , which makes ξ 2 = $\dot{z}_{\min} - \dot{z}_d $ < 0. Therefore, ξ 2 > 0 is just for a short time and its value will be less than zero eventually, so that ξ 1 will also converge to zero.
If u = v min continues being used, both ξ 1 and ξ 2 will be less than zero, that is s < 0. That satisfies Situation 1.
Situation 3: δ > 0 and s < 0
When δ > 0, then v > v max and u = v max. For this situation, the adaptive laws Equation (21)-(24) are also set as zero, so only the convergence of trajectory tracking error ξ 1 like Situation 2 is to be analysed.
When s < 0, then ξ 1 > 0 and ξ 2 < 0, or ξ 1 < 0 and ξ 2 > 0, or ξ 1 < 0 and ξ 2 < 0. Obviously, ξ 1 will converge to zero in the previous two cases, so that the control system is stable. Therefore, only the situations of ξ 1 < 0 and ξ 2 < 0 need to be considered.
When u = v max, $\ddot{z}$ will be larger than zero and then $\dot{z}$ will increase until $\dot{z}_{\max} $ . That makes ξ 2 = $\dot{z}_{\max} - \dot{z}_d $ > 0. Therefore, ξ 2 < 0 is just for a short time and its value will be larger than zero eventually, so that ξ 1 will also converge to zero.
If u = v max continues being used, both ξ 1 and ξ 2 will be larger than zero, that is s > 0. That satisfies Situation 1.
Based on the above analysis, it can be concluded that the designed control law Equation (20) can ensure the stability of ROV depth trajectory tracking control and that the depth trajectory tracking error is uniformly ultimately bounded.
5. SIMULATION RESULTS
To verify the effectiveness of the proposed adaptive sliding mode controller in this paper, the ODIN vertical motion model is used for simulation experiment. The ODIN vertical motion model is shown in Equation (44) (Podder and Sarkar, Reference Podder and Sarkar2001).
where rand(−10,10) is the random number within [−10,10]. It is assumed that only one thruster works in the ODIN vertical direction. Thrust τ(u) is calculated by:
According to the thrust model Equation (45), it can be seen that u l = −0·5 V, u r = 0·5 V, v min = −5 V, v max = 5 V. The maximum forward and reverse thrust are both 60N.
According to the given motion model Equation (44) and thrust model Equation (45), the relevant parameters in adaptive sliding mode controller Equation (20) are selected as: β = 1, l = 5, γ = 3, ψ( x ) = 1, m 0 = 188, Γ W = Γ V = Γ C = Γ ρ = 1, λ W = λ V = λ C = λ ρ = 0·5. The initial values of ${\hat{{\bi W}}}$ , ${\hat{{\bi V}}}$ , $\hat{C}$ and $\hat{\rho} $ are randomly selected within the range [−0·1,0·1], ${\bi W}_{\bi o} = {\hat{{\bi W}}}$ , ${\bi V}_{o} = {\hat{{\bi V}}}$ , $C_o = \hat{C}$ , $\rho _o = \hat{\rho} $ . The number of hidden layer units of three-layer feed-forward neural network is 10. The initial depth of the ROV is 1·0 m and the control cycle is 0·1 s. Two simulations of sine trajectory tracking control and depth setting tracking control are performed, respectively. In them, the desired depth of the ROV is z d = 2(1 − cos(0·05πt)) in the simulation of sine trajectory tracking control, and the desired depth of ROV in the simulation of depth setting tracking control is shown in Equation (46). Two simulation curves are shown in Figures 3 and 4, respectively.
As can be observed from Figures 3 and 4, the depth output follows the desired trajectory well and the depth tracking error ξ 1 and the sliding mode s are close to zero. In Figure 3(b), the depth tracking error converges to zero at 9·6 s. From 9·7 s to 80 s, the average value of depth tracking error is 0·001 m. In Figure 4(b), the depth tracking error converges to zero at 4·7 s. From 4·8 s to 20 s, the average value of depth tracking error is −0·016 m. This indicates that the trajectory tracking error is uniformly ultimately bounded. From Figure 3(d) and Figure 4(d), it can be observed that the thruster works in the dead zone and saturation zone for a certain time, and the closed-loop system is stable in this period. Simulation results show that the proposed controller is well capable of the depth trajectory tracking and the proposed method is effective for the dead-zone nonlinearity and saturation nonlinearity of the thruster.
6. CONCLUSION
The ROV adaptive sliding mode depth trajectory tracking control method is researched, and the problem of dead-zone nonlinearity and saturation nonlinearity of thruster is considered during the process of controller design. For an unknown thrust model, the proposed controller is able to obtain the control signal directly. Under the condition of un-modelled dynamics, three-layer feed-forward neural network is used for online adaptive identification. The proposed controller can ensure that the depth trajectory tracking error is uniformly ultimately bounded. Simulation results indicate that the proposed method is effective for ROV depth trajectory tracking control with the dead-zone nonlinearity and saturation nonlinearity of thruster. However, we need to realise that only depth tracking has been considered in this paper. In future work, the problem of trajectory tracking control of multiple degrees of ROV with dead-zone nonlinearity and saturation nonlinearity should be researched, so that automatic control of an ROV can be achieved. This will be more challenging because the transformation matrix which expresses the transformation from the body-fixed frame to earth-fixed frame will be introduced and the transformation matrix is very difficult to be processed in controller design based on the proposed method.
ACKNOWLEDGEMENT
This work is supported by the National Natural Science Foundation of China under Grant 51509150, 51575336; Shanghai Municipal Natural Science Foundation under Grant 15ZR1419700.