1. Introduction
Marine fishing is a primary means of providing aquatic products and holds significant importance in ensuring the economic development of fisheries (Pham et al., Reference Pham, Huang and Chuang2014; Ding et al., Reference Ding, Lu and Xue2021). Various issues, such as unauthorised fishing activities, mismatched vessel certificates and irregular operations by fishing vessels, have led to the decline of marine fishery resources, causing severe damage to aquatic ecosystems (Trebilco et al., Reference Trebilco, Halpern, Flemming, Field, Blanchard and Worm2011; Christensen, Reference Christensen2016; Ma, Reference Ma2020; Mackay et al., Reference Mackay, Hardesty and Wilcox2020). There is an urgent need to strengthen the supervision of fishing vessels and effectively safeguard fishery resources. According to the requirements of Chinese fishery management authorities, fishing vessels must be equipped with a vessel monitoring system (VMS), automatic identification system (AIS) and BeiDou equipment that enables the acquisition of trajectory information, such as heading, speed and location. These data make it feasible to identify fishing vessel behaviours through data-mining approaches (Deng et al., Reference Deng, Dichmont, Milton, Haywood, Vance, Hall and Die2005; de Souza et al., Reference de Souza, Boerder, Matwin and Worm2016; Shepperson et al., Reference Shepperson, Hintzen, Szostek, Bell, Murray and Kaiser2018).
Currently, methods employing trajectory data for identifying operational behaviours of fishing vessels include machine learning and deep learning techniques. The application of machine learning in identifying fishing vessel operational behaviours has achieved applicable results (Mills et al., Reference Mills, Townsend, Jennings, Eastwood and Houghton2007; Gerritsen and Lordan, Reference Gerritsen and Lordan2011; Natale et al., Reference Natale, Gibin, Alessandrini, Vespe and Paulrud2015). However, machine learning methods demand manual feature extraction. The manual extraction of features is a highly laborious process and requires a certain level of prior knowledge. Carlos et al. (Reference Carlos, Aranda, Velasco, Rodríguez-González and Méndez-López2022) proposed a gear identification method involving supervised autoencoder dimensionality reduction. Haiguang et al. (Reference Haiguang, Feng, Jing, Chao, Yuan and Zhongwen2019) presented a fishing vessel type identification method that first extracts data features using feature engineering and then employs XGBoost as a fishing vessel type identification classifier. Bowen Xing et al. (Reference Xing, Zhang, Liu, Sheng, Bi and Xu2023) introduced a method for classifying fishing vessel operational types based on an improved LightGBM algorithm. This method first captures the correlation between fishing vessel trajectory points using the continuous bag of words (CBOW) model and then trains trajectory series by the Bayesian-optimised LightGBM algorithms. Zhu et al. (Reference Zhu, Xi, Hu and Chen2023) presented a method for identifying fishing vessel trajectory characteristics involving the Fourier series transform. The model utilises the Fourier series and Gaussian mixture clustering to address the complexity and uncertainty issues in fishing vessel trajectories. Gu et al. (Reference Gu, Hu, Zhao and Liao2024) proposed a transformer network with multi-source information fusion processing (MFGTN) which accurately classifies fishing vessels as single trawl or non-single trawl vessels. Yang et al. (Reference Yang, Wang, Chen, Xiao, Li and Huang2023) proposed a noise-rate adaptive learning robust training paradigm called A-JoCoR, significantly improving ship trajectory classification performance compared with the original method.
Recently, deep learning has achieved significant success in the field of pattern recognition due to its autonomous learning capabilities. It has been studied and applied in fishing vessel behaviour identification (Zheng et al., Reference Zheng, Fan and Zhang2016; Pipanmekaporn and Kamonsantiroj, Reference Pipanmekaporn, Kamonsantiroj, Ahram, Taiar, Gremeaux-Bader and Aminian2020; Tang et al., Reference Tang, Zhang and Fan2020). Kim and Lee (Reference Kim and Lee2020) presented a gear-type identification method based on convolutional neural networks (CNNs) and demonstrated through experiments that the established model has good identification capabilities for gear types. Xu and Dong (Reference Xu and Dong2022) investigated a fishing vessel operational type identification algorithm based on sliding windows and long short-term memory (LSTM) neural network auto-encoders and experimentally proved that the algorithm had an accuracy of 95 ⋅ 82%. Chuaysi and Kiattisin (Reference Chuaysi and Kiattisin2020) combined global and local features to develop a deep learning-based fishing vessel behaviour identification method using time series data. Experimental results on fishing, non-fishing and trans-shipment vessels showed an average accuracy of 97 ⋅ 5%. Despite the achievements of deep learning in learning fishing vessel behaviour identification, insufficient detection accuracy still poses a challenge.
To summarise, identifying fishing vessel behaviours is significant for fishery research, management and the protection of fishery resources. This study proposes a hybrid fishing vessel behaviour identification model based on a one-dimensional convolutional (1D CNN) neural network and LSTM. This model integrates the feature extraction capability of convolutional layers and the capability of LSTM to capture time series features. Comparative experiments were conducted against traditional CNN and LSTM algorithms to validate the effectiveness of the proposed hybrid model. The results show that the hybrid model outperforms other models regarding accuracy and stability.
2. A fishing vessel operational behaviour identification model based on 1D CNN-LSTM
2.1 Overall architecture
The flowchart in Figure 1 shows the fishing vessel operation identification process, divided into two parts: basic feature extraction with 1D CNN and time-domain feature extraction with LSTM.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_fig1.png?pub-status=live)
Figure 1. Flowchart of fishing vessel operations identification
2.2 1D CNN model
The CNN model is a deep learning model (Krizhevsky et al., Reference Krizhevsky, Sutskever and Hinton2017), including 1D-CNN, 2D-CNN and 3D-CNN. 1D-CNN is generally used for processing sequential data, and 2D-CNN for image processing. 3D-CNN incorporates input time dimension and is primarily applied in video processing (Lecun and Bottou, Reference Lecun and Bottou1998; Teng et al., Reference Teng, Chen, Liu, Cheng and Sun2021; Mohtavipour et al., Reference Mohtavipour S, Saeidi and Arabsorkhi2022). 1D-CNN can analyse the local features of 1D data. In this study, a 1D-CNN is employed to analyse the acquired fishing vessel trajectory data. Figure 2 illustrates its specific structure.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_fig2.png?pub-status=live)
Figure 2. Convolution process of 1D-CNN
The left side of the figure represents the input time series data, which is 3D. The convolution is performed from top to bottom, and the red and blue areas denote the convolution kernels of size 2 × 3. The resulting feature map from the convolution is denoted as N*x, where N is related to the data dimensions, kernel size and convolution stride, and x represents the dimensions of the input data. The input matrix for fishing vessel trajectory data in this study is four-dimensional, using 64 convolutional kernels.
2.3 LSTM model
LSTM is a type of recurrent neural network (RNN) applied to the time dimension and incorporates memory units within its hidden layers to manage the memory information of time series (Hochreiter and Schmidhuber, Reference Hochreiter and Schmidhuber1997). LSTM possesses long-term memory capabilities. It can transmit and express information in long time series while using useful information, providing a solution to the issue of vanishing and exploding gradients encountered by simple RNNs. This study uses the LSTM model to identify fishing vessel operational behaviours based on vessel trajectory data.
The LSTM model includes three gate structures: the forget gate, input gate and output gate. Figure 3 shows the unit structure of the network. The circular elements represent point-wise operations on vectors, while rectangles signify neural network layers annotated with the activation function used in that layer. Mathematically, LSTM can be defined as in Equations (1)–(6).
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_eqn1.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_eqn2.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_eqn3.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_eqn4.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_eqn5.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_eqn6.png?pub-status=live)
where x is the input series, h is the state of the hidden layer, c is the state of the unit structure, it is the coefficient of the input gate, ft is the coefficient of the forget gate, ot is the coefficient of the output gate, W is the learning weight and b is the learning bias.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_fig3.png?pub-status=live)
Figure 3. Unit structure of LSTM
2.4 Fishing vessel operational behaviour identification model based on 1D CNN-LSTM
We combined the characteristics of CNN and LSTM to enhance the accuracy of fishing vessel behaviour identification, proposing a hybrid 1D CNN-LSTM model. This proposed hybrid model takes multidimensional time series data as input and predicts the operational behaviour types of fishing vessels as output. Figure 4 illustrates its structure. This model first extracts features with CNN, which consists of two layers of 1D convolutional layers and one maxpooling layer. A flattened layer follows to transform the data into a format suitable for LSTM, integrating spatial feature extraction from CNN with time series representation from LSTM. In addition, to address the issue of overfitting in deep neural networks, a dropout layer is introduced between the flattened and LSTM layers. Dropout (Hinton et al., Reference Hinton, Srivastava, Krizhevsky, Sutskever and Salakhutdinov2012) temporarily drops neural units from the network during training based on a defined probability, simplifying the network structure. In this model, the dropout coefficient is set to 0 ⋅ 5.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_fig4.png?pub-status=live)
Figure 4. Structure of the 1D CNN-LSTM model
Activation functions introduce nonlinear factors into neural networks. They can solve complex problems through nonlinear combinations, enhancing the expressive and learning capabilities of the target network. Commonly used activation functions include the sigmoid function (Kyurkchiev and Markov, Reference Kyurkchiev and Markov2015), the tanh function (Zhang et al., Reference Zhang, Liu, Wei and Zhang2021) and the rectified linear unit (ReLU) function (Li et al., Reference Li, Tang and Yu2020). ReLU somewhat improves the vanishing gradients problem compared to the other two functions, offering faster computation and convergence rates. Its mathematical formula is given by:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_eqn7.png?pub-status=live)
3. Experiment and analysis
3.1 Experimental data
The experimental data originates from location data of fishing vessels equipped with BeiDou devices in the Tianchi Lab. It includes dynamic information, such as the ID, speed and direction of each fishing vessel. Table 1 lists partial trajectory information for vessel ID 20042. Complete trajectory point information is obtainable based on the dynamic information from BeiDou and the static information registered by the fishing vessels. Considering this, it is possible to generate a complete trajectory series of a vessel during operational activities. Figure 5 shows the trajectory series, where (a), (b) and (c) represent gill net, trawl and purse seine, respectively. Preliminary observations from the figure indicate that fishing vessel trajectories vary with behaviour.
Table 1. Information of fishing vessels
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_tab1.png?pub-status=live)
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_fig5.png?pub-status=live)
Figure 5. Vessel trajectories: (a) gill net, (b) trawl and (c) purse seine
The dataset contained certain anomalies. To address this, the study randomly divided the database multiple times into a training set (75%) and a test set (25%) and recorded the misjudgement rates of each trajectory. Trajectories with high misjudgement rates were manually assessed for potential anomalies. A data point was removed from the database if identified as an anomaly. This approach significantly mitigated the impact of anomalous data on algorithm training, enhancing the detection accuracy of the model. Some instances of anomalous data, such as those associated with vessel IDs 20355, 20544 and 26536, appeared due to sensor malfunctions. Vessel ID 28004 was labelled as a purse seine vessel in the database, but its actual trajectory did not bear the characteristics of such a vessel, suggesting the likelihood of anomaly. Vessel ID 20300, marked as a purse seine vessel, maintained speeds consistently between eight and 10 knots, indicating that it belongs to the transport state rather than the purse seine operation state. Vessel ID 28098, identified as a gill net vessel, lacked the characteristics of gill net operations in its trajectory and maintained an average speed exceeding eight knots. Thus, it was deemed as anomalous data, as shown in Figure 6.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_fig6.png?pub-status=live)
Figure 6. Abnormal data of vessel trajectories: (a) ID: 20355, (b) ID: 20544, (c) ID: 26536, (d) ID: 28004, (e) ID: 20300 and (f) ID: 28098
3.2 Evaluation methods
Experiments were conducted using the aforementioned dataset and evaluation standards to validate the effectiveness of the proposed hybrid model. Accuracy (Accuracy) was employed to measure classification precision, which is expressed as:
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_eqn8.png?pub-status=live)
where TPi is the number of correctly predicted positive samples in a class, TNi is the number of correctly predicted negative samples in a class, FPi is the number of negative samples in a class that were incorrectly predicted as positive, FNi is the number of positive samples in a class that were incorrectly predicted as negative, i is the class label and n is the number of classes.
3.3 Experimental results and analysis
The width and size of convolutional kernels are essential parameters for 1D CNN. By adjusting these parameters, the detection performance of the model can be enhanced. This study employed convolutional kernels with a width range of 1–12. The number of convolutional kernels was initialised as 6 and gradually increased to 16. Figure 7 shows the detection results of the model. The precision of the model consistently improves with a kernel width below 5, while the loss function decreases. Regarding the number of kernels, precision gradually improves before k reaches 10, coinciding with a decline in the loss function. Thus, the optimal model parameters were determined as w = 5 for the kernel width and k = 10 for the kernel quantity.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_fig7.png?pub-status=live)
Figure 7. Detection accuracy and loss function curves of convolutional kernels with different widths and quantities: (a) identification accuracy, (b) loss function curve
We utilised the confusion matrix obtained from the test set, as illustrated in Figure 8, to verify the identification performance of the proposed model. Based on the matrix, the identification accuracy for purse seine operations was the lowest, at 91 ⋅ 6%. Out of the 1,296 purse seine vessels in the test set, five were misclassified as gill net vessels, and 103 were misclassified as trawl vessels. The identification accuracy for gill net operations was at 93 ⋅ 5%. Fifty-two instances were misclassified as trawl vessels out of the 8,810 gill net vessels in the test set. For trawl operations, the identification accuracy was 92%. Thirty-one were misclassified as purse seine vessels and 229 were misclassified as gill net vessels out of 3,440 trawl vessels in the test set. Overall, classification errors with each class indicated certain similarities among the three operational methods, such as similar fishing speeds.
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_fig8.png?pub-status=live)
Figure 8. Normalised confusion matrix of classification for purse seine, gill net and trawl
This study compared the classification results of SVM, traditional DCNN and LSTM algorithms with the proposed model to further validate the identification accuracy of the proposed model. Table 2 lists the identification results. The classification results show that LSTM outperformed DCNN regarding identification accuracy, achieving 87% and F1 score of 87%. The 1D CNN-LSTM obtained a higher accuracy of 92% and F1 score of 93%. In addition, the experiment added Recall, Precision and F1 Score as the evaluation criteria of the model based on the overall accuracy, to verify the effect of various algorithms on identifying each type of fishing operation behaviour. The results show that the evaluation indexes of trawling and gillnet were better than those of purse Seine, which may be caused by the more flexible way fishing vessels conduct purse seine operations. This signifies that the proposed model, combining spatial feature extraction from DCNN and the time-series representation capability of LSTM, achieves superior effectiveness in identifying fishing vessel operational classes.
Table 2. Identification results of different models
![](https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20250111151126626-0869:S037346332400033X:S037346332400033X_tab2.png?pub-status=live)
4. Conclusions
The identification of fishing vessel operational behaviours contributes to the regulation of illegal fishing and the strengthening of the preservation of fishery resources. This study proposed a 1D CNN-LSTM hybrid deep learning model based on CNN and LSTM for fishing vessel operational behaviour identification. The data on fishing vessels equipped with BeiDou devices were collected from the Tianchi Lab and used for our experimental study. The experimental results demonstrated that the hybrid model outperformed traditional CNN and LSTM, achieving higher identification accuracy. The accuracy of the proposed method could be higher, but the running time can meet the requirements of real-time detection. In addition to trawl, gillnet and purse seine fishing methods, there are other types of fishing methods and non-fishing status, which can be added to the identification of the various operating states of the fishing vessel. This study helps supervise overfishing by fishing vessels, enhancing fishery regulation and marine resource preservation.
Funding
This research was funded by the National Natural Science Foundation of China (NSFC) Youth Fund (52201410), JMU(ZQ2023012), Doctoral Research Fund Program of Chengyi College, Jimei University (CK21018), and the young and middle-aged teacher education research program of Fujian Province (JAT201037).
Data availability statement
The data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of interest
The authors declare no conflicts of interest.