(C) PLOS One [1]. This unaltered content originally appeared in journals.plosone.org.
Licensed under Creative Commons Attribution (CC BY) license.
url:https://journals.plos.org/plosone/s/licenses-and-copyright

------------



Deep inference of seabird dives from GPS-only records: Performance and generalization properties

['Amédée Roy', 'Institut De Recherche Pour Le Développement', 'Ird', 'Marbec', 'Univ. Montpellier', 'Ifremer', 'Cnrs', 'Sète', 'Imt Atlantique', 'Umr Cnrs Lab-Sticc']

Date: 2022-05

Abstract At-sea behaviour of seabirds have received significant attention in ecology over the last decades as it is a key process in the ecology and fate of these populations. It is also, through the position of top predator that these species often occupy, a relevant and integrative indicator of the dynamics of the marine ecosystems they rely on. Seabird trajectories are recorded through the deployment of GPS, and a variety of statistical approaches have been tested to infer probable behaviours from these location data. Recently, deep learning tools have shown promising results for the segmentation and classification of animal behaviour from trajectory data. Yet, these approaches have not been widely used and investigation is still needed to identify optimal network architecture and to demonstrate their generalization properties. From a database of about 300 foraging trajectories derived from GPS data deployed simultaneously with pressure sensors for the identification of dives, this work has benchmarked deep neural network architectures trained in a supervised manner for the prediction of dives from trajectory data. It first confirms that deep learning allows better dive prediction than usual methods such as Hidden Markov Models. It also demonstrates the generalization properties of the trained networks for inferring dives distribution for seabirds from other colonies and ecosystems. In particular, convolutional networks trained on Peruvian boobies from a specific colony show great ability to predict dives of boobies from other colonies and from distinct ecosystems. We further investigate accross-species generalization using a transfer learning strategy known as ‘fine-tuning’. Starting from a convolutional network pre-trained on Guanay cormorant data reduced by two the size of the dataset needed to accurately predict dives in a tropical booby from Brazil. We believe that the networks trained in this study will provide relevant starting point for future fine-tuning works for seabird trajectory segmentation.

Author summary Over the last decades, the use of miniaturized electronic devices enabled the tracking of many wide-ranging animal species. The deployment of GPS has notably informed on migratory, habitat and foraging strategies of numerous seabird species. A key challenge in movement ecology is to identify specific behavioural patterns (e.g. travelling, resting, foraging) through the observed movement data. In this work, we address the inference of seabird diving behaviour from GPS data using deep learning methods. We demonstrate the performance of deep networks to accurately identify movement patterns from GPS data over state-of-the-art tools, and we illustrate their great accross-species generalization properties (i.e. the ability to generalize prediction from one seabird species to aother). Our results further supports the relevance of deep learning schemes as ‘ready-to-use’ tools which could be used by ecologists to segmentate animal trajectories on new (small) datasets, including when these datasets do not include groundtruthed labelled data for a supervised training.

Citation: Roy A, Lanco Bertrand S, Fablet R (2022) Deep inference of seabird dives from GPS-only records: Performance and generalization properties. PLoS Comput Biol 18(3): e1009890. https://doi.org/10.1371/journal.pcbi.1009890 Editor: Ephraim Hanks, Pennsylvania State University, UNITED STATES Received: March 29, 2021; Accepted: February 2, 2022; Published: March 11, 2022 Copyright: © 2022 Roy et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Data Availability: Data, code and fitted models are available on github repository https://github.com/AmedeeRoy/BirdDL. Funding: This work is a contribution to the TRIATLAS project (European Union’s Horizon 2020 research and innovation program – grant agreement No. 817578), and to the Young Team IRD Programm (JEAI) for TABASCO project. RF was supported by LEFE program (LEFE MANU project IA-OAC), CNES (grant SWOT-DIEGO) and ANR Projects Melody and OceaniX. Fieldworks have been conducted thanks to the cooperative agreement between IRD, the Agence Nationale de la Recherche (ANR) project TOPINEME, and of the International Joint Laboratory LMI-DISCOH (to SLB). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing interests: The authors have declared that no competing interests exist.

Introduction Marine top predators have received significant attention in marine ecology over the last decades [1]. They are known to use vast areas for feeding, thus requiring specific adaptive foraging strategies in order to localize their preys, especially in the pelagic environments which are highly variable [2]. They offer a unique perspective into ocean processes and dynamics, given that they can amplify information on the structure of the seascape across multiple spatio-temporal scales due to their relatively high mobility and longevity. Often considered as sentinels of the environmental variability and bio-indicators for ecosystem structure and dynamics, their study has been particularly contextualized into ecosystem-based management and conservation issues [3, 4]. Numerous studies have focused on the variability of seabirds’ foraging strategies and in particular of dive distributions. Assessing consistency or shifts in foraging locations [5–7], and in the resource spatial partitioning [8, 9] provide indeed crucial information for understanding marine ecosystems. This has been particularly enabled by great technical advances in the miniaturization and autonomy of biologging devices [10, 11]. GPS loggers have been at the forefront of this breakthrough, and can now provide precise and accurate data on the foraging trajectories of many free-ranging species, such as seabirds [12, 13]. Detailed information on the diving behaviour has also been gained through the additional use of pressure sensors, such as Time Depth Recorders (TDR) devices [14–16]. Yet, for historical, financial and ethical reasons, the deployment of several sensors has not always been possible and a substantial amount of tracking datasets consist in GPS tracks only. The development of tools dedicated to animal trajectories segmentation (i.e. for dive identification) is therefore needed to extract more out of historical seabird foraging trajectories [17]. Among existing approaches to dive identification from GPS tracks, many individual-based studies aim to infer behavioral state directly by applying thresholds to various ecological metrics of movement data, such as speed, direction and tortuosity [18, 19]. A common example is the so-called First-Passage Time method (hereafter, FPT), which is defined as the time taken for an individual to cross a virtual circle of given radius [20–22]. Here foraging behaviour is assumed to occur when birds fly at very low speeds [23]. Statistical methods have also been used to predict diving behaviour with clustering schemes such as the Expectation Maximization binary clustering technique [24, 25] or using hidden Markov models (hereafter, HMM) typically with 2 or 3 distinct behavioural modes to explicit account for time-related priors [26–29]. More occasionally, supervised machine learning approaches such as artificial neural networks, support vector machines and random forests have also been used [30, 31]. We may refer the reader to [32] for a more detailed review of these methods. Recently, deep learning methods have been suggested to be a potentially useful tool for behavioural pattern segmentation [33]. Deep learning refers to a neural network with multiple layers of processing units [34]. By decomposing the data through these multiple layers, deep neural networks may learn complex features for representing the data with a high level of abstraction at multiple scales. The trajectory of an animal being the result of complex processes at multiple spatio-temporal scales [35], deep learning might be able to extract relevant representations of trajectories for performing tasks such as classification, segmentation or simulation. Deep learning has become the state-of-the-art framework for a wide range of problems in text, speech, audio and image processing and applications in ecology have mainly addressed image analysis and computer vision case-studies [36, 37]. Fewer studies have explored deep learning for animal trajectory data. Recurrent neural networks have been used for movement prediction [38, 39], and for the identification of representative movement patterns [40]. Very recently, an attention network has also been proposed for comparative analysis of animal trajectories [41]. Related to our study, a fully-connected network (hereafter, FCNet) has been used to predict seabirds’ diving in European shags, common guillemots and razorbills [17]. With a very simple FCNet with 4 layers comprising hundreds of hidden nodes, this study demonstrated the improved accuracy of this approach over commonly-used behavioural classification methods. These promising results support new investigations to further explore the potential of deep learning schemes for movement ecology studies. In particular, a central challenge in deep learning is to make algorithms that will not only perform well on the training data, but also on new datasets [42]. Generalization properties are indeed crucial for deep networks to tackle a wide range of problems. For example, it would be relevant to develop a neural network for the segmentation of behavioral patterns of certain species and whose characteristics are transferable to the analysis of the behavior of another species. Transfer learning refers to the fact of using knowledge that was gained from solving one problem and applying it to a new but related problem. For this purpose, a solution known as ‘fine-tuning’ consists in using a pre-trained model as the initialization of the training scheme rather than training a new model from scratch [43]. As in [17], this work addresses the inference of seabird diving behaviour from GPS data using Deep Learning methods. Besides, their FCNet architecture, we investigated Convolutional Neural Networks and U-Networks [44], which are state-of-the-art architectures for time series and image data processing and shall better account for the time structure of trajectory data. As case-studies, we considered two tropical seabird genus with distinct diving behaviour (Boobies vs Cormorants). The associated datasets comprised 297 foraging trips derived from GPS data deployed simultaneously with pressure sensors for the identification of dives. Our specific objectives were therefore (a) to confirm the performance of deep networks over state-of-the-art tools for dives identification, (b) to demonstrate generalization properties of trained network to predict dives of seabirds from other colonies and (c) to evaluate the benefits of a transfer learning strategy known as ‘fine-tuning’ for accross-species generalization.

Discussion This study aimed at predicting seabirds dives from GPS data only using deep neural networks trained in a supervised manner based on TDR data to define the groundtruthed dives. In line with [17], this study further supports the relevance of deep learning approach over classical methods for dive predictions. Using convolutional architectures rather fully-connected ones, we reported even better results with higher stability to the different data inputs, as well as better generalization abilities. Peruvian boobies and Guanay cormorants tracked in Peru breed in a highly productive upwelling system, the Humboldt Current System (HCS) and feed on the same preys, i.e. Peruvian anchovies [50]. However, they are known to have distinct foraging strategies: boobies are plunge divers reaching in average about 2 m depth and spending most of the time in fly, while cormorants dive deeper and longer on average, reach up to 30 m depth, and spend up to 40% of the time resting on the water surface [51] (Table 1). By contrast masked boobies breeding at Fernando de Noronha are plunge divers similarly to Peruvian boobies, yet they forage mainly in oligotrophic waters [52] and feed mainly on flying fish and flying squids [53, 54]. Their foraging strategies then differ from Peruvian boobies as they perform longest trips and spend more time resting at sea surface (Table 1). We demonstrated that for these three species, our best deep network models were able to accurately predict around 95% of dives and outperformed HMM that predicted around 85% of dives. In particular, the proposed U-shape deep network (UNet) demonstrated a greater robustness to different data inputs, as it obtained the best results whatever the sampling resolution (Table 3). Additionally, UNet also resulted in better seabird dive distribution maps (Fig 6). Recently numerous studies used seabirds dive as a proxy for prey distribution, and such distribution are usually computed by applying KDE on dive predictions derived from HMMs [55–57]. Here, we show that the error in the estimation of dive distributions maps can be divided by two when using deep learning tools rather than HMM tools. In our specific study, HMMs over-estimated the frequency of dives at specific locations (including the vicinity of the colony). Sulids and cormorants spend time bathing near their breeding territories involving vigorous splashing and beating the water with the wings [53]. Such behaviours associated to low speed might be erroneously classified as diving behaviour by state-of-the-art tools which could explain the observed bias. This might also explain why HMM are better at predicting boobies’ than cormorants’ dives because these birds spend more time resting at the surface, which corresponds to low speed patterns without being dives (see Table 1). We may also stress that Cormorants trajectories are characterized by relatively long gaps in the regularly sampled sequence of locations, since these devices do not receive a satellite signal while submerged [26, 58]. This may in turn make more complex the analysis of Cormorants trajectories. In this respect, UNet showed a greater ability to discriminate the resting/bathing behaviours from dives, and a greater robustness to the presence of linearly-interpolated segments. Whereas HMM are mostly driven by fine-scale features (w.r.t. the considered time resolution), UNets exploit a multi-scale analysis of trajectory data and can extract relevant multi-scale information to retrieve dive. Future work could investigate further the key features extracted by UNets. As shown in Fig 3, the performance of the deep networks was closely related to the temporal resolution of the sampled dataset. Whereas HMM did not succeed in exploiting higher-resolution data, UNets led to better performance when the resolution increased. This supports a greater ability of UNets both to deal with potential aliasing effects as well as to exploit fine-scale features. With technological advances in sensor technology, ecologists are able to collect larger amount of data than ever before. We might expect GPS with lower consumption and higher resolution in the future. Such an expected trend would make more critical the exploitation of the proposed deep learning approaches to make the most of the collected high-resolution animal trajectories [13, 59, 60]. When considering neural network approaches, training models which may apply beyond the considered training framework is a key feature, generally referred to as the generalization performance of the trained neural networks. Beyond the evaluation of dive prediction performance on a trajectory dataset, which is independent from the training dataset, the question whether a model trained on a given dataset, e.g. for a given species, colony and time period, may apply to other species, colonies and/or time periods, naturally arises as a key question. Numerous studies in the deep learning literature [61, 62] have highlighted that some neural architectures show relevant generalization properties whereas others may not. Here, we evaluated the generalization performance of the three benchmarked deep networks. Thus we demonstrate the ability of deep networks trained at a colony for one species to also apply to an another colony (of the same ecosystem) for the same species. In our example, Peruvian boobies from Guañape Island did have different foraging strategies from their counterparts from Pescadores island, with trips two times longer and dives slightly longer (see Table 1). However, the UNet reached similar dive prediction performance when applied to Guañape data. This suggests that dive patterns are highly similar between Peruvian boobies from both colonies. We also show the great ability of the CNNet to generalize dive prediction to a seabird of same genus but from a totally distinct ecosystem. When applied to masked boobies trajectories from a Brazilian colony the CNNet trained from Peruvian boobies data obtained an AUC of 0.87 despite the important difference in foraging strategies (Table 3). The same model trained on masked boobies data reached an AUC of 0.93 (Fig 7), suggesting that diving characteristics are slightly different. Masked boobies from the Brazilian colony feed indeed on different preys, and spend way more time resting at the surface (Table 1). As deep networks trained on cormorants unsurprisingly led to less accurate prediction when used to predict boobies dives, we suggest that the CNNet may capture genus-specific features. These results then support the relevance of deep learning schemes as ‘ready-to-use’ tools which could be used by ecologists to predict seabirds dives on new (small) datasets, including when these datasets do not include groundtruthed dive data for a supervised training. To make easier such applications, we share online the different models we trained on the considered datasets (https://github.com/AmedeeRoy/BirdDL/models). Beyond such a direct application, trained models are also of key interest to explore transfer learning strategies, which refer to the ability of exploiting some previously trained models to address a new task or dataset rather than training a new model from scratch. We illustrated how fine-tuned CNNet and UNet models could outperform HMM with smaller training datasets. For instance, the fine-tuned CNNet for the prediction of masked boobies’ dive was able to converge and outperform HMM with a dataset twice as small as the dataset required to reach same performance without fine-tuning (Fig 7). Such a result was even possible by initializing neural networks with the model trained with cormorant data. This further supports the ability of deep networks to generalize their prediction from deep diving seabirds (e.g. cormorants) to plunge divers (e.g. boobies). Fine-tuning is thus particularly relevant when the training dataset may not be sufficiently large to train a model from scratch. While the need of large dataset is often presented as a drawback for supervized techniques, we demonstrated that relatively small datasets (5-10 foraging trips, i.e. 10k to 20k GPS data) may be enough to fine-tune deep networks and outperform state-of-the-art approach to data segmentation. Thus, we expect that our models will be of interest for future work on seabird trajectory segmentation, as they could be used as initializations for fine-tuning procedures.

Acknowledgments We thank all people involved in fieldworks: H. Weimerskirch, K. Delord, C. Barbraud, Y. Tremblay, J. Silva, G. Passuni, C. Boyd, C. Saraux, A. Brunel, J.Jacoby, L. Figuereido and G. T. Nunes. We thank Proabonos for permission to work on Guañape and Pescadores Islands. We thank the Brazilian Ministry of Environment (ICMBio) and Fernando de Noronha’s firemen for the authorization and logistical support to fieldworks in Brazil.

[END]

[1] Url: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1009890

(C) Plos One. "Accelerating the publication of peer-reviewed science."
Licensed under Creative Commons Attribution (CC BY 4.0)
URL: https://creativecommons.org/licenses/by/4.0/


via Magical.Fish Gopher News Feeds:
gopher://magical.fish/1/feeds/news/plosone/