65 research outputs found

    Accurate deep-learning estimation of chlorophyll-a concentration from the spectral particulate beam-attenuation coefficient

    Get PDF
    Different techniques exist for determining chlorophyll-a concentration as a proxy of phytoplankton abundance. In this study, a novel method based on the spectral particulate beam-attenuation coefficient (cp) was developed to estimate chlorophyll-a concentrations in oceanic waters. A multi-layer perceptron deep neural network was trained to exploit the spectral features present in cp around the chlorophyll a absorption peak in the red spectral region. Results show that the model was successful at accurately retrieving chlorophyll-a concentrations using cp in three red spectral bands,irrespective of time or location and over a wide range of chlorophyll-a concentrations

    Estimates of Water-Column Nutrient Concentrations and Carbonate System Parameters in the Global Ocean: A Novel Approach Based on Neural Networks

    Get PDF
    A neural network-based method (CANYON: CArbonate system and Nutrients concentration from hYdrological properties and Oxygen using a Neural-network) was developed to estimate water-column (i.e., from surface to 8,000 m depth) biogeochemically relevant variables in the Global Ocean. These are the concentrations of three nutrients [nitrate (NO3−), phosphate (PO43−), and silicate (Si(OH)4)] and four carbonate system parameters [total alkalinity (AT), dissolved inorganic carbon (CT), pH (pHT), and partial pressure of CO2 (pCO2)], which are estimated from concurrent in situ measurements of temperature, salinity, hydrostatic pressure, and oxygen (O2) together with sampling latitude, longitude, and date. Seven neural-networks were developed using the GLODAPv2 database, which is largely representative of the diversity of open-ocean conditions, hence making CANYON potentially applicable to most oceanic environments. For each variable, CANYON was trained using 80 % randomly chosen data from the whole database (after eight 10° × 10° zones removed providing an “independent data-set” for additional validation), the remaining 20 % data were used for the neural-network test of validation. Overall, CANYON retrieved the variables with high accuracies (RMSE): 1.04 μmol kg−1 (NO3−), 0.074 μmol kg−1 (PO43−), 3.2 μmol kg−1 (Si(OH)4), 0.020 (pHT), 9 μmol kg−1 (AT), 11 μmol kg−1 (CT) and 7.6 % (pCO2) (30 μatm at 400 μatm). This was confirmed for the eight independent zones not included in the training process. CANYON was also applied to the Hawaiian Time Series site to produce a 22 years long simulated time series for the above seven variables. Comparison of modeled and measured data was also very satisfactory (RMSE in the order of magnitude of RMSE from validation test). CANYON is thus a promising method to derive distributions of key biogeochemical variables. It could be used for a variety of global and regional applications ranging from data quality control to the production of datasets of variables required for initialization and validation of biogeochemical models that are difficult to obtain. In particular, combining the increased coverage of the global Biogeochemical-Argo program, where O2 is one of the core variables now very accurately measured, with the CANYON approach offers the fascinating perspective of obtaining large-scale estimates of key biogeochemical variables with unprecedented spatial and temporal resolutions. The Matlab and R codes of the proposed algorithms are provided as Supplementary Material

    An Alternative to Static Climatologies: Robust Estimation of Open Ocean CO2 Variables and Nutrient Concentrations From T, S, and O2 Data Using Bayesian Neural Networks

    Get PDF
    This work presents two new methods to estimate oceanic alkalinity (AT), dissolved inorganic carbon (CT), pH, and pCO2 from temperature, salinity, oxygen, and geolocation data. “CANYON-B” is a Bayesian neural network mapping that accurately reproduces GLODAPv2 bottle data and the biogeochemical relations contained therein. “CONTENT” combines and refines the four carbonate system variables to be consistent with carbonate chemistry. Both methods come with a robust uncertainty estimate that incorporates information from the local conditions. They are validated against independent GO-SHIP bottle and sensor data, and compare favorably to other state-of-the-art mapping methods. As “dynamic climatologies” they show comparable performance to classical climatologies on large scales but a much better representation on smaller scales (40–120 d, 500–1,500 km) compared to in situ data. The limits of these mappings are explored with pCO2 estimation in surface waters, i.e., at the edge of the domain with high intrinsic variability. In highly productive areas, there is a tendency for pCO2 overestimation due to decoupling of the O2 and C cycles by air-sea gas exchange, but global surface pCO2 estimates are unbiased compared to a monthly climatology. CANYON-B and CONTENT are highly useful as transfer functions between components of the ocean observing system (GO-SHIP repeat hydrography, BGC-Argo, underway observations) and permit the synergistic use of these highly complementary systems, both in spatial/temporal coverage and number of observations. Through easily and robotically-accessible observations they allow densification of more difficult-to-observe variables (e.g., 15 times denser AT and CT compared to direct measurements). At the same time, they give access to the complete carbonate system. This potential is demonstrated by an observation-based global analysis of the Revelle buffer factor, which shows a significant, high latitude-intensified increase between +0.1 and +0.4 units per decade. This shows the utility that such transfer functions with realistic uncertainty estimates provide to ocean biogeochemistry and global climate change research. In addition, CANYON-B provides robust and accurate estimates of nitrate, phosphate, and silicate. Matlab and R code are available at https://github.com/HCBScienceProducts/. Introductio

    A neural network-based method for merging ocean color and Argo data to extend surface bio-optical properties to depth: Retrieval of the particulate backscattering coefficient

    Get PDF
    The present study proposes a novel method that merges satellite ocean color bio-optical products with Argo temperature-salinity profiles to infer the vertical distribution of the particulate backscattering coefficient (bbp). This neural network-based method (SOCA-BBP for Satellite Ocean-Color merged with Argo data to infer the vertical distribution of the Particulate Backscattering coefficient) uses three main input components: (1) satellite-based surface estimates of bbp and chlorophyll a concentration matched up in space and time with (2) depth-resolved physical properties derived from temperature-salinity profiles measured by Argo profiling floats and (3) the day of the year of the considered satellite-Argo matchup. The neural network is trained and validated using a database including 4725 simultaneous profiles of temperature-salinity and bio-optical properties collected by Bio-Argo floats, with concomitant satellite-derived products. The Bio-Argo profiles are representative of the global open-ocean in terms of oceanographic conditions, making the proposed method applicable to most open-ocean environments. SOCA-BBP is validated using 20% of the entire database (global error of 21%). We present additional validation results based on two other independent data sets acquired (1) by four Bio-Argo floats deployed in major oceanic basins, not represented in the database used to train the method; and (2) during an AMT (Atlantic Meridional Transect) field cruise in 2009. These validation tests based on two fully independent data sets indicate the robustness of the predicted vertical distribution of bbp. To illustrate the potential of the method, we merged monthly climatological Argo profiles with ocean color products to produce a depth-resolved climatology of bbp for the global ocean

    An Alternative to Static Climatologies: Robust Estimation of Open Ocean CO2 Variables and Nutrient Concentrations From T, S, and O2 Data Using Bayesian Neural Networks

    Get PDF
    This work presents two new methods to estimate oceanic alkalinity (AT), dissolved inorganic carbon (CT), pH, and pCO2 from temperature, salinity, oxygen, and geolocation data. “CANYON-B” is a Bayesian neural network mapping that accurately reproduces GLODAPv2 bottle data and the biogeochemical relations contained therein. “CONTENT” combines and refines the four carbonate system variables to be consistent with carbonate chemistry. Both methods come with a robust uncertainty estimate that incorporates information from the local conditions. They are validated against independent GO-SHIP bottle and sensor data, and compare favorably to other state-of-the-art mapping methods. As “dynamic climatologies” they show comparable performance to classical climatologies on large scales but a much better representation on smaller scales (40–120 d, 500–1,500 km) compared to in situ data. The limits of these mappings are explored with pCO2 estimation in surface waters, i.e., at the edge of the domain with high intrinsic variability. In highly productive areas, there is a tendency for pCO2 overestimation due to decoupling of the O2 and C cycles by air-sea gas exchange, but global surface pCO2 estimates are unbiased compared to a monthly climatology. CANYON-B and CONTENT are highly useful as transfer functions between components of the ocean observing system (GO-SHIP repeat hydrography, BGC-Argo, underway observations) and permit the synergistic use of these highly complementary systems, both in spatial/temporal coverage and number of observations. Through easily and robotically-accessible observations they allow densification of more difficult-to-observe variables (e.g., 15 times denser AT and CT compared to direct measurements). At the same time, they give access to the complete carbonate system. This potential is demonstrated by an observation-based global analysis of the Revelle buffer factor, which shows a significant, high latitude-intensified increase between +0.1 and +0.4 units per decade. This shows the utility that such transfer functions with realistic uncertainty estimates provide to ocean biogeochemistry and global climate change research. In addition, CANYON-B provides robust and accurate estimates of nitrate, phosphate, and silicate. Matlab and R code are available at https://github.com/HCBScienceProducts/

    Using machine learning and Biogeochemical-Argo (BGC-Argo) floats to assess biogeochemical models and optimize observing system design

    Get PDF
    Numerical models of ocean biogeochemistry are becoming the major tools used to detect and predict the impact of climate change on marine resources and to monitor ocean health. However, with the continuous improvement of model structure and spatial resolution, incorporation of these additional degrees of freedom into fidelity assessment has become increasingly challenging. Here, we propose a new method to provide information on the model predictive skill in a concise way. The method is based on the conjoint use of a k-means clustering technique, assessment metrics, and Biogeochemical-Argo (BGC-Argo) observations. The k-means algorithm and the assessment metrics reduce the number of model data points to be evaluated. The metrics evaluate either the model state accuracy or the skill of the model with respect to capturing emergent properties, such as the deep chlorophyll maximums and oxygen minimum zones. The use of BGC-Argo observations as the sole evaluation data set ensures the accuracy of the data, as it is a homogenous data set with strict sampling methodologies and data quality control procedures. The method is applied to the Global Ocean Biogeochemistry Analysis and Forecast system of the Copernicus Marine Service. The model performance is evaluated using the model efficiency statistical score, which compares the model–observation misfit with the variability in the observations and, thus, objectively quantifies whether the model outperforms the BGC-Argo climatology. We show that, overall, the model surpasses the BGC-Argo climatology in predicting pH, dissolved inorganic carbon, alkalinity, oxygen, nitrate, and phosphate in the mesopelagic and the mixed layers as well as silicate in the mesopelagic layer. However, there are still areas for improvement with respect to reducing the model–data misfit for certain variables such as silicate, pH, and the partial pressure of CO2 in the mixed layer as well as chlorophyll-a-related, oxygen-minimum-zone-related, and particulate-organic-carbon-related metrics. The method proposed here can also aid in refining the design of the BGC-Argo network, in particular regarding the regions in which BGC-Argo observations should be enhanced to improve the model accuracy via the assimilation of BGC-Argo data or process-oriented assessment studies. We strongly recommend increasing the number of observations in the Arctic region while maintaining the existing high-density of observations in the Southern Oceans. The model error in these regions is only slightly less than the variability observed in BGC-Argo measurements. Our study illustrates how the synergic use of modeling and BGC-Argo data can both provide information about the performance of models and improve the design of observing systems.</p

    Machine learning in marine ecology: an overview of techniques and applications

    Get PDF
    Machine learning covers a large set of algorithms that can be trained to identify patterns in data. Thanks to the increase in the amount of data and computing power available, it has become pervasive across scientific disciplines. We first highlight why machine learning is needed in marine ecology. Then we provide a quick primer on machine learning techniques and vocabulary. We built a database of ∼1000 publications that implement such techniques to analyse marine ecology data. For various data types (images, optical spectra, acoustics, omics, geolocations, biogeochemical profiles, and satellite imagery), we present a historical perspective on applications that proved influential, can serve as templates for new work, or represent the diversity of approaches. Then, we illustrate how machine learning can be used to better understand ecological systems, by combining various sources of marine data. Through this coverage of the literature, we demonstrate an increase in the proportion of marine ecology studies that use machine learning, the pervasiveness of images as a data source, the dominance of machine learning for classification-type problems, and a shift towards deep learning for all data types. This overview is meant to guide researchers who wish to apply machine learning methods to their marine datasets.Machine learning in marine ecology: an overview of techniques and applicationspublishedVersio

    Introduction to the French GEOTRACES North Atlantic Transect (GA01): GEOVIDE cruise

    Get PDF
    The GEOVIDE cruise, a collaborative project within the framework of the international GEOTRACES programme, was conducted along the French-led section in the North Atlantic Ocean (Section GA01), between 15 May and 30 June 2014. In this special issue (https://www.biogeosciences.net/special_issue900.html), results from GEOVIDE, including physical oceanography and trace element and isotope cyclings, are presented among 18 articles. Here, the scientific context, project objectives, and scientific strategy of GEOVIDE are provided, along with an overview of the main results from the articles published in the special issue

    Machine learning in marine ecology: an overview of techniques and applications

    Get PDF
    Machine learning covers a large set of algorithms that can be trained to identify patterns in data. Thanks to the increase in the amount of data and computing power available, it has become pervasive across scientific disciplines. We first highlight why machine learning is needed in marine ecology. Then we provide a quick primer on machine learning techniques and vocabulary. We built a database of ∼1000 publications that implement such techniques to analyse marine ecology data. For various data types (images, optical spectra, acoustics, omics, geolocations, biogeochemical profiles, and satellite imagery), we present a historical perspective on applications that proved influential, can serve as templates for new work, or represent the diversity of approaches. Then, we illustrate how machine learning can be used to better understand ecological systems, by combining various sources of marine data. Through this coverage of the literature, we demonstrate an increase in the proportion of marine ecology studies that use machine learning, the pervasiveness of images as a data source, the dominance of machine learning for classification-type problems, and a shift towards deep learning for all data types. This overview is meant to guide researchers who wish to apply machine learning methods to their marine datasets

    Novel learning-based methods to derive biogeochemical parameters from profiling floats

    No full text
    International audienc
    corecore