88 research outputs found

    WORKING GROUP ON MACHINE LEARNING IN MARINE SCIENCE (WGMLEARN)

    Get PDF
    publishedVersio

    Repeats and EST analysis for new organisms

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Repeat masking is an important step in the EST analysis pipeline. For new species, genomic knowledge is scarce and good repeat libraries are typically unavailable. In these cases it is common practice to mask against known repeats from other species (i.e., model organisms). There are few studies that investigate the effectiveness of this approach, or attempt to evaluate the different methods for identifying and masking repeats.</p> <p>Results</p> <p>Using zebrafish and medaka as example organisms, we show that accurate repeat masking is an important factor for obtaining a high quality clustering. Furthermore, we show that masking with standard repeat libraries based on curated genomic information from other species has little or no positive effect on the quality of the resulting EST clustering. Library based repeat masking which often constitutes a computational bottleneck in the EST analysis pipeline can therefore be reduced to species specific repeat libraries, or perhaps eliminated entirely. In contrast, substantially improved results can be achived by applying a repeat library derived from a partial reference clustering (e.g., from mapping sequences against a partially sequenced genome).</p> <p>Conclusion</p> <p>Of the methods explored, we find that the best EST clustering is achieved after masking with repeat libraries that are species specific. In the absence of such libraries, library-less masking gives results superior to the current practice of using cross-species, genome-based libraries.</p

    Annotating otoliths with a deep generative model

    Get PDF
    Otoliths are a central information source for fish ecology and stock management, conveying important data about age and other life history for individual fish. Traditionally, interpretation of otoliths has required skilled expert readers, but recently deep learning classification and regression models have been trained to extract fish age from images of otoliths from a variety of species. Despite high accuracy in many cases, the adoption of such models in fisheries management has been slow. One reason may be that the underlying mechanisms the model uses to derive its results from the data are opaque, and this lack of legibility makes it challenging to build sufficient trust in the results. Here, we implement a deep learning model that instead of age predicts the location of annotation marks for each of the annuli. This allows an expert to evaluate the model’s performance in detail. The quality of the annotations was judged by a panel of four expert otolith readers in a double-blinded randomized survey. Using a scale from 1 to 5, the generated marks received an average quality score of 4.22, whereas expert annotations received an average score of 4.33. By counting the marks to determine fish age, we obtained an agreement between expert and model annotations of 64% on our test set, which running the model stochastically increased to 69%. Stochastic sampling yields further benefits, including an explicit measure of the model’s uncertainty, the post hoc likelihood of the different age classes for each otolith, and a set of alternative annotation sequences that highlight the structure of the annuli.publishedVersio

    Inspecting class hierarchies in classification-based metric learning models

    Full text link
    Most classification models treat all misclassifications equally. However, different classes may be related, and these hierarchical relationships must be considered in some classification problems. These problems can be addressed by using hierarchical information during training. Unfortunately, this information is not available for all datasets. Many classification-based metric learning methods use class representatives in embedding space to represent different classes. The relationships among the learned class representatives can then be used to estimate class hierarchical structures. If we have a predefined class hierarchy, the learned class representatives can be assessed to determine whether the metric learning model learned semantic distances that match our prior knowledge. In this work, we train a softmax classifier and three metric learning models with several training options on benchmark and real-world datasets. In addition to the standard classification accuracy, we evaluate the hierarchical inference performance by inspecting learned class representatives and the hierarchy-informed performance, i.e., the classification performance, and the metric learning performance by considering predefined hierarchical structures. Furthermore, we investigate how the considered measures are affected by various models and training options. When our proposed ProxyDR model is trained without using predefined hierarchical structures, the hierarchical inference performance is significantly better than that of the popular NormFace model. Additionally, our model enhances some hierarchy-informed performance measures under the same training options. We also found that convolutional neural networks (CNNs) with random weights correspond to the predefined hierarchies better than random chance.Comment: The main manuscript is 22 pages. The whole paper is 49 pages. The codes for our experiments will be available in https://github.com/hjk92g/Inspecting_Hierarchies_ML . The plankton datasets are available from the Norwegian Marine Data Center (MicroS: https://doi.org/10.21335/NMDC-2102309336 , MicroL: https://doi.org/10.21335/NMDC-573815973 , MesoZ: https://doi.org/10.21335/NMDC-1805578916

    Masking repeats while clustering ESTs

    Get PDF
    A problem in EST clustering is the presence of repeat sequences. To avoid false matches, repeats have to be masked. This can be a time-consuming process, and it depends on available repeat libraries. We present a fast and effective method that aims to eliminate the problems repeats cause in the process of clustering. Unlike traditional methods, repeats are inferred directly from the EST data, we do not rely on any external library of known repeats. This makes the method especially suitable for analysing the ESTs from organisms without good repeat libraries. We demonstrate that the result is very similar to performing standard repeat masking before clustering

    Calcium from salmon and cod bone is well absorbed in young healthy men: a double-blinded randomised crossover design

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Calcium (Ca) - fortified foods are likely to play an important role in helping the consumer achieve an adequate Ca intake, especially for persons with a low intake of dairy products. Fish bones have a high Ca content, and huge quantities of this raw material are available as a by-product from the fish industry. Previously, emphasis has been on producing high quality products from fish by-products by use of bacterial proteases. However, documentation of the nutritional value of the enzymatically rinsed Ca-rich bone fraction remains unexplored. The objective of the present study was to assess the bioavailability of calcium in bones of Atlantic salmon (oily fish) and Atlantic cod (lean fish) in a double-blinded randomised crossover design.</p> <p>Methods</p> <p>Ca absorption was measured in 10 healthy young men using <sup>47</sup>Ca whole body counting after ingestion of a test meal extrinsically labelled with the <sup>47</sup>Ca isotope. The three test meals contained 800 mg of Ca from three different calcium sources: cod bones, salmon bones and control (CaCO<sub>3</sub>).</p> <p>Results</p> <p>Mean Ca absorption (± SEE) from the three different Ca sources were 21.9 ± 1.7%, 22.5 ± 1.7% and 27.4 ± 1.8% for cod bones, salmon bones, and control (CaCO<sub>3</sub>), respectively.</p> <p>Conclusion</p> <p>We conclude that bones from Atlantic salmon and Atlantic cod are suitable as natural Ca sources in e.g. functional foods or as supplements.</p

    A deep learning-based method to identify and count pelagic and mesopelagic fishes from trawl camera images

    Get PDF
    Fish counts and species information can be obtained from images taken within trawls, which enables trawl surveys to operate without extracting fish from their habitat, yields distribution data at fine scale for better interpretation of acoustic results, and can detect fish that are not retained in the catch due to mesh selection. To automate the process of image-based fish detection and identification, we trained a deep learning algorithm (RetinaNet) on images collected from the trawl-mounted Deep Vision camera system. In this study, we focused on the detection of blue whiting, Atlantic herring, Atlantic mackerel, and mesopelagic fishes from images collected in the Norwegian sea. To address the need for large amounts of annotated data to train these models, we used a combination of real and synthetic images, and obtained a mean average precision of 0.845 on a test set of 918 images. Regression models were used to compare predicted fish counts, which were derived from RetinaNet classification of fish in the individual image frames, with catch data collected at 20 trawl stations. We have automatically detected and counted fish from individual images, related these counts to the trawl catches, and discussed how to use this in regular trawl surveys.publishedVersio
    corecore