
    Automatic Fish Age Determination across Different Otolith Image Labs Using Domain Adaptation

    The age determination of fish is fundamental to marine resource management. The task is commonly done through manual analysis of otoliths by human experts. Otolith images of Greenland halibut acquired by the Institute of Marine Research (Norway) were recently used to train a convolutional neural network (CNN) for automatically predicting fish age, opening the way for deep learning (DL) to reduce the human effort and expertise required. In this study, we demonstrate that applying a CNN model trained on images from one lab (in Norway) does not give suitable performance when predicting fish ages from otolith images of the same species from another lab (in Iceland). This is due to a problem known as dataset shift, in which the source data, i.e., the dataset the model was trained on, have different characteristics from the dataset seen at test time, here denoted the target data. We further demonstrate that this problem can be handled using domain adaptation, whereby an existing model trained in the source domain is adapted to perform well in the target domain without requiring extra annotation effort. We investigate four approaches: (i) simple adaptation via image standardization, (ii) adversarial generative adaptation, (iii) adversarial discriminative adaptation and (iv) self-supervised adaptation. The results show that performance varies substantially between the methods, with adversarial discriminative and self-supervised adaptation being the best approaches. Without domain adaptation, the root mean squared error (RMSE) and coefficient of variation (CV) on the Icelandic dataset are as high as 5.12 years and 28.6%, respectively, whereas with self-supervised domain adaptation the RMSE and CV are reduced to 1.94 years and 11.1%. We conclude that careful consideration must be given before DL-based predictors are applied for large-scale inference. Domain adaptation is nevertheless a promising solution for handling dataset shift across image labs.
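    As a rough illustration of the adversarial discriminative approach mentioned above, the sketch below aligns target-lab otolith features with a frozen source-lab encoder via a domain discriminator (ADDA-style). It assumes PyTorch, hypothetical source_loader/target_loader objects yielding (image, label) batches, and pre-built encoder/discriminator modules with compatible feature sizes; it is a minimal sketch under those assumptions, not the authors' implementation.

        import torch
        import torch.nn as nn

        def adapt_target_encoder(source_encoder, target_encoder, discriminator,
                                 source_loader, target_loader, epochs=10, lr=1e-4):
            """Align target features with a frozen source encoder via a domain discriminator."""
            bce = nn.BCEWithLogitsLoss()
            opt_t = torch.optim.Adam(target_encoder.parameters(), lr=lr)
            opt_d = torch.optim.Adam(discriminator.parameters(), lr=lr)
            source_encoder.eval()  # the source encoder stays fixed throughout

            for _ in range(epochs):
                for (x_s, _), (x_t, _) in zip(source_loader, target_loader):
                    # 1) Train the discriminator to separate source features (label 1) from target features (label 0).
                    with torch.no_grad():
                        f_s = source_encoder(x_s)
                    f_t = target_encoder(x_t)
                    logits = discriminator(torch.cat([f_s, f_t.detach()]))
                    labels = torch.cat([torch.ones(len(f_s), 1), torch.zeros(len(f_t), 1)])
                    loss_d = bce(logits, labels)
                    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

                    # 2) Train the target encoder so its features look source-like to the discriminator.
                    loss_t = bce(discriminator(target_encoder(x_t)), torch.ones(len(x_t), 1))
                    opt_t.zero_grad(); loss_t.backward(); opt_t.step()
            return target_encoder

    After adaptation, an age-regression head trained on the source domain can be applied on top of the adapted target encoder, so no extra target-domain annotation is needed.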

    Machine Learning + Marine Science: Critical Role of Partnerships in Norway

    In this essay, we review some recent advances in developing machine learning (ML) methods for marine science applications in Norway. We focus mostly on deep learning (DL) methods and review the challenges we have faced in the process, including data preparation, the (lack of) labelled training data, and interpretability. We also present the partnerships that have been formed between e-science institutions and marine science institutions in Norway. These partnerships have been instrumental in moving this effort forward and have been fuelled by grants from the Norwegian Research Council. The latest additions to this collaboration are the recent Centres for Research-based Innovation in Marine Acoustic Abundance Estimation and Backscatter Classification (CRIMAC) and Visual Intelligence (VI).

    Semi-supervised target classification in multi-frequency echosounder data

    Acoustic target classification in multi-frequency echosounder data is of major interest for marine ecosystem and fishery management, since it can potentially be used to estimate the abundance or biomass of species. A key problem of current methods is their heavy dependence on manual categorization of data samples. As a solution, we propose a novel semi-supervised deep learning method that leverages a few annotated data samples together with vast amounts of unannotated data samples, all in a single model. Specifically, two inter-connected objectives, namely a clustering objective and a classification objective, optimize one shared convolutional neural network in an alternating manner: the clustering objective exploits the underlying structure of all data, both annotated and unannotated, while the classification objective enforces consistency with the given classes using the few annotated data samples. We evaluate our classification method using echosounder data from the sandeel case study in the North Sea. In the semi-supervised setting with only a tenth of the training data annotated, our method achieves 67.6% accuracy, outperforming a conventional semi-supervised method by 7.0 percentage points. When applying the proposed method in a fully supervised setup, we achieve 74.7% accuracy, surpassing the standard supervised deep learning method by 4.7 percentage points.
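    The alternating optimization described above could be sketched roughly as follows, assuming PyTorch, one shared CNN classifier, a hypothetical labelled_loader yielding (input, label) batches and an unlabelled_loader yielding input batches only. The clustering term used here (confident per-sample predictions, balanced average prediction) is a generic stand-in, not necessarily the paper's clustering objective.

        import torch
        import torch.nn.functional as F

        def train_semi_supervised(model, labelled_loader, unlabelled_loader, epochs=20, lr=1e-3):
            opt = torch.optim.Adam(model.parameters(), lr=lr)
            for _ in range(epochs):
                # Classification step: cross-entropy on the few annotated samples.
                for x, y in labelled_loader:
                    loss_cls = F.cross_entropy(model(x), y)
                    opt.zero_grad(); loss_cls.backward(); opt.step()
                # Clustering step on all samples: push each prediction towards one class
                # while keeping the average prediction spread over the classes.
                for x in unlabelled_loader:
                    p = F.softmax(model(x), dim=1)
                    sample_entropy = -(p * p.clamp_min(1e-8).log()).sum(dim=1).mean()
                    marginal = p.mean(dim=0)
                    marginal_entropy = -(marginal * marginal.clamp_min(1e-8).log()).sum()
                    loss_clu = sample_entropy - marginal_entropy
                    opt.zero_grad(); loss_clu.backward(); opt.step()
            return model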

    Acoustic classification in multifrequency echosounder data using deep convolutional neural networks

    Acoustic target classification is the process of assigning observed acoustic backscattering intensity to an acoustic category. A deep learning strategy for acoustic target classification is developed, using a convolutional network consisting of an encoder and a decoder, which allows the network to use both pixel-level information and more abstract features. The network can learn features directly from data, and the learned feature space may include both frequency response and school morphology. We tested the method on multifrequency data collected between 2007 and 2018 during the Norwegian sandeel survey. The network was able to distinguish between sandeel schools, schools of other species, and background pixels (including the seabed) in new survey data with an F1 score of 0.87 when tested against manually labelled schools. The network separated schools of sandeel and schools of other species with an F1 score of 0.94. A traditional school classification algorithm obtained substantially lower F1 scores (0.77 and 0.82) when tested against the manually labelled schools. To train the network, it was necessary to develop sampling and preprocessing strategies to account for unbalanced classes, inaccurate annotations, and biases in the training data. This is a step towards a method that can be applied across a range of acoustic trawl surveys.
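    The F1 scores reported above compare predicted pixel categories against manually labelled ones. A minimal per-class F1 computation over an echogram, assuming integer class maps with illustrative indices (0 = background, 1 = sandeel, 2 = other species) that are not taken from the paper, could look like this:

        import numpy as np

        def f1_per_class(pred, target, n_classes=3):
            """Per-class F1 over all pixels of a predicted vs. manually labelled echogram."""
            scores = {}
            for c in range(n_classes):
                tp = np.sum((pred == c) & (target == c))
                fp = np.sum((pred == c) & (target != c))
                fn = np.sum((pred != c) & (target == c))
                precision = tp / (tp + fp) if (tp + fp) else 0.0
                recall = tp / (tp + fn) if (tp + fn) else 0.0
                scores[c] = (2 * precision * recall / (precision + recall)
                             if (precision + recall) else 0.0)
            return scores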

    A Tale of Two Transcriptions: Machine-Assisted Transcription of Historical Sources

    This article is part of the "Norwegian Historical Population Register" project financed by the Norwegian Research Council (grant # 225950) and the Advanced Grant project "Five Centuries of Marriages" (2011-2016) funded by the European Research Council (# ERC 2010-AdG_20100407). The article explains how two projects implement semi-automated transcription routines: for census sheets in Norway and marriage protocols from Barcelona. The Spanish system was created to transcribe the marriage license books from 1451 to 1905 for the Barcelona area, one of the world's longest series of preserved vital records. Thus, in the project "Five Centuries of Marriages" (5CofM) at the Autonomous University of Barcelona's Center for Demographic Studies, the Barcelona Historical Marriage Database has been built. More than 600,000 records were transcribed by 150 transcribers working online. The Norwegian material is cross-sectional, as it is the 1891 census, recorded on one sheet per person. This format, and the underlining of keywords for several variables, made it more feasible to semi-automate data entry than when many persons are listed on the same page. While Optical Character Recognition (OCR) for printed text is scientifically mature, computer vision research is now focused on more difficult problems such as handwriting recognition. In the marriage project, document analysis methods have been proposed to automatically recognize the marriage licenses. Fully automatic recognition is still a challenge, but some promising results have been obtained. In Spain, Norway and elsewhere, the source material is available as scanned pictures on the Internet, opening up the possibility for further international cooperation on automating the transcription of historical source materials. As in projects to digitize printed materials, the optimal solution for handwritten sources is likely to be a combination of manual transcription and machine-assisted recognition.