1,238 research outputs found
Explainable Semantic Medical Image Segmentation with Style
Semantic medical image segmentation using deep learning has recently achieved
high accuracy, making it appealing to clinical problems such as radiation
therapy. However, the lack of high-quality semantically labelled data remains a
challenge leading to model brittleness to small shifts to input data. Most
works require extra data for semi-supervised learning and lack the
interpretability of the boundaries of the training data distribution during
training, which is essential for model deployment in clinical practice. We
propose a fully supervised generative framework that can achieve generalisable
segmentation with only limited labelled data by simultaneously constructing an
explorable manifold during training. The proposed approach creates medical
image style paired with a segmentation task driven discriminator incorporating
end-to-end adversarial training. The discriminator is generalised to small
domain shifts as much as permissible by the training data, and the generator
automatically diversifies the training samples using a manifold of input
features learnt during segmentation. All the while, the discriminator guides
the manifold learning by supervising the semantic content and fine-grained
features separately during the image diversification. After training,
visualisation of the learnt manifold from the generator is available to
interpret the model limits. Experiments on a fully semantic, publicly available
pelvis dataset demonstrated that our method is more generalisable to shifts
than other state-of-the-art methods while being more explainable using an
explorable manifold
Towards Saner Deep Image Registration
With recent advances in computing hardware and surges of deep-learning
architectures, learning-based deep image registration methods have surpassed
their traditional counterparts, in terms of metric performance and inference
time. However, these methods focus on improving performance measurements such
as Dice, resulting in less attention given to model behaviors that are equally
desirable for registrations, especially for medical imaging. This paper
investigates these behaviors for popular learning-based deep registrations
under a sanity-checking microscope. We find that most existing registrations
suffer from low inverse consistency and nondiscrimination of identical pairs
due to overly optimized image similarities. To rectify these behaviors, we
propose a novel regularization-based sanity-enforcer method that imposes two
sanity checks on the deep model to reduce its inverse consistency errors and
increase its discriminative power simultaneously. Moreover, we derive a set of
theoretical guarantees for our sanity-checked image registration method, with
experimental results supporting our theoretical findings and their
effectiveness in increasing the sanity of models without sacrificing any
performance. Our code and models are available at
https://github.com/tuffr5/Saner-deep-registration.Comment: ICCV 202
Unveiling the frontiers of deep learning: innovations shaping diverse domains
Deep learning (DL) enables the development of computer models that are
capable of learning, visualizing, optimizing, refining, and predicting data. In
recent years, DL has been applied in a range of fields, including audio-visual
data processing, agriculture, transportation prediction, natural language,
biomedicine, disaster management, bioinformatics, drug design, genomics, face
recognition, and ecology. To explore the current state of deep learning, it is
necessary to investigate the latest developments and applications of deep
learning in these disciplines. However, the literature is lacking in exploring
the applications of deep learning in all potential sectors. This paper thus
extensively investigates the potential applications of deep learning across all
major fields of study as well as the associated benefits and challenges. As
evidenced in the literature, DL exhibits accuracy in prediction and analysis,
makes it a powerful computational tool, and has the ability to articulate
itself and optimize, making it effective in processing data with no prior
training. Given its independence from training data, deep learning necessitates
massive amounts of data for effective analysis and processing, much like data
volume. To handle the challenge of compiling huge amounts of medical,
scientific, healthcare, and environmental data for use in deep learning, gated
architectures like LSTMs and GRUs can be utilized. For multimodal learning,
shared neurons in the neural network for all activities and specialized neurons
for particular tasks are necessary.Comment: 64 pages, 3 figures, 3 table
Image-based Decision Support Systems: Technical Concepts, Design Knowledge, and Applications for Sustainability
Unstructured data accounts for 80-90% of all data generated, with image data contributing its largest portion. In recent years, the field of computer vision, fueled by deep learning techniques, has made significant advances in exploiting this data to generate value. However, often computer vision models are not sufficient for value creation. In these cases, image-based decision support systems (IB-DSSs), i.e., decision support systems that rely on images and computer vision, can be used to create value by combining human and artificial intelligence. Despite its potential, there is only little work on IB-DSSs so far.
In this thesis, we develop technical foundations and design knowledge for IBDSSs and demonstrate the possible positive effect of IB-DSSs on environmental sustainability. The theoretical contributions of this work are based on and evaluated in a series of artifacts in practical use cases: First, we use technical experiments to demonstrate the feasibility of innovative approaches to exploit images for IBDSSs.
We show the feasibility of deep-learning-based computer vision and identify future research opportunities based on one of our practical use cases. Building on this, we develop and evaluate a novel approach for combining human and artificial intelligence for value creation from image data. Second, we develop design knowledge that can serve as a blueprint for future IB-DSSs. We perform two design science research studies to formulate generalizable principles for purposeful design — one for IB-DSSs and one for the subclass of image-mining-based decision support systems (IM-DSSs). While IB-DSSs can provide decision support based on single images, IM-DSSs are suitable when large amounts of image data are available and required for decision-making. Third, we demonstrate the viability of applying IBDSSs to enhance environmental sustainability by performing life cycle assessments for two practical use cases — one in which the IB-DSS enables a prolonged product lifetime and one in which the IB-DSS facilitates an improvement of manufacturing processes.
We hope this thesis will contribute to expand the use and effectiveness of imagebased decision support systems in practice and will provide directions for future research
Advances and Applications of DSmT for Information Fusion. Collected Works, Volume 5
This fifth volume on Advances and Applications of DSmT for Information Fusion collects theoretical and applied contributions of researchers working in different fields of applications and in mathematics, and is available in open-access. The collected contributions of this volume have either been published or presented after disseminating the fourth volume in 2015 in international conferences, seminars, workshops and journals, or they are new. The contributions of each part of this volume are chronologically ordered.
First Part of this book presents some theoretical advances on DSmT, dealing mainly with modified Proportional Conflict Redistribution Rules (PCR) of combination with degree of intersection, coarsening techniques, interval calculus for PCR thanks to set inversion via interval analysis (SIVIA), rough set classifiers, canonical decomposition of dichotomous belief functions, fast PCR fusion, fast inter-criteria analysis with PCR, and improved PCR5 and PCR6 rules preserving the (quasi-)neutrality of (quasi-)vacuous belief assignment in the fusion of sources of evidence with their Matlab codes.
Because more applications of DSmT have emerged in the past years since the apparition of the fourth book of DSmT in 2015, the second part of this volume is about selected applications of DSmT mainly in building change detection, object recognition, quality of data association in tracking, perception in robotics, risk assessment for torrent protection and multi-criteria decision-making, multi-modal image fusion, coarsening techniques, recommender system, levee characterization and assessment, human heading perception, trust assessment, robotics, biometrics, failure detection, GPS systems, inter-criteria analysis, group decision, human activity recognition, storm prediction, data association for autonomous vehicles, identification of maritime vessels, fusion of support vector machines (SVM), Silx-Furtif RUST code library for information fusion including PCR rules, and network for ship classification.
Finally, the third part presents interesting contributions related to belief functions in general published or presented along the years since 2015. These contributions are related with decision-making under uncertainty, belief approximations, probability transformations, new distances between belief functions, non-classical multi-criteria decision-making problems with belief functions, generalization of Bayes theorem, image processing, data association, entropy and cross-entropy measures, fuzzy evidence numbers, negator of belief mass, human activity recognition, information fusion for breast cancer therapy, imbalanced data classification, and hybrid techniques mixing deep learning with belief functions as well
Serial sectioning PSOCT and 2PM for imaging post-mortem human brain and neurodegeneration
The study of aging and neurodegenerative processes in the human brain necessitates a comprehensive understanding of its myeloarchitectonic, cytoarchitectonic, and vascular structures. While recent computational advances have enabled volumetric reconstruction of the human brain using stained slices, the standard histological processing methods have often led to tissue distortions and loss, making deformation-free reconstruction challenging. Therefore, the development of a multi-scale and volumetric imaging technique that can accurately measure multiple structures within the intact brain would be a significant technical breakthrough. In this work, we present the development of an integrated approach that combines serial sectioning Polarization Sensitive Optical Coherence Tomography (PSOCT) and Two Photon Microscopy (2PM) to provide label-free multi-contrast imaging of human brain tissue. Our method allows for the simultaneous visualization of scattering, birefringence, and autofluorescence properties of the post-mortem human brain. By utilizing high-throughput reconstruction of 4x4x2cm3 sample blocks and simple registration of PSOCT and 2PM images, we enable comprehensive analysis of myelin content, cellular information, and vascular structure. PSOCT provides mesoscopic images and enables quantitative measurement of those brain structures, while 2PM provide complementary microscopic validation and enrichment of cellular and capillary information. This combined approach reveals myelin density and structure maps of the whole brain block and supplies intricate vessel and capillary networks as well as lipofuscin-filled cell soma across cortical regions, providing insights into the myeloarchitectural, cellular and vascular changes associated with neurodegenerative diseases such as Alzheimer's disease (AD) and Chronic Traumatic Encephalopathy (CTE)
Non-iterative Coarse-to-fine Transformer Networks for Joint Affine and Deformable Image Registration
Image registration is a fundamental requirement for medical image analysis.
Deep registration methods based on deep learning have been widely recognized
for their capabilities to perform fast end-to-end registration. Many deep
registration methods achieved state-of-the-art performance by performing
coarse-to-fine registration, where multiple registration steps were iterated
with cascaded networks. Recently, Non-Iterative Coarse-to-finE (NICE)
registration methods have been proposed to perform coarse-to-fine registration
in a single network and showed advantages in both registration accuracy and
runtime. However, existing NICE registration methods mainly focus on deformable
registration, while affine registration, a common prerequisite, is still
reliant on time-consuming traditional optimization-based methods or extra
affine registration networks. In addition, existing NICE registration methods
are limited by the intrinsic locality of convolution operations. Transformers
may address this limitation for their capabilities to capture long-range
dependency, but the benefits of using transformers for NICE registration have
not been explored. In this study, we propose a Non-Iterative Coarse-to-finE
Transformer network (NICE-Trans) for image registration. Our NICE-Trans is the
first deep registration method that (i) performs joint affine and deformable
coarse-to-fine registration within a single network, and (ii) embeds
transformers into a NICE registration framework to model long-range relevance
between images. Extensive experiments with seven public datasets show that our
NICE-Trans outperforms state-of-the-art registration methods on both
registration accuracy and runtime.Comment: Accepted at International Conference on Medical Image Computing and
Computer Assisted Intervention (MICCAI 2023
Evaluation of different segmentation-based approaches for skin disorders from dermoscopic images
Treballs Finals de Grau d'Enginyeria Biomèdica. Facultat de Medicina i Ciències de la Salut. Universitat de Barcelona. Curs: 2022-2023. Tutor/Director: Sala Llonch, Roser, Mata Miquel, Christian, Munuera, JosepSkin disorders are the most common type of cancer in the world and the incident has been lately increasing over the past decades. Even with the most complex and advanced technologies, current image acquisition systems do not permit a reliable identification of the skin lesion by visual examination due to the challenging structure of the malignancy. This promotes the need for the implementation of automatic skin lesion segmentation methods in order to assist in physicians’ diagnostic when determining the lesion's region and to serve as a preliminary step for the classification of the skin lesion. Accurate and precise segmentation is crucial for a rigorous screening and monitoring of the disease's progression.
For the purpose of the commented concern, the present project aims to accomplish a state-of-the-art review about the most predominant conventional segmentation models for skin lesion segmentation, alongside with a market analysis examination. With the rise of automatic segmentation tools, a wide number of algorithms are currently being used, but many are the drawbacks when employing them for dermatological disorders due to the high-level presence of artefacts in the image acquired.
In light of the above, three segmentation techniques have been selected for the completion of the work: level set method, an algorithm combining GrabCut and k-means methods and an intensity automatic algorithm developed by Hospital Sant Joan de Déu de Barcelona research group. In addition, a validation of their performance is conducted for a further implementation of them in clinical training. The proposals, together with the got outcomes, have been accomplished by means of a publicly available skin lesion image database
Is attention all you need in medical image analysis? A review
Medical imaging is a key component in clinical diagnosis, treatment planning
and clinical trial design, accounting for almost 90% of all healthcare data.
CNNs achieved performance gains in medical image analysis (MIA) over the last
years. CNNs can efficiently model local pixel interactions and be trained on
small-scale MI data. The main disadvantage of typical CNN models is that they
ignore global pixel relationships within images, which limits their
generalisation ability to understand out-of-distribution data with different
'global' information. The recent progress of Artificial Intelligence gave rise
to Transformers, which can learn global relationships from data. However, full
Transformer models need to be trained on large-scale data and involve
tremendous computational complexity. Attention and Transformer compartments
(Transf/Attention) which can well maintain properties for modelling global
relationships, have been proposed as lighter alternatives of full Transformers.
Recently, there is an increasing trend to co-pollinate complementary
local-global properties from CNN and Transf/Attention architectures, which led
to a new era of hybrid models. The past years have witnessed substantial growth
in hybrid CNN-Transf/Attention models across diverse MIA problems. In this
systematic review, we survey existing hybrid CNN-Transf/Attention models,
review and unravel key architectural designs, analyse breakthroughs, and
evaluate current and future opportunities as well as challenges. We also
introduced a comprehensive analysis framework on generalisation opportunities
of scientific and clinical impact, based on which new data-driven domain
generalisation and adaptation methods can be stimulated
- …