30 research outputs found
The Medical Segmentation Decathlon
International challenges have become the de facto standard for comparative assessment of image analysis algorithms. Although segmentation is the most widely investigated medical image processing task, the various challenges have been organized to focus only on specific clinical tasks. We organized the Medical Segmentation Decathlon (MSD)—a biomedical image analysis challenge, in which algorithms compete in a multitude of both tasks and modalities to investigate the hypothesis that a method capable of performing well on multiple tasks will generalize well to a previously unseen task and potentially outperform a custom-designed solution. MSD results confirmed this hypothesis, moreover, MSD winner continued generalizing well to a wide range of other clinical problems for the next two years. Three main conclusions can be drawn from this study: (1) state-of-the-art image segmentation algorithms generalize well when retrained on unseen tasks; (2) consistent algorithmic performance across multiple tasks is a strong surrogate of algorithmic generalizability; (3) the training of accurate AI segmentation models is now commoditized to scientists that are not versed in AI model training
Light Microsopy Module, International Space Station Premier Automated Microscope
The Light Microscopy Module (LMM) was launched to the International Space Station (ISS) in 2009 and began science operations in 2010. It continues to support Physical and Biological scientific research on ISS. During 2015, if all goes as planned, five experiments will be completed: [1] Advanced Colloids Experiments with a manual sample base -3 (ACE-M-3), [2] the Advanced Colloids Experiment with a Heated Base -1 (ACE-H-1), [3] (ACE-H-2), [4] the Advanced Plant Experiment -03 (APEX-03), and [5] the Microchannel Diffusion Experiment (MDE). Preliminary results, along with an overview of present and future LMM capabilities will be presented; this includes details on the planned data imaging processing and storage system, along with the confocal upgrade to the core microscope. [1] New York University: Paul Chaikin, Andrew Hollingsworth, and Stefano Sacanna, [2] University of Pennsylvania: Arjun Yodh and Matthew Gratale, [3] a consortium of universities from the State of Kentucky working through the Experimental Program to Stimulate Competitive Research (EPSCoR): Stuart Williams, Gerold Willing, Hemali Rathnayake, et al., [4] from the University of Florida and CASIS: Anna-Lisa Paul and Rob Ferl, and [5] from the Methodist Hospital Research Institute from CASIS: Alessandro Grattoni and Giancarlo Canavese
Alcator C-Mod: research in support of ITER and steps beyond
This paper presents an overview of recent highlights from research on Alcator C-Mod. Significant progress has been made across all research areas over the last two years, with particular emphasis on divertor physics and power handling, plasma–material interaction studies, edge localized mode-suppressed pedestal dynamics, core transport and turbulence, and RF heating and current drive utilizing ion cyclotron and lower hybrid tools. Specific results of particular relevance to ITER include: inner wall SOL transport studies that have led, together with results from other experiments, to the change of the detailed shape of the inner wall in ITER; runaway electron studies showing that the critical electric field required for runaway generation is much higher than predicted from collisional theory; core tungsten impurity transport studies reveal that tungsten accumulation is naturally avoided in typical C-Mod conditions.United States. Department of Energy (DE-FC02-99ER54512-CMOD)United States. Department of Energy (DE-AC02-09CH11466)United States. Department of Energy (DE-FG02-96ER-54373)United States. Department of Energy (DE-FG02-94ER54235
The Medical Segmentation Decathlon
International challenges have become the de facto standard for comparative
assessment of image analysis algorithms given a specific task. Segmentation is
so far the most widely investigated medical image processing task, but the
various segmentation challenges have typically been organized in isolation,
such that algorithm development was driven by the need to tackle a single
specific clinical problem. We hypothesized that a method capable of performing
well on multiple tasks will generalize well to a previously unseen task and
potentially outperform a custom-designed solution. To investigate the
hypothesis, we organized the Medical Segmentation Decathlon (MSD) - a
biomedical image analysis challenge, in which algorithms compete in a multitude
of both tasks and modalities. The underlying data set was designed to explore
the axis of difficulties typically encountered when dealing with medical
images, such as small data sets, unbalanced labels, multi-site data and small
objects. The MSD challenge confirmed that algorithms with a consistent good
performance on a set of tasks preserved their good average performance on a
different set of previously unseen tasks. Moreover, by monitoring the MSD
winner for two years, we found that this algorithm continued generalizing well
to a wide range of other clinical problems, further confirming our hypothesis.
Three main conclusions can be drawn from this study: (1) state-of-the-art image
segmentation algorithms are mature, accurate, and generalize well when
retrained on unseen tasks; (2) consistent algorithmic performance across
multiple tasks is a strong surrogate of algorithmic generalizability; (3) the
training of accurate AI segmentation models is now commoditized to non AI
experts
Metrics reloaded: Pitfalls and recommendations for image analysis validation
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly in automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international expert consortium created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. The framework was developed in a multi-stage Delphi process and is based on the novel concept of a problem fingerprint - a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), data set and algorithm output. Based on the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as a classification task at image, object or pixel level, namely image-level classification, object detection, semantic segmentation, and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool, which also provides a point of access to explore weaknesses, strengths and specific recommendations for the most common validation metrics. The broad applicability of our framework across domains is demonstrated by an instantiation for various biological and medical image analysis use cases
Common Limitations of Image Processing Metrics:A Picture Story
While the importance of automatic image analysis is continuously increasing,
recent meta-research revealed major flaws with respect to algorithm validation.
Performance metrics are particularly key for meaningful, objective, and
transparent performance assessment and validation of the used automatic
algorithms, but relatively little attention has been given to the practical
pitfalls when using specific metrics for a given image analysis task. These are
typically related to (1) the disregard of inherent metric properties, such as
the behaviour in the presence of class imbalance or small target structures,
(2) the disregard of inherent data set properties, such as the non-independence
of the test cases, and (3) the disregard of the actual biomedical domain
interest that the metrics should reflect. This living dynamically document has
the purpose to illustrate important limitations of performance metrics commonly
applied in the field of image analysis. In this context, it focuses on
biomedical image analysis problems that can be phrased as image-level
classification, semantic segmentation, instance segmentation, or object
detection task. The current version is based on a Delphi process on metrics
conducted by an international consortium of image analysis experts from more
than 60 institutions worldwide.Comment: This is a dynamic paper on limitations of commonly used metrics. The
current version discusses metrics for image-level classification, semantic
segmentation, object detection and instance segmentation. For missing use
cases, comments or questions, please contact [email protected] or
[email protected]. Substantial contributions to this document will be
acknowledged with a co-authorshi
Understanding metric-related pitfalls in image analysis validation
Validation metrics are key for the reliable tracking of scientific progress
and for bridging the current chasm between artificial intelligence (AI)
research and its translation into practice. However, increasing evidence shows
that particularly in image analysis, metrics are often chosen inadequately in
relation to the underlying research problem. This could be attributed to a lack
of accessibility of metric-related knowledge: While taking into account the
individual strengths, weaknesses, and limitations of validation metrics is a
critical prerequisite to making educated choices, the relevant knowledge is
currently scattered and poorly accessible to individual researchers. Based on a
multi-stage Delphi process conducted by a multidisciplinary expert consortium
as well as extensive community feedback, the present work provides the first
reliable and comprehensive common point of access to information on pitfalls
related to validation metrics in image analysis. Focusing on biomedical image
analysis but with the potential of transfer to other fields, the addressed
pitfalls generalize across application domains and are categorized according to
a newly created, domain-agnostic taxonomy. To facilitate comprehension,
illustrations and specific examples accompany each pitfall. As a structured
body of information accessible to researchers of all levels of expertise, this
work enhances global comprehension of a key topic in image analysis validation.Comment: Shared first authors: Annika Reinke, Minu D. Tizabi; shared senior
authors: Paul F. J\"ager, Lena Maier-Hei
Using the Light Microscopy Module (LMM) on the International Space Station (ISS), The Advanced Colloids Experiment (ACE) and MacroMolecular Biophysics (MMB)
The Light Microscopy Module (LMM) was launched to the International Space Station (ISS) in 2009 and began science operations in 2010. It continues to support Physical and Biological scientific research on ISS. During 2016, if all goes as planned, three experiments will be completed: [1] Advanced Colloids Experiments with Heated base-2 (ACE-H2) and [2] Advanced Colloids Experiments with Temperature control (ACE-T1). Preliminary results, along with an overview of present and future LMM capabilities will be presented; this includes details on the planned data imaging processing and storage system, along with the confocal upgrade to the core microscope. [1] a consortium of universities from the State of Kentucky working through the Experimental Program to Stimulate Competitive Research (EPSCoR): Stuart Williams, Gerold Willing, Hemali Rathnayake, et al. and [2] from Chungnam National University, Daejeon, S. Korea: Chang-Soo Lee, et al
Recommended from our members
The impact of guilt and type of compliance-gaining message on compliance
Consistent with Cialdini's Negative State Relief Model it has been established repeatedly that targets of compliance-gaining attempts comply with a request to help more frequently when those targets feel guilty than when they do not feel guilty. Expanding upon this result it was predicted that to the extent that a compliance-gaining message serves as a cue to link compliance with the restoration of positive or neutral affect, the compliance rate would vary. Building upon this reasoning it was hypothesized that a positive self-feeling compliance-gaining message would be more effective in producing target compliance than would a direct request message when the target felt guilty, but that the opposite relationship would hold when the target was not feeling guilty. An experiment in which both guilt and message type were varied was designed to test this hypothesis. In the main, the data were consistent with these predictions