Search CORE

9,653 research outputs found

Semi-parametric analysis of multi-rater data

Author: Girolami M.
Polajnar T.
Rogers S.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/04/2009
Field of study

Datasets that are subjectively labeled by a number of experts are becoming more common in tasks such as biological text annotation where class definitions are necessarily somewhat subjective. Standard classification and regression models are not suited to multiple labels and typically a pre-processing step (normally assigning the majority class) is performed. We propose Bayesian models for classification and ordinal regression that naturally incorporate multiple expert opinions in defining predictive distributions. The models make use of Gaussian process priors, resulting in great flexibility and particular suitability to text based problems where the number of covariates can be far greater than the number of data instances. We show that using all labels rather than just the majority improves performance on a recent biological dataset

UCL Discovery

Enlighten

Semi-parametric analysis of multi-rater data

Author: A. Gelman
A.P. Dawid
C.K. Williams
J. Albert
J.H. Albert
J.S. Uebersax
M. Girolami
M.K. Cowles
Mark Girolami
S. Rogers
Simon Rogers
Tamara Polajnar
V. Johnson
V.E. Johnson
W. Chu
W.J. Wilbur
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Semi-automated stereoradiographic upper limb 3D reconstructions using a combined parametric and statistical model: a preliminary study

Author: AUBERT Benjamin
CHAÏBI Yasmina
CLAIREMIDI Alain
DE GUISE Jacques
FONTAINE Christian
GUERARD Sandra
LEBAILLY Fréderic
LIMA Lucas
SKALLI Wafa
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

PURPOSE: Quantitative assessment of 3D clinical indices may be crucial for elbow surgery planning. 3D parametric modeling from bi-planar radiographs was successfully proposed for spine and lower limb clinical investigation as an alternative for CT-scan. The aim of this study was to adapt this method to the upper limb with a preliminary validation. METHODS: CT-scan 3D models of humerus, radius and ulna were obtained from 20 cadaveric upper limbs and yielded parametric models made of geometric primitives. Primitives were defined by descriptor parameters (diameters, angles...) and correlations between these descriptors were found. Using these correlations, a semi-automated reconstruction method of humerus using bi-planar radiographs was achieved: a 3D personalized parametric model was built, from which clinical parameters were computed [orientation and projections on bone surface of trochlea sulcus to capitulum (CTS) axis, trochlea sulcus anterior offset and width of distal humeral epiphysis]. This method was evaluated by accuracy compared to CT-scan and reproducibility. RESULTS: Points-to-surface mean distance was 0.9 mm (2 RMS = 2.5 mm). For clinical parameters, mean differences were 0.4-1.9 mm and from 1.7° to 2.3°. All parameters except from angle formed by CTS axis and bi-epicondylar axis in transverse plane were reproducible. Reconstruction time was about 5 min. CONCLUSIONS: The presented method provides access to morphological upper limb parameters with very low level of radiation. Preliminary in vitro validation for humerus showed that it is fast and accurate enough to be used in clinical daily practice as an alternative to CT-scan for total elbow arthroplasty pre operative evaluation

HAL-Paris 13

SAM : Science Arts et Métiers

HAL - UPEC / UPEM

Quality of Radiomic Features in Glioblastoma Multiforme: Impact of Semi-Automated Tumor Segmentation Software.

Author: Jamshidi Neema
Kim Jong Hyo
Kuo Michael D
Lee Myungeun
Woo Boyeong
Publication venue: eScholarship, University of California
Publication date: 03/04/2017
Field of study

ObjectiveThe purpose of this study was to evaluate the reliability and quality of radiomic features in glioblastoma multiforme (GBM) derived from tumor volumes obtained with semi-automated tumor segmentation software.Materials and methodsMR images of 45 GBM patients (29 males, 16 females) were downloaded from The Cancer Imaging Archive, in which post-contrast T1-weighted imaging and fluid-attenuated inversion recovery MR sequences were used. Two raters independently segmented the tumors using two semi-automated segmentation tools (TumorPrism3D and 3D Slicer). Regions of interest corresponding to contrast-enhancing lesion, necrotic portions, and non-enhancing T2 high signal intensity component were segmented for each tumor. A total of 180 imaging features were extracted, and their quality was evaluated in terms of stability, normalized dynamic range (NDR), and redundancy, using intra-class correlation coefficients, cluster consensus, and Rand Statistic.ResultsOur study results showed that most of the radiomic features in GBM were highly stable. Over 90% of 180 features showed good stability (intra-class correlation coefficient [ICC] ≥ 0.8), whereas only 7 features were of poor stability (ICC < 0.5). Most first order statistics and morphometric features showed moderate-to-high NDR (4 > NDR ≥1), while above 35% of the texture features showed poor NDR (< 1). Features were shown to cluster into only 5 groups, indicating that they were highly redundant.ConclusionThe use of semi-automated software tools provided sufficiently reliable tumor segmentation and feature stability; thus helping to overcome the inherent inter-rater and intra-rater variability of user intervention. However, certain aspects of feature quality, including NDR and redundancy, need to be assessed for determination of representative signature features before further development of radiomics

PubMed Central

eScholarship - University of California

DeepCoder: Semi-parametric Variational Autoencoders for Automatic Facial Action Coding

Author: Eleftheriadis Stefanos
Pantic Maja
Rudovic Ognjen
Schuller Bjørn
Tran Dieu Linh
Walecki Robert
Publication venue
Publication date: 05/08/2017
Field of study

Human face exhibits an inherent hierarchy in its representations (i.e., holistic facial expressions can be encoded via a set of facial action units (AUs) and their intensity). Variational (deep) auto-encoders (VAE) have shown great results in unsupervised extraction of hierarchical latent representations from large amounts of image data, while being robust to noise and other undesired artifacts. Potentially, this makes VAEs a suitable approach for learning facial features for AU intensity estimation. Yet, most existing VAE-based methods apply classifiers learned separately from the encoded features. By contrast, the non-parametric (probabilistic) approaches, such as Gaussian Processes (GPs), typically outperform their parametric counterparts, but cannot deal easily with large amounts of data. To this end, we propose a novel VAE semi-parametric modeling framework, named DeepCoder, which combines the modeling power of parametric (convolutional) and nonparametric (ordinal GPs) VAEs, for joint learning of (1) latent representations at multiple levels in a task hierarchy1, and (2) classification of multiple ordinal outputs. We show on benchmark datasets for AU intensity estimation that the proposed DeepCoder outperforms the state-of-the-art approaches, and related VAEs and deep learning models.Comment: ICCV 2017 - accepte

arXiv.org e-Print Archive

Spiral - Imperial College Digital Repository

Evolution of statistical analysis in empirical software engineering research: Current state and steps forward

Author: Feldt Robert
Furia Carlo A.
Gren Lucas
Huang Ziwei
Neto Francisco Gomes de Oliveira
Torkar Richard
Publication venue
Publication date: 01/01/2019
Field of study

Software engineering research is evolving and papers are increasingly based on empirical data from a multitude of sources, using statistical tests to determine if and to what degree empirical evidence supports their hypotheses. To investigate the practices and trends of statistical analysis in empirical software engineering (ESE), this paper presents a review of a large pool of papers from top-ranked software engineering journals. First, we manually reviewed 161 papers and in the second phase of our method, we conducted a more extensive semi-automatic classification of papers spanning the years 2001--2015 and 5,196 papers. Results from both review steps was used to: i) identify and analyze the predominant practices in ESE (e.g., using t-test or ANOVA), as well as relevant trends in usage of specific statistical methods (e.g., nonparametric tests and effect size measures) and, ii) develop a conceptual model for a statistical analysis workflow with suggestions on how to apply different statistical methods as well as guidelines to avoid pitfalls. Lastly, we confirm existing claims that current ESE practices lack a standard to report practical significance of results. We illustrate how practical significance can be discussed in terms of both the statistical analysis and in the practitioner's context.Comment: journal submission, 34 pages, 8 figure

arXiv.org e-Print Archive

Chalmers Research

Feasibility of in vivo multi-parametric quantitative magnetic resonance imaging of the healthy sciatic nerve with a unified signal readout protocol

Author: Battiston M
Boonsuth R
Calvi A
Gandini Wheeler-Kingshott CAM
Grussu F
Samlidou CM
Samson RS
Yiannakas MC
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/04/2023
Field of study

Magnetic resonance neurography (MRN) has been used successfully over the years to investigate the peripheral nervous system (PNS) because it allows early detection and precise localisation of neural tissue damage. However, studies demonstrating the feasibility of combining MRN with multi-parametric quantitative magnetic resonance imaging (qMRI) methods, which provide more specific information related to nerve tissue composition and microstructural organisation, can be invaluable. The translation of emerging qMRI methods previously validated in the central nervous system to the PNS offers real potential to characterise in patients in vivo the underlying pathophysiological mechanisms involved in a plethora of conditions of the PNS. The aim of this study was to assess the feasibility of combining MRN with qMRI to measure diffusion, magnetisation transfer and relaxation properties of the healthy sciatic nerve in vivo using a unified signal readout protocol. The reproducibility of the multi-parametric qMRI protocol as well as normative qMRI measures in the healthy sciatic nerve are reported. The findings presented herein pave the way to the practical implementation of joint MRN-qMRI in future studies of pathological conditions affecting the PNS

UCL Discovery

Recommended from our members

Beyond Standard Assumptions - Semiparametric Models, A Dyadic Item Response Theory Model, and Cluster-Endogenous Random Intercept Models

Author: Sim Nicholas
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

In most statistical analyses, quantitative education researchers often make simplifying assumptions regarding the manner in which their data was generated in order to answer some of these questions. These assumptions can help to reduce the complexity of the problem, and allow the researcher to describe their data using a simpler, and often times more interpretable, statistical model. However, making some of these assumptions when they are not true can lead to biased estimates and misleading answers. While the standard sets of assumptions associated with commonly-used statistical models are usually sufficient in a wide range of contexts, it will always be beneficial for education researchers to understand what they are, when they are reasonable, and how to modify them if necessary. This dissertation focuses on three of the most common models used in quantitative education research (viz. parametric models like Linear Models (LMs), Item Response Theory (IRT) models, and Random-Intercept Models (RIMs)), discusses the standard sets of assumptions that accompany these models, and then describes related models with less stringent sets of assumptions. In each of the following three chapters, we either explicitly unpack existing models that are useful but are currently still uncommon in the field of education research, or propose novel models and/or estimation strategies for these models. We begin in Chapter 1 with a common parametric model known as the Gaussian LM, and use it as a scaffold to better understand semiparametric models and their estimation. We begin by reviewing how the coefficients of the Gaussian LM are usually estimated using Maximum Likelihood (ML) or Least-Squares (LS). We then introduce the notion of an

m

-estimator as well as that of a Regular Asymptotically Linear estimator, and show how they relate to the ML estimator. In particular, we introduce the notion of influence functions/curves and discuss their geometry together with concepts such as Hilbert spaces and tangent spaces. We then demonstrate, concretely, how to derive the so-called efficient influence function under the Gaussian LM, and show that it is precisely the influence function of the ML and (Ordinary) LS estimators. This shows that the ML estimator (at least under the Gaussian LM) is efficient. Using the foundation built, we move on from the Gaussian LM by relaxing both the assumption that the residuals are normally distributed, as well as the assumption that they have a constant variance, and define this as the Heteroskedastic Linear Model. Unlike the Gaussian LM, this is a semiparametric model. Where possible, we make use of intuition and analogous results from the parametric setting to help describe the workflow for obtaining an efficient estimator for the coefficients of the Heteroskedastic Linear Model. In particular, we derive the nuisance tangent space for this semiparametric model, and use it to obtain the efficient influence function for our model. We then show how to use the efficient influence function to obtain an efficient estimator (which happens to be the Weighted LS estimator) from the (Ordinary) LS estimator via a one-step approach as well as an estimating equations approach. We then conclude by directing readers to more advanced material, including references on more modern approaches to estimating more general semiparametric models such as Targeted Maximum Likelihood Estimation. In Chapter 2, we focus on a class of measurement models known as Item Response Theory models which are useful for measuring latent traits of a subject based on the subject's response to items. We relax the condition that the responses are only a result of the individual's latent trait (and possibly an external rater), and propose a dyadic Item Response Theory (dIRT) model for measuring interactions of pairs of individuals when the responses to items represent the actions (or behaviors, perceptions, etc.) of each individual (actor) made within the context of a dyad formed with another individual (partner). Examples of its use in education include the assessment of collaborative problem solving among students, or the evaluation of intra-departmental dynamics among teachers. The dIRT model generalizes both Item Response Theory models for measurement and the Social Relations Model for dyadic data. Here, the responses of an actor when paired with a partner are modeled as a function of not only the actor's inclination to act and the partner's tendency to elicit that action, but also the unique relationship of the pair, represented by two directional, possibly correlated, interaction latent variables. We discuss generalizations such as accommodating triads or larger groups, but focus on demonstrating the key idea in the dyadic case. We show that estimation may be performed using Markov-chain Monte Carlo implemented in \texttt{Stan}, making it straightforward to extend the dIRT model in various ways. Specifically, we show how the basic dIRT model can be extended to accommodate latent regressions, random effects, distal outcomes. We perform a simulation study that demonstrates that our estimation approach performs well. In the absence of educational data of this form, we demonstrate the usefulness of our proposed approach using speed-dating data instead, and find new evidence of pairwise interactions between participants, describing a mutual attraction that is inadequately characterized by individual properties alone.Finally, in Chapter 3, we consider the often implicit assumption made when estimating the coefficients of structural Random Intercept Models (RIMs) that covariates at all levels do not co-vary with the random intercepts. A violation of this assumption (called cluster-level endogeneity) leads to inconsistent estimates when using standard estimation procedures. For two-level RIMs with such endogeneity, Hausman and Taylor (HT) devised a consistent multi-step instrumental variable estimator using only internal instruments. We, instead, approach this problem by explicitly modeling the endogeneity using a Structural Equation Model (SEM). In this chapter, we compare, through simulation, the HT and SEM estimators, and evaluate their asymptotic and finite sample properties. We show that the SEM approach is also flexible enough to deal with different exchangeability assumptions for the covariates (e.g., whether the correlations between pairs of all units in a cluster are the same) and investigate how these exchangeability assumptions affect finite sample properties of the HT estimator. For the simulations, we propose a new procedure for generating cluster- and unit-level covariates and random intercepts with a fully flexible covariance structure. We also compare our approach to another common approach known as Multilevel Matching using data from the High School and Beyond survey

eScholarship - University of California

Mixture polarization in inter-rater agreement analysis: a Bayesian nonparametric index

Author: Calcagnì Antonio
Manolopoulou Ioanna
Mignemi Giuseppe
Spoto Andrea
Publication venue
Publication date: 26/09/2023
Field of study

In several observational contexts where different raters evaluate a set of items, it is common to assume that all raters draw their scores from the same underlying distribution. However, a plenty of scientific works have evidenced the relevance of individual variability in different type of rating tasks. To address this issue the intra-class correlation coefficient (ICC) has been used as a measure of variability among raters within the Hierarchical Linear Models approach. A common distributional assumption in this setting is to specify hierarchical effects as independent and identically distributed from a normal with the mean parameter fixed to zero and unknown variance. The present work aims to overcome this strong assumption in the inter-rater agreement estimation by placing a Dirichlet Process Mixture over the hierarchical effects' prior distribution. A new nonparametric index

\lambda

is proposed to quantify raters polarization in presence of group heterogeneity. The model is applied on a set of simulated experiments and real world data. Possible future directions are discussed

arXiv.org e-Print Archive