43 research outputs found
Confidence intervals for robust estimates of measurement uncertainty
Uncertainties arising at different stages of a measurement process can be estimated using Analysis of Variance (ANOVA) on duplicated measurements. In some cases it is also desirable to calculate confidence intervals for these uncertainties. This can be achieved using probability models that assume the measurement data are normally distributed. However, it is often the case in practice that a set of otherwise normally distributed measurement values is contaminated by a small number of outlying values, which may have a disproportionate effect on the variances calculated using the âclassicalâ form of ANOVA. In this case, robust ANOVA methods are able to provide variance estimates that are much closer to the parameters of the underlying normal distributions. A method using bootstrapping to calculate confidence intervals from robust estimates of variances is proposed and evaluated, and is shown to work well when the number of outlying values is small. The method has been implemented in a Visual Basic program
Optimisation of the predictive ability of NIR models to estimate nutritional parameters in elephant grass through LOCAL algorithms
Elephant grass is a tropical forage widely used for livestock feed. The analytical techniques traditionally used for its
nutritional evaluation are costly and time consuming. Alternatively, Near Infrared Spectroscopy (NIRS) technology
has been used as a rapid analysis technique. However, in crops with high variability due to genetic improvement,
predictive models quickly lose accuracy and must be recalibrated. The use of non-linear models such as LOCAL
calibrations could mitigate these issues, although a number of parameters need to be optimized to obtain accurate
results. The objective of this work was to compare the predictive results obtained with global NIRS calibrations and
with LOCAL calibrations, paying special attention to the configuration parameters of the models.
The results obtained showed that the prediction errors with the LOCAL models were between 1.6 and 17.5 %
lower. The best results were obtained in most cases with a low number of selected samples (n = 100â250) and a
high number of PLS terms (n = 20). This configuration allows a reduced computation time with high accuracy,
becoming a valuable alternative for analytical determinations that require ruminal fluid, which would improve
the welfare of the animals by avoiding the need to surgically prepare animals to estimate the nutritional value of
the feeds
Determination of ingredients in packaged pharmaceutical tablets by energy dispersive Xâray diffraction and maximum likelihood principal component analysis multivariate curve resolutionâalternating least squares with correlation constraint
Energy dispersive Xâray diffraction (EDXRD) and maximum likelihood principal component analysis multivariate curve resolutionâalternating least squares (MLPCAâMCRâALS) with correlation constraint were used to quantify the composition of packaged pharmaceutical formulations. Recorded EDXRD profiles from unpackaged and packaged samples of ternary mixtures were modelled together in order to recover the concentrations as well as the pure profiles of the constituent compounds. MLPCA was used as a data pretreatment step to MCRâALS, accounting for the high noise and nonconstant variance observed in the EDXRD profiles and was shown to improve the resolution accuracy of MCRâALS for the data set. Local correlation constraints were applied in the MCRâALS procedure in order to model unpackaged and packaged samples simultaneously while accounting for the matrix effect of the packaging materials. The composition of the formulations was estimated with rootâmeanâsquare error of prediction for each component, including paracetamol, being approximately 2.5 %w/w for unpackaged and packaged samples. Paracetamol concentration was resolved simultaneously for the unpackaged and packaged samples to a greater degree of accuracy than achieved by partial least squares regression (PLSR) when modelling the contexts separately. By modelling the effects of the packaging and incorporating accurate reference information of unpackaged samples into the resolution of packaged samples, the potential of EDXRD and MLPCAâMCRâALS for the identification and quantification of packaged solidâdosage medicine in nondestructive screening and counterfeit medicine detection has been raised
Classifying degraded modern polymeric museum artefacts by their smell
Volatile organic compound (VOC) analysis is a successful method for diagnosing medical conditions such as Alzheimerâs disease. However, despite its relevance to heritage, it has found little application in museums. We report the first use of VOC analysis to âdiagnoseâ degradation in modern polymeric museum artefacts. Modern polymers are increasingly found in museum collections but pose serious conservation difficulties due to unstable and widely varying formulations. Solid-phase microextraction gas chromatography/mass spectrometry and linear discriminant analysis were used to classify samples according to the length of time they had been artificially degraded. Classification accuracies of 50-83% were obtained after validation with separate test sets. The method was applied to three artefacts from collections at Tate to detect evidence of degradation. This novel approach could be used for any material in heritage collections and more widely in the field of polymer degradation
A Partially Supervised Bayesian Image Classification Model with Applications in Diagnosis of Sentinel Lymph Node Metastases in Breast Cancer
A method has been developed for the analysis of images of sentinel lymph
nodes generated by a spectral scanning device. The aim is to classify the
nodes, excised during surgery for breast cancer, as normal or metastatic. The
data from one node constitute spectra at 86 wavelengths for each pixel of a
20*20 grid. For the analysis, the spectra are reduced to scores on two factors,
one derived externally from a linear discriminant analysis using spectra taken
manually from known normal and metastatic tissue, and one derived from the node
under investigation to capture variability orthogonal to the external factor.
Then a three-group mixture model (normal, metastatic, non-nodal background)
using multivariate t distributions is fitted to the scores, with external data
being used to specify informative prior distributions for the parameters of the
three distributions. A Markov random field prior imposes smoothness on the
image generated by the model. Finally, the node is classified as metastatic if
any one pixel in this smoothed image is classified as metastatic. The model
parameters were tuned on a training set of nodes, and then the tuned model was
tested on a separate validation set of nodes, achieving satisfactory
sensitivity and specificity. The aim in developing the analysis was to allow
flexibility in the way each node is modelled whilst still using external
information. The Bayesian framework employed is ideal for this.Comment: 31 pages, 7 figure
LabKey Server: An open source platform for scientific data integration, analysis and collaboration
<p>Abstract</p> <p>Background</p> <p>Broad-based collaborations are becoming increasingly common among disease researchers. For example, the Global HIV Enterprise has united cross-disciplinary consortia to speed progress towards HIV vaccines through coordinated research across the boundaries of institutions, continents and specialties. New, end-to-end software tools for data and specimen management are necessary to achieve the ambitious goals of such alliances. These tools must enable researchers to organize and integrate heterogeneous data early in the discovery process, standardize processes, gain new insights into pooled data and collaborate securely.</p> <p>Results</p> <p>To meet these needs, we enhanced the LabKey Server platform, formerly known as CPAS. This freely available, open source software is maintained by professional engineers who use commercially proven practices for software development and maintenance. Recent enhancements support: (i) Submitting specimens requests across collaborating organizations (ii) Graphically defining new experimental data types, metadata and wizards for data collection (iii) Transitioning experimental results from a multiplicity of spreadsheets to custom tables in a shared database (iv) Securely organizing, integrating, analyzing, visualizing and sharing diverse data types, from clinical records to specimens to complex assays (v) Interacting dynamically with external data sources (vi) Tracking study participants and cohorts over time (vii) Developing custom interfaces using client libraries (viii) Authoring custom visualizations in a built-in R scripting environment.</p> <p>Diverse research organizations have adopted and adapted LabKey Server, including consortia within the Global HIV Enterprise. Atlas is an installation of LabKey Server that has been tailored to serve these consortia. It is in production use and demonstrates the core capabilities of LabKey Server. Atlas now has over 2,800 active user accounts originating from approximately 36 countries and 350 organizations. It tracks roughly 27,000 assay runs, 860,000 specimen vials and 1,300,000 vial transfers.</p> <p>Conclusions</p> <p>Sharing data, analysis tools and infrastructure can speed the efforts of large research consortia by enhancing efficiency and enabling new insights. The Atlas installation of LabKey Server demonstrates the utility of the LabKey platform for collaborative research. Stable, supported builds of LabKey Server are freely available for download at <url>http://www.labkey.org</url>. Documentation and source code are available under the Apache License 2.0.</p
SARS-CoV-2 lineage B.1.1.7 is associated with greater disease severity among hospitalised women but not men: multicentre cohort study.
BACKGROUND: SARS-CoV-2 lineage B.1.1.7 has been associated with an increased rate of transmission and disease severity among subjects testing positive in the community. Its impact on hospitalised patients is less well documented. METHODS: We collected viral sequences and clinical data of patients admitted with SARS-CoV-2 and hospital-onset COVID-19 infections (HOCIs), sampled 16 November 2020 to 10 January 2021, from eight hospitals participating in the COG-UK-HOCI study. Associations between the variant and the outcomes of all-cause mortality and intensive therapy unit (ITU) admission were evaluated using mixed effects Cox models adjusted by age, sex, comorbidities, care home residence, pregnancy and ethnicity. FINDINGS: Sequences were obtained from 2341 inpatients (HOCI cases=786) and analysis of clinical outcomes was carried out in 2147 inpatients with all data available. The HR for mortality of B.1.1.7 compared with other lineages was 1.01 (95% CI 0.79 to 1.28, p=0.94) and for ITU admission was 1.01 (95% CI 0.75 to 1.37, p=0.96). Analysis of sex-specific effects of B.1.1.7 identified increased risk of mortality (HR 1.30, 95%âCI 0.95 to 1.78, p=0.096) and ITU admission (HR 1.82, 95%âCI 1.15 to 2.90, p=0.011) in females infected with the variant but not males (mortality HR 0.82, 95%âCI 0.61 to 1.10, p=0.177; ITU HR 0.74, 95%âCI 0.52 to 1.04, p=0.086). INTERPRETATION: In common with smaller studies of patients hospitalised with SARS-CoV-2, we did not find an overall increase in mortality or ITU admission associated with B.1.1.7 compared with other lineages. However, women with B.1.1.7 may be at an increased risk of admission to intensive care and at modestly increased risk of mortality.This report was produced by members of the COG-UK-HOCI Variant
substudy consortium. COG-UK-HOCI is part of COG-UK. COG-UK is supported by funding from the Medical Research Council (MRC) part of UK Research & Innovation (UKRI), the National Institute of Health Research (NIHR) and Genome Research Limited, operating as the Wellcome Sanger Institute
Genomic epidemiology of SARS-CoV-2 in a UK university identifies dynamics of transmission
AbstractUnderstanding SARS-CoV-2 transmission in higher education settings is important to limit spread between students, and into at-risk populations. In this study, we sequenced 482 SARS-CoV-2 isolates from the University of Cambridge from 5 October to 6 December 2020. We perform a detailed phylogenetic comparison with 972 isolates from the surrounding community, complemented with epidemiological and contact tracing data, to determine transmission dynamics. We observe limited viral introductions into the university; the majority of student cases were linked to a single genetic cluster, likely following social gatherings at a venue outside the university. We identify considerable onward transmission associated with student accommodation and courses; this was effectively contained using local infection control measures and following a national lockdown. Transmission clusters were largely segregated within the university or the community. Our study highlights key determinants of SARS-CoV-2 transmission and effective interventions in a higher education setting that will inform public health policy during pandemics.</jats:p