4 research outputs found
Leveraging Spatial Metadata in Machine Learning for Improved Objective Quantification of Geological Drill Core
Here we present a method for using the spatial x–y coordinate of an image cropped from the cylindrical surface of digital 3D drill core images and demonstrate how this spatial metadata can be used to improve unsupervised machine learning performance. This approach is applicable to any data set with known spatial context, however, here it is used to classify 400 m of drillcore imagery into 12 distinct classes reflecting the dominant rock types and alteration features in the core. We modified two unsupervised learning models to incorporate spatial metadata and an average improvement of 25% was achieved over equivalent models that did not utilize metadata. Our semi-supervised workflow involves unsupervised network training followed by semi-supervised clustering where a support vector machine uses a subset of M expert labeled images to assign a pseudolabel to the entire data set. Fine-tuning of the best performing model showed an f1 (macro average) of 90%, and its classifications were used to estimate bulk fresh and altered rock abundance downhole. Validation against the same information gathered manually by experts when the core was recovered during the Oman Drilling Project revealed that our automatically generated data sets have a significant positive correlation (Pearson's r of 0.65–0.72) to the expert generated equivalent, demonstrating that valuable geological information can be generated automatically for 400 m of core with only ∼24 hr of domain expert effort
Testing the activitystat hypothesis: a randomised controlled trial protocol
Background: The activitystat hypothesis proposes that when physical activity or energy expenditure is increased or decreased in one domain, there will be a compensatory change in another domain to maintain an overall, stable level of physical activity or energy expenditure. To date, there has been no experimental study primarily designed to test the activitystat hypothesis in adults. The aim of this trial is to determine the effect of two different imposed exercise loads on total daily energy expenditure and physical activity levels. Methods. This study will be a randomised, multi-arm, parallel controlled trial. Insufficiently active adults (as determined by the Active Australia survey) aged 18-60 years old will be recruited for this study (n=146). Participants must also satisfy the Sports Medicine Australia Pre-Exercise Screening System and must weigh less than 150 kg. Participants will be randomly assigned to one of three groups using a computer-generated allocation sequence. Participants in the Moderate exercise group will receive an additional 150 minutes of moderate to vigorous physical activity per week for six weeks, and those in the Extensive exercise group will receive an additional 300 minutes of moderate to vigorous physical activity per week for six weeks. Exercise targets will be accumulated through both group and individual exercise sessions monitored by heart rate telemetry. Control participants will not be given any instructions regarding lifestyle. The primary outcome measures are activity energy expenditure (doubly labeled water) and physical activity (accelerometry). Secondary measures will include resting metabolic rate via indirect calorimetry, use of time, maximal oxygen consumption and several anthropometric and physiological measures. Outcome measures will be conducted at baseline (zero weeks), mid- and end-intervention (three and six weeks) with three (12 weeks) and six month (24 week) follow-up. All assessors will be blinded to group allocation. Discussion. This protocol has been specifically designed to test the activitystat hypothesis while taking into account the key conceptual and methodological considerations of testing a biologically regulated homeostatic feedback loop. Results of this study will be an important addition to the growing literature and debate concerning the possible existence of an activitystat. Trial registration. Australian New Zealand Clinical Trials Registry ACTRN12610000248066
Accuracy versus precision in boosted top tagging with the ATLAS detector
Abstract
The identification of top quark decays where the top quark has a large momentum transverse to the beam axis, known as top tagging, is a crucial component in many measurements of Standard Model processes and searches for beyond the Standard Model physics at the Large Hadron Collider.
Machine learning techniques have improved the performance of top tagging algorithms, but the size of the systematic uncertainties for all proposed algorithms has not been systematically studied.
This paper presents the performance of several machine learning based top tagging algorithms on a dataset constructed from simulated proton-proton collision events measured with the ATLAS detector at √
s
= 13 TeV.
The systematic uncertainties associated with these algorithms are estimated through an approximate procedure that is not meant to be used in a physics analysis, but is appropriate for the level of precision required for this study.
The most performant algorithms are found to have the largest uncertainties, motivating the development of methods to reduce these uncertainties without compromising performance.
To enable such efforts in the wider scientific community, the datasets used in this paper are made publicly available.</jats:p