97 research outputs found

Infection status outcome, machine learning method and virus type interact to affect the optimised prediction of hepatitis virus immunoassay results from routine pathology laboratory assays in unbalanced data

Author: Alice M Richardson
AS File PE Dugard PI Houston
Brett A Lidbury
C Drummond
CW Shepard
G Shang
G Williams
JR Quinlan
KS Woods
L Brieman
L Han
M Negnevitsky
MJ de Rantala Mvan Laar
MK Kerr
N Japcowicz
PAD Wilks
SK Murthy
T Sy
V Busic
Z Zhou
Z Zhou
Publication venue: Springer Science and Business Media LLC

Use of machine learning to shorten observation-based screening and diagnosis of autism

Author: A Bailey
B Martin
BR Gaines
C Lord
DH Geschwind
DL Robins
E Frank
GD Fischbach
GI Webb
IH Witten
J Gama
JA Pinto-Martin
K Gotham
L Breiman
L Brieman
LC Eaves
LD Wiggins
M Hall
N Landwehr
P Howlin
PT Shattuck
R Bernier
R Kohavi
R Quinlan
RC Holte
SK Berument
Y Freund
Y Freund
Publication venue: Nature Publishing Group
Publication date: 07/12/2012

The Autism Diagnostic Observation Schedule-Generic (ADOS) is one of the most widely used instruments for behavioral evaluation of autism spectrum disorders. It is composed of four modules, each tailored for a specific group of individuals based on their language and developmental level. On average, a module takes between 30 and 60 min to deliver. We used a series of machine-learning algorithms to study the complete set of scores from Module 1 of the ADOS available at the Autism Genetic Resource Exchange (AGRE) for 612 individuals with a classification of autism and 15 non-spectrum individuals from both AGRE and the Boston Autism Consortium (AC). Our analysis indicated that 8 of the 29 items contained in Module 1 of the ADOS were sufficient to classify autism with 100% accuracy. We further validated the accuracy of this eight-item classifier against complete sets of scores from two independent sources, a collection of 110 individuals with autism from AC and a collection of 336 individuals with autism from the Simons Foundation. In both cases, our classifier performed with nearly 100% sensitivity, correctly classifying all but two of the individuals from these two resources with a diagnosis of autism, and with 94% specificity on a collection of observed and simulated non-spectrum controls. The classifier contained several elements found in the ADOS algorithm, demonstrating high test validity, and also resulted in a quantitative score that measures classification confidence and extremeness of the phenotype. With incidence rates rising, the ability to classify autism effectively and quickly requires careful design of assessment and diagnostic tools. Given the brevity, accuracy and quantitative nature of the classifier, results from this study may prove valuable in the development of mobile tools for preliminary evaluation and clinical prioritization (in particular those focused on assessment of short home videos of children) that speed the pace of initial evaluation and broaden the reach to a significantly larger percentage of the population at risk.
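The "nearly 100% sensitivity" figure in this abstract can be checked from the counts it reports. The following is a minimal sketch, assuming the two misclassified individuals are counted against the combined validation cohorts (110 from AC plus 336 from the Simons Foundation); the function and variable names are illustrative and do not come from the paper.

```python
def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Fraction of actual positive cases that the classifier labels positive."""
    return true_positives / (true_positives + false_negatives)


# Validation cohorts reported in the abstract: 110 (AC) + 336 (Simons Foundation).
total_cases = 110 + 336
# "All but two" individuals with an autism diagnosis were classified correctly.
missed = 2

validation_sensitivity = sensitivity(total_cases - missed, missed)
print(f"{validation_sensitivity:.4f}")  # prints 0.9955
```

Under that assumption, 444 of 446 cases are detected, which is about 99.6% sensitivity. The 94% specificity cannot be reproduced the same way, because the abstract does not give the number of control cases.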

Crossref

Harvard University - DASH

PubMed Central

Mastectomy or breast conserving surgery? Factors affecting type of surgical treatment for breast cancer – a classification tree approach

Author: A Potosky
C Furnival
C Morris
CS Foo
D Altman
E Graf
G Maskarinec
G Riley
GG Giles
K Hiotis
K Hiotis
K Spilsbury
KR Hess
L Brieman
M Morrow
M Najafi
Michael A Martin
National Health and Medical Research Council
Ramona Meyricke
SE Hall
Steven Roberts
T Hastie
Terry O'Neill
UW Jayasinghe
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: A critical choice facing breast cancer patients is which surgical treatment – mastectomy or breast conserving surgery (BCS) – is most appropriate. Several studies have investigated factors that impact the type of surgery chosen, identifying features such as place of residence, age at diagnosis, tumor size, socio-economic and racial/ethnic elements as relevant. Such assessment of "propensity" is important in understanding issues such as a reported under-utilisation of BCS among women for whom such treatment was not contraindicated. Using Western Australian (WA) data, we further examine the factors associated with the type of surgical treatment for breast cancer using a classification tree approach. This approach deals naturally with complicated interactions between factors, and so allows flexible and interpretable models for treatment choice to be built that add to the current understanding of this complex decision process. METHODS: Data were extracted from the WA Cancer Registry on women diagnosed with breast cancer in WA from 1990 to 2000. Subjects' treatment preferences were predicted from covariates using both classification trees and logistic regression. RESULTS: Tumor size was the primary determinant of patient choice, with subjects whose tumors were smaller than 20 mm in diameter preferring BCS. For subjects with tumors greater than 20 mm in diameter, factors such as patient age, nodal status, and tumor histology became relevant as predictors of patient choice. CONCLUSION: Classification trees perform as well as logistic regression for predicting patient choice, but are much easier to interpret for clinical use. The selected tree can inform clinicians' advice to patients.

ANU Digital Collections

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

The Australian National University

Background: A fundamental goal of human genetics is the discovery of polymorphisms that predict common, complex diseases. It is hypothesized that complex diseases are due to a myriad of factors including environmental exposures and complex genetic risk models, including gene-gene interactions. Such epistatic models present an important analytical challenge, requiring that methods perform not only statistical modeling, but also variable selection to generate testable genetic model hypotheses. This challenge is amplified by recent advances in genotyping technology, as the number of potential predictor variables is rapidly increasing. Methods: Decision trees are a highly successful, easily interpretable data-mining method that is typically optimized with a hierarchical model building approach, which limits their potential to identify interacting effects. To overcome this limitation, we utilize evolutionary computation, specifically grammatical evolution, to build decision trees to detect and model gene-gene interactions. In the current study, we introduce the Grammatical Evolution Decision Trees (GEDT) method and software and evaluate this approach on simulated data representing gene-gene interaction models of a range of effect sizes. We compare the performance of the method to a traditional decision tree algorithm and a random search approach and demonstrate the improved performance of the method to detect purely epistatic interactions. Results: The results of our simulations demonstrate that GEDT has high power to detect even very moderate genetic risk models. GEDT has high power to detect interactions with and without main effects. Conclusions: GEDT, while still in its initial stages of development, is a promising new approach for identifying gene-gene interactions in genetic association studies.

Crossref

Directory of Open Access Journals

PubMed Central

Disparities in mammographic screening for Asian women in California: a cross-sectional analysis to identify meaningful groups for targeted intervention

Background: Breast cancer is the most commonly diagnosed cancer among the rapidly growing population of Asian Americans; it is also the most common cause of cancer mortality among Filipinas. Asian women continue to have lower rates of mammographic screening than women of most other racial/ethnic groups. While prior studies have described the effects of sociodemographic and other characteristics of women on non-adherence to screening guidelines, they have not identified the distinct segments of the population who remain at highest risk of not being screened. Methods: To better describe characteristics of Asian women associated with not having a mammogram in the last two years, we applied recursive partitioning to population-based data (N = 1521) from the 2001 California Health Interview Survey (CHIS), for seven racial/ethnic groups of interest: Chinese, Japanese, Filipino, Korean, South Asian, Vietnamese, and all Asians combined. Results: We identified two major subgroups of Asian women who reported not having a mammogram in the past two years and therefore, did not follow mammography screening recommendations: 1) women who have never had a pap exam to screen for cervical cancer (68% had no mammogram), and 2) women who have had a pap exam, but have no women's health issues (osteoporosis, using menopausal hormone therapies, and/or hysterectomy) nor a usual source of care (62% had no mammogram). Only 19% of Asian women who have had pap screening and have women's health issues did not have a mammogram in the past two years. In virtually all ethnic subgroups, having had pap or colorectal screening were the strongest delineators of mammography usage. Other characteristics of women least likely to have had a mammogram included: Chinese non-U.S. citizens or citizens without usual source of health care, Filipinas with no health insurance, Koreans without women's health issues and public or no health insurance, South Asians less than age 50 who were unemployed or non-citizens, and Vietnamese women who were never married. Conclusion: We identified distinct subgroups of Asian women at highest risk of not adhering to mammography screening guidelines; these data can inform outreach efforts aimed at reducing the disparity in mammography screening among Asian women.

Crossref

Springer

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Measurement of the form-factor ratios for $D^+ \to \overline{K}^{*0} e^+ \nu_e$

Author: A. Abada
A. B. d'Oliveira
A. C. dos Reis
A. F. S. Santoro
A. Fernandez
A. J. Schwartz
A. J. Slaughter
A. K. S. Santha
A. K. Tripathi
A. M. Halling
A. Napier
A. Nguyen
A. Rafatian
B. Lundberg
B. Meadows
B. Quinn
C. Darling
C. James
C. R. Allton
C. Zhang
D. A. Sanders
D. Ashery
D. C. Langs
D. J. Summers
D. J. Summers
D. M. Schmidt
D. Mihalcea
D. Scora
D. Yi
E. M. Aitala
E. Wolin
G. Blaylock
G. Fox
G. Herrera
G. Hurvits
G. P. Gagnon
H. A. Rubin
H. S. Carvalho
I. Bediaga
J. A. Appel
J. A. Appel
J. C. Anjos
J. C. Anjos
J. D. Richman
J. J. Reidy
J. Leslie
J. M. de Miranda
J. Nieves
J. R. T. de Mello Neto
J. Solano
J. Wiener
K. C. Peng
K. Denisenko
K. Gounder
K. Kodama
K. O'Shaughnessy
K. Stenson
K. Thorne
L. Brieman
L. M. Cremaldi
L. P. Perera
M. D. Sokoloff
M. Sheaff
M. V. Purohit
N. Isgur
N. K. Copty
N. R. Stanton
N. W. Reay
N. Witchey
P. A. Kasper
P. L. Frabetti
P. R. Burchat
R. A. Burnstein
R. A. Sidwell
R. H. Milburn
R. Weiss-Babai
R. Zaliznyak
S. Amato
S. B. Bracker
S. Banerjee
S. Gusken
S. Kwan
S. MayTal-Beck
S. Radeztsky
S. Takach
S. Watanabe
S. Yoshida
T. Carter
Publication venue: American Physical Society (APS)
Publication date: 01/10/1997

We present a measurement of the form-factor ratios $r_V = V(0)/A_1(0)$ and $r_2 = A_2(0)/A_1(0)$ for the decay $D^+ \to \overline{K}^{*0} e^+ \nu_e$. The measurement is based on a signal of approximately 3000 $D^+ \to \overline{K}^{*0} e^+ \nu_e$, $\overline{K}^{*0} \to K^- \pi^+$ decays reconstructed in data from charm hadroproduction experiment E791 at Fermilab. The results are $r_V = 1.84 \pm 0.11 \pm 0.08$ and $r_2 = 0.71 \pm 0.08 \pm 0.09$.

Comment: 9 pages, 2 figures
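Each result above is quoted with two separate uncertainties. As a rough illustration only, the sketch below combines them in quadrature into a single total. It assumes the first uncertainty is statistical and the second systematic, and that the two are uncorrelated; the abstract states none of this, and the helper name `combine_in_quadrature` is hypothetical.

```python
import math


def combine_in_quadrature(*uncertainties: float) -> float:
    """Combine independent uncertainties as the square root of the sum of squares."""
    return math.sqrt(sum(u * u for u in uncertainties))


# Values quoted in the abstract. Treating the first uncertainty as statistical
# and the second as systematic is an assumption, not stated in the abstract.
r_v_total = combine_in_quadrature(0.11, 0.08)
r_2_total = combine_in_quadrature(0.08, 0.09)

print(f"r_V = 1.84 +/- {r_v_total:.2f}")  # prints r_V = 1.84 +/- 0.14
print(f"r_2 = 0.71 +/- {r_2_total:.2f}")  # prints r_2 = 0.71 +/- 0.12
```

If the two uncertainties were correlated, a quadrature sum would understate or overstate the total, so the combined values are indicative only.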