Search CORE

37 research outputs found

Accuracy of Administratively-Assigned Ancestry for Diverse Populations in an Electronic Medical Record-Linked Biobank

Author: Dana C. Crawford (102216)
Holli H. Dilks (572402)
Jacob B. Hall (572401)
Logan Dumitrescu (169202)
William S. Bush (107449)
Publication venue
Publication date: 04/06/2014
Field of study

<div>Recently, the development of biobanks linked to electronic medical records has presented new opportunities for genetic and epidemiological research. Studies based on these resources, however, present unique challenges, including the accurate assignment of individual-level population ancestry. In this work we examine the accuracy of administratively-assigned race in diverse populations by comparing assigned races to genetically-defined ancestry estimates. Using 220 ancestry informative markers, we generated principal components for patients in our dataset, which were used to cluster patients into groups based on genetic ancestry. Consistent with other studies, we find a strong overall agreement (Kappa = 0.872) between genetic ancestry and assigned race, with higher rates of agreement for African-descent and European-descent assignments, and reduced agreement for Hispanic, East Asian-descent, and South Asian-descent assignments. These results suggest caution when selecting study samples of non-African and non-European backgrounds when administratively-assigned race from biobanks is used.</div

Directory of Open Access Journals

PubMed Central

FigShare

Comparison of administratively-assigned race and genetic ancestry, based on principal component analysis.

Author: Dana C. Crawford (102216)
Holli H. Dilks (572402)
Jacob B. Hall (572401)
Logan Dumitrescu (169202)
William S. Bush (107449)
Publication venue
Publication date
Field of study

A) All pairwise combinations of principle components (PCs) 1 through 3, by administratively assigned race. B) All pairwise combinations of PCs 1 through 3, by cluster assignments corresponding to genetic ancestry. Comparison of Frames 1A and1B indicate individuals with administratively assigned race different than their genetically defined ancestry cluster. For example, the East Asian-descent cluster (1B; blue) contains individuals with administratively-assigned race (1A) of Caucasian (green), Hispanic (purple), and Other (orange).</p

FigShare

Agreement between genetic and assigned ancestry.

Author: Dana C. Crawford (102216)
Holli H. Dilks (572402)
Jacob B. Hall (572401)
Logan Dumitrescu (169202)
William S. Bush (107449)
Publication venue
Publication date
Field of study

Notation: Cohen's Kappa coefficient (standard error).South Asian-descent includes individuals with Native American and Indian race codes in BioVU.Samples with administratively-assigned race of “Unknown” were excluded from this analysis.</p

FigShare

Percentages of each administratively-assigned race assigned to each genetic ancestry group.

Author: Dana C. Crawford (102216)
Holli H. Dilks (572402)
Jacob B. Hall (572401)
Logan Dumitrescu (169202)
William S. Bush (107449)
Publication venue
Publication date
Field of study

Percentages reflect the proportion of individuals assigned to a genetic ancestry cluster for given administratively-assigned race.</p

FigShare

Distribution of administratively-assigned race.

Author: Dana C. Crawford (102216)
Holli H. Dilks (572402)
Jacob B. Hall (572401)
Logan Dumitrescu (169202)
William S. Bush (107449)
Publication venue
Publication date
Field of study

Race categories listed are based on classification options originating from the SD. Our BioVU dataset contained no individuals labeled Other (O). Vanderbilt University Medical Center is located in Davidson County, TN. 2010 US census data is shown for Davidson County, Tennessee <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0099161#pone.0099161-US1" target="_blank">[25]</a>. * For Davidson County, “Asian/Pacific” includes Asian (Non-Indian), Native Hawaiian, and Pacific Islander individuals, “Native American” includes Native American (American Indian) and Alaskan Native individuals, “Indian” includes Asian Indian individuals, and “Unknown” includes ‘some other race’ and individuals who reported two or more races for the census. ** “Hispanic” is not listed a race in the US Census; rather, Hispanic-origin is indicated and is not exclusive to any racial category. For example, 25,156 individuals in Davidson County who self-identified as ‘White’ also self-identified, separately, as Hispanic. Within Davidson County, 9.8% of individuals indicated Hispanic origin.</p

FigShare

Descriptive statistics by biomarker group.

Author: Amy Oksol (4634746)
Angela L. Jefferson (822693)
Katherine A. Gifford (822689)
Logan Dumitrescu (169202)
Madison Wagener (4634749)
Timothy J. Hohman (486551)
Publication venue
Publication date
Field of study

Descriptive statistics by biomarker group.</p

FigShare

APOE allele frequencies in suspected non-amyloid pathophysiology (SNAP) and the prodromal stages of Alzheimer’s Disease

Author: Amy Oksol (4634746)
Angela L. Jefferson (822693)
Katherine A. Gifford (822689)
Logan Dumitrescu (169202)
Madison Wagener (4634749)
Timothy J. Hohman (486551)
Publication venue
Publication date
Field of study

<div>Biomarker definitions for preclinical Alzheimer’s disease (AD) have identified individuals with neurodegeneration (ND+) without β-amyloidosis (Aβ-) and labeled them with suspected non-AD pathophysiology (SNAP). We evaluated Apolipoprotein E (APOE) ε2 and ε4 allele frequencies across biomarker definitions—Aβ-/ND- (n = 268), Aβ+/ND- (n = 236), Aβ-/ND+ or SNAP (n = 78), Aβ+/ND+ (n = 204)—hypothesizing that SNAP would have an APOE profile comparable to Aβ-/ND-. Using AD Neuroimaging Initiative data (n = 786, 72±7 years, 48% female), amyloid status (Aβ+ or Aβ-) was defined by cerebrospinal fluid (CSF) Aβ-42 levels, and neurodegeneration status (ND+ or ND-) was defined by hippocampal volume from MRI. Binary logistic regression related biomarker status to APOE ε2 and ε4 allele carrier status, adjusting for age, sex, education, and cognitive diagnosis. Compared to the biomarker negative (Aβ-/ND-) participants, higher proportions of ε4 and lower proportions of ε2 carriers were observed among Aβ+/ND- (ε4: OR = 6.23, p<0.001; ε2: OR = 0.53, p = 0.03) and Aβ+/ND+ participants (ε4: OR = 12.07, p<0.001; ε2: OR = 0.29, p = 0.004). SNAP participants were statistically comparable to biomarker negative participants (p-values>0.30). In supplemental analyses, comparable results were observed when coding SNAP using amyloid imaging and when using CSF tau levels. In contrast to APOE, a polygenic risk score for AD that excluded APOE did not show an association with amyloidosis or neurodegeneration (p-values>0.15), but did show an association with SNAP defined using CSF tau (β = 0.004, p = 0.02). Thus, in a population with low levels of cerebrovascular disease and a lower prevalence of SNAP than the general population, APOE and known genetic drivers of AD do not appear to contribute to the neurodegeneration observed in SNAP. Additional work in population based samples is needed to better elucidate the genetic contributors to various etiological drivers of SNAP.</div

FigShare

Associations between biomarker groups and APOE carrier status.

Author: Amy Oksol (4634746)
Angela L. Jefferson (822693)
Katherine A. Gifford (822689)
Logan Dumitrescu (169202)
Madison Wagener (4634749)
Timothy J. Hohman (486551)
Publication venue
Publication date
Field of study

Associations between biomarker groups and APOE carrier status.</p

FigShare

Sample characteristics.

Author: Amy Oksol (4634746)
Angela L. Jefferson (822693)
Katherine A. Gifford (822689)
Logan Dumitrescu (169202)
Madison Wagener (4634749)
Timothy J. Hohman (486551)
Publication venue
Publication date
Field of study

Sample characteristics.</p

FigShare

APOE genotypes across AD biomarker.

Author: Amy Oksol (4634746)
Angela L. Jefferson (822693)
Katherine A. Gifford (822689)
Logan Dumitrescu (169202)
Madison Wagener (4634749)
Timothy J. Hohman (486551)
Publication venue
Publication date
Field of study

Pie charts are presented by biomarker group based on amyloid status defined using levels of cerebrospinal fluid amyloid-β 42 (Aβ) and neurodegeneration defined using hippocampal volume (ND). Colors represent APOE genotype whereby gray represents homozygous ε3 allele carriers, shades of red represent ε4 allele carriers, shades of blue represent ε2 allele carriers, and purple is used to represent ε2/ε4 carriers. Sample sizes are presented below the segment label for each allele combination. Allele combinations that do not have any participants within a given biomarker group are labeled in light grey font.</p

FigShare