253 research outputs found

Alien Registration- Furlotte, Alfred (Medway, Penobscot County)

Author: Furlotte Alfred
Publication venue: Digital Maine
Publication date: 01/01/1940

Maine State Library

Maine State Documents (Maine State Library)

Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models.

Author: Bilow Michael
Eskin Eleazar
Furlotte Nick
He Dan
Kostem Emrah
Sul Jae Hoon
Yang Wen-Yun
Publication venue: eScholarship, University of California
Publication date: 01/03/2016

Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied, nor have methods been designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants.

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FigShare

Quantifying the uncertainty in heritability

Author: Furlotte N.A.
Heckerman D.
Lippert C.
Publication venue: Springer Science and Business Media LLC
Publication date: 27/03/2014

The use of mixed models to determine narrow-sense heritability and related quantities such as SNP heritability has received much recent attention. Less attention has been paid to the inherent variability in these estimates. One approach for quantifying variability in estimates of heritability is a frequentist approach, in which heritability is estimated using maximum likelihood and its variance is quantified through an asymptotic normal approximation. An alternative approach is to quantify the uncertainty in heritability through its Bayesian posterior distribution. In this paper, we develop the latter approach, make it computationally efficient and compare it to the frequentist approach. We show theoretically that, for a sufficiently large sample size and intermediate values of heritability, the two approaches provide similar results. Using the Atherosclerosis Risk in Communities cohort, we show empirically that the two approaches can give different results and that the variance/uncertainty can remain large.

PubMed Central

MDC Repository

Multimodal LLMs for health grounded in individual-specific data

Author: Belyaeva Anastasiya
Carroll Andrew
Corrado Greg
Cosentino Justin
Eswaran Krish
Furlotte Nicholas A.
Hormozdiari Farhad
McLean Cory Y.
Shetty Shravya
Publication date: 20/07/2023

Foundation large language models (LLMs) have shown an impressive ability to solve tasks across a wide range of fields including health. To effectively solve personalized health tasks, LLMs need the ability to ingest a diversity of data modalities that are relevant to an individual's health status. In this paper, we take a step towards creating multimodal LLMs for health that are grounded in individual-specific data by developing a framework (HeLM: Health Large Language Model for Multimodal Understanding) that enables LLMs to use high-dimensional clinical modalities to estimate underlying disease risk. HeLM encodes complex data modalities by learning an encoder that maps them into the LLM's token embedding space and for simple modalities like tabular data by serializing the data into text. Using data from the UK Biobank, we show that HeLM can effectively use demographic and clinical features in addition to high-dimensional time-series data to estimate disease risk. For example, HeLM achieves an AUROC of 0.75 for asthma prediction when combining tabular and spirogram data modalities compared with 0.49 when only using tabular data. Overall, we find that HeLM outperforms or performs at parity with classical machine learning approaches across a selection of eight binary traits. Furthermore, we investigate the downstream uses of this model such as its generalizability to out-of-distribution traits and its ability to power conversations around individual health and wellness.

arXiv.org e-Print Archive

Meta-Analysis Identifies Gene-by-Environment Interactions as Demonstrated in a Study of 4,965 Mice

Author: Davis Richard C.
Eskin Eleazar
Furlotte Nicholas
Han Buhm
Joo Jong Wha J.
Kang Eun Yong
Lusis Aldons J.
Shih Diana
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2014

Identifying environmentally-specific genetic effects is a key challenge in understanding the structure of complex traits. Model organisms play a crucial role in the identification of such gene-by-environment interactions, as a result of the unique ability to observe genetically similar individuals across multiple distinct environments. Many model organism studies examine the same traits but under varying environmental conditions. For example, knock-out or diet-controlled studies are often used to examine cholesterol in mice. These studies, when examined in aggregate, provide an opportunity to identify genomic loci exhibiting environmentally-dependent effects. However, the straightforward application of traditional methodologies to aggregate separate studies suffers from several problems. First, environmental conditions are often variable and do not fit the standard univariate model for interactions. Additionally, applying a multivariate model results in increased degrees of freedom and low statistical power. In this paper, we jointly analyze multiple studies with varying environmental conditions using a meta-analytic approach based on a random effects model to identify loci involved in gene-by-environment interactions. Our approach is motivated by the observation that methods for discovering gene-by-environment interactions are closely related to random effects models for meta-analysis. We show that interactions can be interpreted as heterogeneity and can be detected without utilizing the traditional uni- or multi-variate approaches for discovery of gene-by-environment interactions. We apply our new method to combine 17 mouse studies containing in aggregate 4,965 distinct animals. We identify 26 significant loci involved in High-density lipoprotein (HDL) cholesterol, many of which are consistent with previous findings. Several of these loci show significant evidence of involvement in gene-by-environment interactions. 
An additional advantage of our meta-analysis approach is that our combined study has significantly higher power and improved resolution compared to any single study, thus explaining the large number of loci discovered in the combined study.

Crossref

SNU Open Repository and Archive

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FigShare

R/qtl2: Software for Mapping Quantitative Trait Loci with High-Dimensional Data and Multiparent Populations.

Author: Brian S. Yandell
Daniel M. Gatti
Gary A. Churchill
Karl W. Broman
Nicholas A. Furlotte
Petr Simecek
Pjotr Prins
Śaunak Sen
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/02/2019

R/qtl2 is an interactive software environment for mapping quantitative trait loci (QTL) in experimental populations. The R/qtl2 software expands the scope of the widely used R/qtl software package to include multiparent populations derived from more than two founder strains, such as the Collaborative Cross and Diversity Outbred mice, heterogeneous stocks, and MAGIC plant populations. R/qtl2 is designed to handle modern high-density genotyping data and high-dimensional molecular phenotypes, including gene expression and proteomics. R/qtl2 includes the ability to perform genome scans using a linear mixed model to account for population structure, and also includes features to impute SNPs based on founder strain genomes and to carry out association mapping. The R/qtl2 software provides all of the basic features needed for QTL mapping, including graphical displays and summary reports, and it can be extended through the creation of add-on packages. R/qtl2, which is free and open source software written in the R and C++ programming languages, comes with a test framework.

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

biMM: efficient estimation of genetic variances and covariances for cohorts with high-dimensional phenotype measurements

Author: Bulik-Sullivan
Casale
Christian Benner
Furlotte
International HapMap3 consortium
Loh
Manuel A Rivas
Marjo-Riitta Järvelin
Matti Pirinen
Oliver Stegle
Pekka Marttinen
Rantakallio
Samuli Ripatti
Su
Tukiainen
Yang
Zhou
Publication date: 15/11/2016

Genetic research utilizes a decomposition of trait variances and covariances into genetic and environmental parts. Our software package biMM is a computationally efficient implementation of a bivariate linear mixed model for settings where hundreds of traits have been measured on partially overlapping sets of individuals. Peer reviewed.

Crossref

Aaltodoc Publication Archive

Spiral - Imperial College Digital Repository

Helsingin yliopiston digitaalinen arkisto

Functional Cohesion of Gene Sets Determined by Latent Semantic Indexing of PubMed Abstracts

Author: A Subramanian
AP Oron
B Zhang
B Zheng
CA Joslyn
CMaS Manning
D Martin
D Nam
Ebenezer O. George
F Al-Shahrour
G Yona
HH van Haagen
IB Jeffery
JD Storey
JD Wren
Kevin Heinrich
L Wei
Lijing Xu
M Ashburner
M Chagoyen
M Schuemie
Michael W. Berry
MS Pepe
MW Berry
Nicholas Furlotte
P Minguez
R Homayouni
R Jelier
Ramin Homayouni
Ramy K. Aziz
S Chiaretti
S Raychaudhuri
SG Lee
TK Landauer
TM Kim
VK Mootha
W Pan
Y Pawitan
Yunyue Lin
Z Jiang
Publication venue: Public Library of Science
Publication date: 01/01/2011

High-throughput genomic technologies enable researchers to identify genes that are co-regulated with respect to specific experimental conditions. Numerous statistical approaches have been developed to identify differentially expressed genes. Because each approach can produce distinct gene sets, it is difficult for biologists to determine which statistical approach yields biologically relevant gene sets and is appropriate for their study. To address this issue, we implemented Latent Semantic Indexing (LSI) to determine the functional coherence of gene sets. An LSI model was built using over 1 million Medline abstracts for over 20,000 mouse and human genes annotated in Entrez Gene. The gene-to-gene LSI-derived similarities were used to calculate a literature cohesion p-value (LPv) for a given gene set using a Fisher's exact test. We tested this method against genes in more than 6,000 functional pathways annotated in Gene Ontology (GO) and found that approximately 75% of gene sets in the GO biological process category and 90% of the gene sets in the GO molecular function and cellular component categories were functionally cohesive (LPv<0.05). These results indicate that the LPv methodology is both robust and accurate. Application of this method to previously published microarray datasets demonstrated that LPv can be helpful in selecting the appropriate feature extraction methods. To enable real-time calculation of LPv for mouse or human gene sets, we developed a web tool called Gene-set Cohesion Analysis Tool (GCAT). GCAT can complement other gene set enrichment approaches by determining the overall functional cohesion of data sets, taking into account both explicit and implicit gene interactions reported in the biomedical literature.

University of Memphis Digital Commons

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Identification and Functional Validation of the Novel Antimalarial Resistance Locus PF10_0355 in Plasmodium falciparum

Author: Angelino Elaine
Barnes Kayla G.
Becker Justin S.
Birren Bruce W.
Cortese Joseph F.
Daniels Rachel F.
Eskin Eleazar
Furlotte Nicholas A.
Grossman Sharon Rachel
Happi Christian
Hartl Daniel L.
Johnson Charles A.
Kang Hyun Min
Karlsson Elinor K.
Lander Eric S.
Lukens Amanda K.
Mboup Souleymane
Milner Danny A.
Ndiaye Daouda
Neafsey Daniel E.
Park Daniel J.
Rosen David M.
Sabeti Pardis C.
Sarr Ousmane
Schaffner Stephen F.
Shlyakhter Ilya
Van Tyne Daria
Volkman Sarah K.
Wiegand Roger C.
Wirth Dyann F.
Yamins Daniel
Publication venue: Public Library of Science (PLoS)
Publication date: 01/09/2010

The Plasmodium falciparum parasite's ability to adapt to environmental pressures, such as the human immune system and antimalarial drugs, makes malaria an enduring burden to public health. Understanding the genetic basis of these adaptations is critical to intervening successfully against malaria. To that end, we created a high-density genotyping array that assays over 17,000 single nucleotide polymorphisms (~1 SNP/kb), and applied it to 57 culture-adapted parasites from three continents. We characterized genome-wide genetic diversity within and between populations and identified numerous loci with signals of natural selection, suggesting their role in recent adaptation. In addition, we performed a genome-wide association study (GWAS), searching for loci correlated with resistance to thirteen antimalarials; we detected both known and novel resistance loci, including a new halofantrine resistance locus, PF10_0355. Through functional testing we demonstrated that PF10_0355 overexpression decreases sensitivity to halofantrine, mefloquine, and lumefantrine, but not to structurally unrelated antimalarials, and that increased gene copy number mediates resistance. Our GWAS and follow-on functional validation demonstrate the potential of genome-wide studies to elucidate functionally important loci in the malaria parasite genome. Funding: Bill & Melinda Gates Foundation; Ellison Medical Foundation; Exxon Mobil Foundation; Fogarty International Center; National Institute of Allergy and Infectious Diseases (U.S.); Burroughs Wellcome Fund; David & Lucile Packard Foundation; National Science Foundation (U.S.), Graduate Research Fellowship Program.

CiteSeerX

DSpace@MIT

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

eScholarship - University of California