3,039 research outputs found
Multi-Channel Stochastic Variational Inference for the Joint Analysis of Heterogeneous Biomedical Data in Alzheimer's Disease
The joint analysis of biomedical data in Alzheimer's Disease (AD) is
important for better clinical diagnosis and to understand the relationship
between biomarkers. However, jointly accounting for heterogeneous measures
poses important challenges related to the modeling of the variability and the
interpretability of the results. These issues are here addressed by proposing a
novel multi-channel stochastic generative model. We assume that a latent
variable generates the data observed through different channels (e.g., clinical
scores, imaging, ...) and describe an efficient way to estimate jointly the
distribution of both latent variable and data generative process. Experiments
on synthetic data show that the multi-channel formulation allows superior data
reconstruction as opposed to the single channel one. Moreover, the derived
lower bound of the model evidence represents a promising model selection
criterion. Experiments on AD data show that the model parameters can be used
for unsupervised patient stratification and for the joint interpretation of the
heterogeneous observations. Because of its general and flexible formulation, we
believe that the proposed method can find important applications as a general
data fusion technique.Comment: accepted for presentation at MLCN 2018 workshop, in Conjunction with
MICCAI 2018, September 20, Granada, Spai
Pyridine functionalized carbon nanotubes: unveiling the role of external pyridinic nitrogen sites for oxygen reduction reaction.
Pyridinic nitrogen has been recognized as the primary active site in nitrogen-doped carbon electrocatalysts for the oxygen reduction reaction (ORR), which is a critical process in many renewable energy devices. However, the preparation of nitrogen-doped carbon catalysts comprised of exclusively pyridinic nitrogen remains challenging, as well as understanding the precise ORR mechanisms on the catalyst. Herein, a novel process is developed using pyridyne reactive intermediates to functionalize carbon nanotubes (CNTs) exclusively with pyridine rings for ORR electrocatalysis. The relationship between the structure and ORR performance of the prepared materials is studied in combination with density functional theory calculations to probe the ORR mechanism on the catalyst. Pyridinic nitrogen can contribute to a more efficient 4-electron reaction pathway, while high level of pyridyne functionalization result in negative structural effects, such as poor electrical conductivity, reduced surface area, and small pore diameters, that suppressed the ORR performance. This study provides insights into pyridine-doped CNTs-functionalized for the first time via pyridyne intermediates-as applied in the ORR and is expected to serve as valuable inspiration in designing high-performance electrocatalysts for energy applications
Use-Exposure Relationships of Pesticides for Aquatic Risk Assessment
Field-scale environmental models have been widely used in aquatic exposure assessments of pesticides. Those models usually require a large set of input parameters and separate simulations for each pesticide in evaluation. In this study, a simple use-exposure relationship is developed based on regression analysis of stochastic simulation results generated from the Pesticide Root-Zone Model (PRZM). The developed mathematical relationship estimates edge-of-field peak concentrations of pesticides from aerobic soil metabolism half-life (AERO), organic carbon-normalized soil sorption coefficient (KOC), and application rate (RATE). In a case study of California crop scenarios, the relationships explained 90–95% of the variances in the peak concentrations of dissolved pesticides as predicted by PRZM simulations for a 30-year period. KOC was identified as the governing parameter in determining the relative magnitudes of pesticide exposures in a given crop scenario. The results of model application also indicated that the effects of chemical fate processes such as partitioning and degradation on pesticide exposure were similar among crop scenarios, while the cross-scenario variations were mainly associated with the landscape characteristics, such as organic carbon contents and curve numbers. With a minimum set of input data, the use-exposure relationships proposed in this study could be used in screening procedures for potential water quality impacts from the off-site movement of pesticides
Recommended from our members
Sources of uncertainty in modeled land carbon storage within and across three MIPs: Diagnosis with three new techniques
This is the final version. Available from the American Meteorological Society via the DOI in this recordTerrestrial carbon cycle models have incorporated increasingly more processes as a means to achieve more-realistic representations of ecosystem carbon cycling. Despite this, there are large across-model variations in the simulation and projection of carbon cycling. Several model intercomparison projects (MIPs), for example, the fifth phase of the Coupled Model Intercomparison Project (CMIP5) (historical simulations), Trends in Net Land-Atmosphere Carbon Exchange (TRENDY), and Multiscale Synthesis and Terrestrial Model Intercomparison Project (MsTMIP), have sought to understand intermodel differences. In this study, the authors developed a suite of new techniques to conduct post-MIP analysis to gain insights into uncertainty sources across 25 models in the three MIPs. First, terrestrial carbon storage dynamics were characterized by a three-dimensional (3D) model output space with coordinates of carbon residence time, net primary productivity (NPP), and carbon storage potential. The latter represents the potential of an ecosystem to lose or gain carbon. This space can be used to measure how and why model output differs. Models with a nitrogen cycle generally exhibit lower annual NPP in comparison with other models, and mostly negative carbon storage potential. Second, a transient traceability framework was used to decompose any given carbon cycle model into traceable components and identify the sources of model differences. The carbon residence time (or NPP) was traced to baseline carbon residence time (or baseline NPP related to the maximum carbon input), environmental scalars, and climate forcing. Third, by applying a variance decomposition method, the authors show that the intermodel differences in carbon storage can be mainly attributed to the baseline carbon residence time and baseline NPP (>90% in the three MIPs). The three techniques developed in this study offer a novel approach to gain more insight from existing MIPs and can point out directions for future MIPs. Since this study is conducted at the global scale for an overview on intermodel differences, future studies should focus more on regional analysis to identify the sources of uncertainties and improve models at the specified mechanism level.This paper is financially supported by the Research and Development Special Fund for Public Welfare Industry of the Ministry of Water Research in China (201501028). JBF and CRS were supported in part by NASA’s Carbon Cycle Science program. JBF was also supported in part by NASA’s Terrestrial Ecology and Carbon Monitoring System programs. JT acknowledges RCN funded project EVA (229771) and BCCR-BIGCHANGE
Recommended from our members
Sources of uncertainty in modeled land carbon storage within and across three MIPs: Diagnosis with three new techniques
This is the final version. Available from the American Meteorological Society via the DOI in this recordTerrestrial carbon cycle models have incorporated increasingly more processes as a means to achieve more-realistic representations of ecosystem carbon cycling. Despite this, there are large across-model variations in the simulation and projection of carbon cycling. Several model intercomparison projects (MIPs), for example, the fifth phase of the Coupled Model Intercomparison Project (CMIP5) (historical simulations), Trends in Net Land-Atmosphere Carbon Exchange (TRENDY), and Multiscale Synthesis and Terrestrial Model Intercomparison Project (MsTMIP), have sought to understand intermodel differences. In this study, the authors developed a suite of new techniques to conduct post-MIP analysis to gain insights into uncertainty sources across 25 models in the three MIPs. First, terrestrial carbon storage dynamics were characterized by a three-dimensional (3D) model output space with coordinates of carbon residence time, net primary productivity (NPP), and carbon storage potential. The latter represents the potential of an ecosystem to lose or gain carbon. This space can be used to measure how and why model output differs. Models with a nitrogen cycle generally exhibit lower annual NPP in comparison with other models, and mostly negative carbon storage potential. Second, a transient traceability framework was used to decompose any given carbon cycle model into traceable components and identify the sources of model differences. The carbon residence time (or NPP) was traced to baseline carbon residence time (or baseline NPP related to the maximum carbon input), environmental scalars, and climate forcing. Third, by applying a variance decomposition method, the authors show that the intermodel differences in carbon storage can be mainly attributed to the baseline carbon residence time and baseline NPP (>90% in the three MIPs). The three techniques developed in this study offer a novel approach to gain more insight from existing MIPs and can point out directions for future MIPs. Since this study is conducted at the global scale for an overview on intermodel differences, future studies should focus more on regional analysis to identify the sources of uncertainties and improve models at the specified mechanism level.This paper is financially supported by the Research and Development Special Fund for Public Welfare Industry of the Ministry of Water Research in China (201501028). JBF and CRS were supported in part by NASA’s Carbon Cycle Science program. JBF was also supported in part by NASA’s Terrestrial Ecology and Carbon Monitoring System programs. JT acknowledges RCN funded project EVA (229771) and BCCR-BIGCHANGE
NMR studies of p7 protein from hepatitis C virus
The p7 protein of hepatitis C virus (HCV) plays an important role in the viral lifecycle. Like other members of the viroporin family of small membrane proteins, the amino acid sequence of p7 is largely conserved over the entire range of genotypes, and it forms ion channels that can be blocked by a number of established channel-blocking compounds. Its characteristics as a membrane protein make it difficult to study by most structural techniques, since it requires the presence of lipids to fold and function properly. Purified p7 can be incorporated into phospholipid bilayers and micelles. Initial solid-state nuclear magnetic resonance (NMR) studies of p7 in 14-O-PC/6-O-PC bicelles indicate that the protein contains helical segments that are tilted approximately 10° and 25° relative to the bilayer normal. A truncated construct corresponding to the second transmembrane domain of p7 is shown to have properties similar to those of the full-length protein, and was used to determine that the helix segment tilted at 10° is in the C-terminal portion of the protein. The addition of the channel blocker amantadine to the full-length protein resulted in selective chemical shift changes, demonstrating that NMR has a potential role in the development of drugs targeted to p7
Beyond Volume: The Impact of Complex Healthcare Data on the Machine Learning Pipeline
From medical charts to national census, healthcare has traditionally operated
under a paper-based paradigm. However, the past decade has marked a long and
arduous transformation bringing healthcare into the digital age. Ranging from
electronic health records, to digitized imaging and laboratory reports, to
public health datasets, today, healthcare now generates an incredible amount of
digital information. Such a wealth of data presents an exciting opportunity for
integrated machine learning solutions to address problems across multiple
facets of healthcare practice and administration. Unfortunately, the ability to
derive accurate and informative insights requires more than the ability to
execute machine learning models. Rather, a deeper understanding of the data on
which the models are run is imperative for their success. While a significant
effort has been undertaken to develop models able to process the volume of data
obtained during the analysis of millions of digitalized patient records, it is
important to remember that volume represents only one aspect of the data. In
fact, drawing on data from an increasingly diverse set of sources, healthcare
data presents an incredibly complex set of attributes that must be accounted
for throughout the machine learning pipeline. This chapter focuses on
highlighting such challenges, and is broken down into three distinct
components, each representing a phase of the pipeline. We begin with attributes
of the data accounted for during preprocessing, then move to considerations
during model building, and end with challenges to the interpretation of model
output. For each component, we present a discussion around data as it relates
to the healthcare domain and offer insight into the challenges each may impose
on the efficiency of machine learning techniques.Comment: Healthcare Informatics, Machine Learning, Knowledge Discovery: 20
Pages, 1 Figur
How large should whales be?
The evolution and distribution of species body sizes for terrestrial mammals
is well-explained by a macroevolutionary tradeoff between short-term selective
advantages and long-term extinction risks from increased species body size,
unfolding above the 2g minimum size induced by thermoregulation in air. Here,
we consider whether this same tradeoff, formalized as a constrained
convection-reaction-diffusion system, can also explain the sizes of fully
aquatic mammals, which have not previously been considered. By replacing the
terrestrial minimum with a pelagic one, at roughly 7000g, the terrestrial
mammal tradeoff model accurately predicts, with no tunable parameters, the
observed body masses of all extant cetacean species, including the 175,000,000g
Blue Whale. This strong agreement between theory and data suggests that a
universal macroevolutionary tradeoff governs body size evolution for all
mammals, regardless of their habitat. The dramatic sizes of cetaceans can thus
be attributed mainly to the increased convective heat loss is water, which
shifts the species size distribution upward and pushes its right tail into
ranges inaccessible to terrestrial mammals. Under this macroevolutionary
tradeoff, the largest expected species occurs where the rate at which
smaller-bodied species move up into large-bodied niches approximately equals
the rate at which extinction removes them.Comment: 7 pages, 3 figures, 2 data table
- …