1,395 research outputs found
Modeling association between DNA copy number and gene expression with constrained piecewise linear regression splines
DNA copy number and mRNA expression are widely used data types in cancer
studies, which combined provide more insight than separately. Whereas in
existing literature the form of the relationship between these two types of
markers is fixed a priori, in this paper we model their association. We employ
piecewise linear regression splines (PLRS), which combine good interpretation
with sufficient flexibility to identify any plausible type of relationship. The
specification of the model leads to estimation and model selection in a
constrained, nonstandard setting. We provide methodology for testing the effect
of DNA on mRNA and choosing the appropriate model. Furthermore, we present a
novel approach to obtain reliable confidence bands for constrained PLRS, which
incorporates model uncertainty. The procedures are applied to colorectal and
breast cancer data. Common assumptions are found to be potentially misleading
for biologically relevant genes. More flexible models may bring more insight in
the interaction between the two markers.Comment: Published in at http://dx.doi.org/10.1214/12-AOAS605 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Segregation of large particles in dense granular flows: A granular Saffman effect?
We report on the scaling between the lift force and the velocity lag
experienced by a single particle of different size in a monodisperse dense
granular chute flow. The similarity of this scaling to the Saffman lift force
in (micro) fluids, suggests an inertial origin for the lift force responsible
for segregation of (isolated, large) intruders in dense granular flows. We also
observe an anisotropic pressure/stress field surrounding the particle, which
potentially lies at the origin of the velocity lag. These findings are relevant
for modelling and theoretical predictions of particle-size segregation. At the
same time, the suggested interplay between polydispersity and inertial effects
in dense granular flows with stress- and strain-gradients, implies striking new
parallels between fluids, suspensions and granular flows with wide application
perspectives
Adverse effects of personalized automated feedback
In large classes with hundreds of students, it is rarely feasible to provide students with individual feedback on their performance. Automatically generated personalized feedback on students' performance might help to overcome this issue, but available empirical effect studies are inconclusive due to lack of methodological rigor. This study uses a repetitive randomized control experiment to explore whether automatically generated feedback is effective and for which students. Our results indicate that feedback does not have a positive effect on performance for all students. Some groups benefit from receiving personalized feedback, while others do not perform better than the control group. Students that perform average benefit most from receiving personalized feedback. However, lower-scoring students who received feedback tend to have lower attrition rates and if they participate at the final exam, their performance is not higher than the control group. Therefore, providing automated feedback is not something that should be undertaken mindlessly.</p
Erfassung von Meilerpodien in Lidar-Daten: von der manuellen Kartierung über Crowdsourcing zum maschinellen Lernen
Archaeological science
Optimization Under Uncertainty Using the Generalized Inverse Distribution Function
A framework for robust optimization under uncertainty based on the use of the
generalized inverse distribution function (GIDF), also called quantile
function, is here proposed. Compared to more classical approaches that rely on
the usage of statistical moments as deterministic attributes that define the
objectives of the optimization process, the inverse cumulative distribution
function allows for the use of all the possible information available in the
probabilistic domain. Furthermore, the use of a quantile based approach leads
naturally to a multi-objective methodology which allows an a-posteriori
selection of the candidate design based on risk/opportunity criteria defined by
the designer. Finally, the error on the estimation of the objectives due to the
resolution of the GIDF will be proven to be quantifiableComment: 20 pages, 25 figure
Sequential Data-Adaptive Bandwidth Selection by Cross-Validation for Nonparametric Prediction
We consider the problem of bandwidth selection by cross-validation from a
sequential point of view in a nonparametric regression model. Having in mind
that in applications one often aims at estimation, prediction and change
detection simultaneously, we investigate that approach for sequential kernel
smoothers in order to base these tasks on a single statistic. We provide
uniform weak laws of large numbers and weak consistency results for the
cross-validated bandwidth. Extensions to weakly dependent error terms are
discussed as well. The errors may be {\alpha}-mixing or L2-near epoch
dependent, which guarantees that the uniform convergence of the cross
validation sum and the consistency of the cross-validated bandwidth hold true
for a large class of time series. The method is illustrated by analyzing
photovoltaic data.Comment: 26 page
Guided online self-management interventions in primary care: a survey on use, facilitators, and barriers
BACKGROUND\nGuided online psychological self-management interventions offer broad prospects for the treatment of people with mild to moderate mental health problems, but implementation is challenging. The aims of this study are (1) to gain insight into use of and intention to use these interventions among primary care health professionals, (2) to determine the main barriers to use such interventions among non-users.\nMETHODS\nAn online survey based on the Unified Theory of Acceptance and Use of Technology (UTAUT) was disseminated among mental health counsellors (MHCs; in Dutch POHs) in GP practices and primary care psychologists (PCP) in mental health care practices. The survey covered the current use of online interventions, the intention to use these in the future, and an operationalization of the UTAUT concepts: performance expectancy, effort expectancy, social influence, and facilitating conditions.\nRESULTS\nIn total, 481 MHCs and 290 PCPs responded (24Â %). Of them, 49Â % of MHCs and 21Â % of PCPs currently use online interventions in their treatments. A further 40Â % of MHCs and 27Â % of PCPs plan to introduce such interventions within the next year. Both groups were moderately positive about the presence of eHealth facilitators in their daily practice. Among current non-users, performance expectancy and facilitating conditions were significant predictors of usage intention in both groups of health professionals.\nCONCLUSIONS\nUse of and intention to use online interventions is relatively high in Dutch primary care. Non-users, particularly, experience several barriers which need attention to enhance implementation. There is a need for further efforts regarding facilitation of and education on eHealth, as well as for research directed to its normalization in daily practice.FSW - Self-regulation models for health behavior and psychopathology - ou
Semiparametric theory and empirical processes in causal inference
In this paper we review important aspects of semiparametric theory and
empirical processes that arise in causal inference problems. We begin with a
brief introduction to the general problem of causal inference, and go on to
discuss estimation and inference for causal effects under semiparametric
models, which allow parts of the data-generating process to be unrestricted if
they are not of particular interest (i.e., nuisance functions). These models
are very useful in causal problems because the outcome process is often complex
and difficult to model, and there may only be information available about the
treatment process (at best). Semiparametric theory gives a framework for
benchmarking efficiency and constructing estimators in such settings. In the
second part of the paper we discuss empirical process theory, which provides
powerful tools for understanding the asymptotic behavior of semiparametric
estimators that depend on flexible nonparametric estimators of nuisance
functions. These tools are crucial for incorporating machine learning and other
modern methods into causal inference analyses. We conclude by examining related
extensions and future directions for work in semiparametric causal inference
- …