14,472 research outputs found
Comparing the writing style of real and artificial papers
Recent years have witnessed the increase of competition in science. While
promoting the quality of research in many cases, an intense competition among
scientists can also trigger unethical scientific behaviors. To increase the
total number of published papers, some authors even resort to software tools
that are able to produce grammatical, but meaningless scientific manuscripts.
Because automatically generated papers can be misunderstood as real papers, it
becomes of paramount importance to develop means to identify these scientific
frauds. In this paper, I devise a methodology to distinguish real manuscripts
from those generated with SCIGen, an automatic paper generator. Upon modeling
texts as complex networks (CN), it was possible to discriminate real from fake
papers with at least 89\% of accuracy. A systematic analysis of features
relevance revealed that the accessibility and betweenness were useful in
particular cases, even though the relevance depended upon the dataset. The
successful application of the methods described here show, as a proof of
principle, that network features can be used to identify scientific gibberish
papers. In addition, the CN-based approach can be combined in a straightforward
fashion with traditional statistical language processing methods to improve the
performance in identifying artificially generated papers.Comment: To appear in Scientometrics (2015
The HR-Firm Performance Relationship: Can it be in the Mind of the Beholder?
This study examined whether respondents’ implicit theories of performance could impact their responses to surveys regarding HR practices and effectiveness. Senior Human Resource and Line Executives and MBA, graduate Engineering, and graduate HR students read scenarios of high and low performing firms and were asked to report on the prevalence of various HR practices and effectiveness of the HR function in each firm. Results indicated that all four groups of respondents held implicit theories that high performing firms were characterized by extensive HR practices and had highly effective HR functions relative to low performing firms. Subjects with substantial work experience reported greater differences between and high and low performing firms than did subjects with relatively little work experience. The implications of these results for research on the HR Practices – Firm Performance relationship are discussed
Estimating variances and covariances for multivariate animal models by restricted maximum likelihood
Restricted Maximum Likelihood estimates of variance and covariance components can be obtained by direct maximization of the associated likelihood using standard, derivative-free optimization procedures. In general, this requires a multi-dimensional search and numerous evaluations of the (log) likelihood function. Use of this approach for analyses under an Animal Model has been described for the univariate case. This model includes animals' additive genetic merit as random e#ect and accounts for all relationships between animals. In addition, other random factors such as common environmental or maternal genetic e#ects can be fitted. This paper describes the extension to multivariate analyses, allowing for missing records. A numerical example is given and simplifications for specific models are discussed. Keywords : Variance components, Restricted Maximum Likelihood, Animal Model, additional random e#ects, derivative-free approach, multivariate analysis 1. Introduction In the statistic..
Intrapersonal positive future thinking predicts repeat suicide attempts in hospital-treated suicide attempters
Objective: Although there is clear evidence that low levels of positive future thinking (anticipation of positive experiences in the future) and hopelessness are associated with suicide risk, the relationship between the content of positive future thinking and suicidal behavior has yet to be investigated. This is the first study to determine whether the positive future thinking–suicide attempt relationship varies as a function of the content of the thoughts and whether positive future thinking predicts suicide attempts over time. Method: A total of 388 patients hospitalized following a suicide attempt completed a range of clinical and psychological measures (depression, hopelessness, suicidal ideation, suicidal intent and positive future thinking). Fifteen months later, a nationally linked database was used to determine who had been hospitalized again after a suicide attempt. Results: During follow-up, 25.6% of linked participants were readmitted to hospital following a suicide attempt. In univariate logistic regression analyses, previous suicide attempts, suicidal ideation, hopelessness, and depression—as well as low levels of achievement, low levels of financial positive future thoughts, and high levels of intrapersonal (thoughts about the individual and no one else) positive future thoughts predicted repeat suicide attempts. However, only previous suicide attempts, suicidal ideation, and high levels of intrapersonal positive future thinking were significant predictors in multivariate analyses. Discussion: Positive future thinking has predictive utility over time; however, the content of the thinking affects the direction and strength of the positive future thinking–suicidal behavior relationship. Future research is required to understand the mechanisms that link high levels of intrapersonal positive future thinking to suicide risk and how intrapersonal thinking should be targeted in treatment interventions
Modeling and forecasting electricity spot prices: A functional data perspective
Classical time series models have serious difficulties in modeling and
forecasting the enormous fluctuations of electricity spot prices. Markov regime
switch models belong to the most often used models in the electricity
literature. These models try to capture the fluctuations of electricity spot
prices by using different regimes, each with its own mean and covariance
structure. Usually one regime is dedicated to moderate prices and another is
dedicated to high prices. However, these models show poor performance and there
is no theoretical justification for this kind of classification. The merit
order model, the most important micro-economic pricing model for electricity
spot prices, however, suggests a continuum of mean levels with a functional
dependence on electricity demand. We propose a new statistical perspective on
modeling and forecasting electricity spot prices that accounts for the merit
order model. In a first step, the functional relation between electricity spot
prices and electricity demand is modeled by daily price-demand functions. In a
second step, we parameterize the series of daily price-demand functions using a
functional factor model. The power of this new perspective is demonstrated by a
forecast study that compares our functional factor model with two established
classical time series models as well as two alternative functional data models.Comment: Published in at http://dx.doi.org/10.1214/13-AOAS652 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Normal Reference Bandwidths for the General Order, Multivariate Kernel Density Derivative Estimator
This note derives the general form of the approximate mean integrated squared error for the q-variate, th-order kernel density r th derivative estimator. This formula allows for normal reference rule-of-thumb bandwidths to be derived. We give tables for some of the most common cases in the literature.Derivative Estimation, Smoothing, AMISE
Biomarkers of systemic inflammation and growth in early infancy are associated with stunting in young Tanzanian children
Stunting can afflict up to one-third of children in resource-constrained countries. We hypothesized that low-grade systemic inflammation (defined as elevations in serum C-reactive protein or alpha-1-acid glycoprotein) in infancy suppresses the growth hormone–insulin-like growth factor (IGF) axis and is associated with subsequent stunting. Blood samples of 590 children from periurban Dar es Salaam, Tanzania, were obtained at 6 weeks and 6 months of age as part of a randomized controlled trial. Primary outcomes were stunting, underweight, and wasting (defined as length-for-age, weight-for-age and weight-for-length z-scores < −2) between randomization and endline (18 months after randomization). Cox proportional hazards models were constructed to estimate hazard ratios (HRs) and corresponding 95% confidence intervals (CIs) of time to first stunting, underweight, and wasting as outcomes, with measures of systemic inflammation, insulin-like growth factor-1 (IGF-1) and insulin-like growth factor binding protein-3 (IGFBP-3) as exposures, adjusting for numerous demographic and clinical variables. The incidences of subsequent stunting, underweight, and wasting were 26%, 20%, and 18%, respectively. In multivariate analyses, systemic inflammation at 6 weeks of age was significantly associated with stunting (HR: 2.14, 95% CI: 1.23, 3.72; p = 0.002). Children with higher levels of IGF-1 at 6 weeks were less likely to become stunted (HR: 0.58, 95% CI: 0.37, 0.93; p for trend = 0.019); a similar trend was noted in children with higher levels of IGF-1 at 6 months of age (HR: 0.50, 95% CI: 0.22, 1.12; p for trend = 0.07). Systemic inflammation occurs as early as 6 weeks of age and is associated with the risk of future stunting among Tanzanian children.This research was funded by the National Institutes of Health (R01 HD048969, 2P30 DK040561, K24 DK104676-Dr. Duggan) and the Bill and Melinda Gates Foundation (OPP1066203-Dr. Duggan). (R01 HD048969 - National Institutes of Health; 2P30 DK040561 - National Institutes of Health; K24 DK104676 - National Institutes of Health; OPP1066203 - Bill and Melinda Gates Foundation)Accepted manuscrip
- …