14,472 research outputs found

    Comparing the writing style of real and artificial papers

    Full text link
    Recent years have witnessed the increase of competition in science. While promoting the quality of research in many cases, an intense competition among scientists can also trigger unethical scientific behaviors. To increase the total number of published papers, some authors even resort to software tools that are able to produce grammatical, but meaningless scientific manuscripts. Because automatically generated papers can be misunderstood as real papers, it becomes of paramount importance to develop means to identify these scientific frauds. In this paper, I devise a methodology to distinguish real manuscripts from those generated with SCIGen, an automatic paper generator. Upon modeling texts as complex networks (CN), it was possible to discriminate real from fake papers with at least 89\% of accuracy. A systematic analysis of features relevance revealed that the accessibility and betweenness were useful in particular cases, even though the relevance depended upon the dataset. The successful application of the methods described here show, as a proof of principle, that network features can be used to identify scientific gibberish papers. In addition, the CN-based approach can be combined in a straightforward fashion with traditional statistical language processing methods to improve the performance in identifying artificially generated papers.Comment: To appear in Scientometrics (2015

    The HR-Firm Performance Relationship: Can it be in the Mind of the Beholder?

    Get PDF
    This study examined whether respondents’ implicit theories of performance could impact their responses to surveys regarding HR practices and effectiveness. Senior Human Resource and Line Executives and MBA, graduate Engineering, and graduate HR students read scenarios of high and low performing firms and were asked to report on the prevalence of various HR practices and effectiveness of the HR function in each firm. Results indicated that all four groups of respondents held implicit theories that high performing firms were characterized by extensive HR practices and had highly effective HR functions relative to low performing firms. Subjects with substantial work experience reported greater differences between and high and low performing firms than did subjects with relatively little work experience. The implications of these results for research on the HR Practices – Firm Performance relationship are discussed

    Estimating variances and covariances for multivariate animal models by restricted maximum likelihood

    Get PDF
    Restricted Maximum Likelihood estimates of variance and covariance components can be obtained by direct maximization of the associated likelihood using standard, derivative-free optimization procedures. In general, this requires a multi-dimensional search and numerous evaluations of the (log) likelihood function. Use of this approach for analyses under an Animal Model has been described for the univariate case. This model includes animals' additive genetic merit as random e#ect and accounts for all relationships between animals. In addition, other random factors such as common environmental or maternal genetic e#ects can be fitted. This paper describes the extension to multivariate analyses, allowing for missing records. A numerical example is given and simplifications for specific models are discussed. Keywords : Variance components, Restricted Maximum Likelihood, Animal Model, additional random e#ects, derivative-free approach, multivariate analysis 1. Introduction In the statistic..

    Intrapersonal positive future thinking predicts repeat suicide attempts in hospital-treated suicide attempters

    Get PDF
    Objective: Although there is clear evidence that low levels of positive future thinking (anticipation of positive experiences in the future) and hopelessness are associated with suicide risk, the relationship between the content of positive future thinking and suicidal behavior has yet to be investigated. This is the first study to determine whether the positive future thinking–suicide attempt relationship varies as a function of the content of the thoughts and whether positive future thinking predicts suicide attempts over time. Method: A total of 388 patients hospitalized following a suicide attempt completed a range of clinical and psychological measures (depression, hopelessness, suicidal ideation, suicidal intent and positive future thinking). Fifteen months later, a nationally linked database was used to determine who had been hospitalized again after a suicide attempt. Results: During follow-up, 25.6% of linked participants were readmitted to hospital following a suicide attempt. In univariate logistic regression analyses, previous suicide attempts, suicidal ideation, hopelessness, and depression—as well as low levels of achievement, low levels of financial positive future thoughts, and high levels of intrapersonal (thoughts about the individual and no one else) positive future thoughts predicted repeat suicide attempts. However, only previous suicide attempts, suicidal ideation, and high levels of intrapersonal positive future thinking were significant predictors in multivariate analyses. Discussion: Positive future thinking has predictive utility over time; however, the content of the thinking affects the direction and strength of the positive future thinking–suicidal behavior relationship. Future research is required to understand the mechanisms that link high levels of intrapersonal positive future thinking to suicide risk and how intrapersonal thinking should be targeted in treatment interventions

    Modeling and forecasting electricity spot prices: A functional data perspective

    Get PDF
    Classical time series models have serious difficulties in modeling and forecasting the enormous fluctuations of electricity spot prices. Markov regime switch models belong to the most often used models in the electricity literature. These models try to capture the fluctuations of electricity spot prices by using different regimes, each with its own mean and covariance structure. Usually one regime is dedicated to moderate prices and another is dedicated to high prices. However, these models show poor performance and there is no theoretical justification for this kind of classification. The merit order model, the most important micro-economic pricing model for electricity spot prices, however, suggests a continuum of mean levels with a functional dependence on electricity demand. We propose a new statistical perspective on modeling and forecasting electricity spot prices that accounts for the merit order model. In a first step, the functional relation between electricity spot prices and electricity demand is modeled by daily price-demand functions. In a second step, we parameterize the series of daily price-demand functions using a functional factor model. The power of this new perspective is demonstrated by a forecast study that compares our functional factor model with two established classical time series models as well as two alternative functional data models.Comment: Published in at http://dx.doi.org/10.1214/13-AOAS652 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Normal Reference Bandwidths for the General Order, Multivariate Kernel Density Derivative Estimator

    Get PDF
    This note derives the general form of the approximate mean integrated squared error for the q-variate, th-order kernel density r th derivative estimator. This formula allows for normal reference rule-of-thumb bandwidths to be derived. We give tables for some of the most common cases in the literature.Derivative Estimation, Smoothing, AMISE

    Biomarkers of systemic inflammation and growth in early infancy are associated with stunting in young Tanzanian children

    Get PDF
    Stunting can afflict up to one-third of children in resource-constrained countries. We hypothesized that low-grade systemic inflammation (defined as elevations in serum C-reactive protein or alpha-1-acid glycoprotein) in infancy suppresses the growth hormone–insulin-like growth factor (IGF) axis and is associated with subsequent stunting. Blood samples of 590 children from periurban Dar es Salaam, Tanzania, were obtained at 6 weeks and 6 months of age as part of a randomized controlled trial. Primary outcomes were stunting, underweight, and wasting (defined as length-for-age, weight-for-age and weight-for-length z-scores < −2) between randomization and endline (18 months after randomization). Cox proportional hazards models were constructed to estimate hazard ratios (HRs) and corresponding 95% confidence intervals (CIs) of time to first stunting, underweight, and wasting as outcomes, with measures of systemic inflammation, insulin-like growth factor-1 (IGF-1) and insulin-like growth factor binding protein-3 (IGFBP-3) as exposures, adjusting for numerous demographic and clinical variables. The incidences of subsequent stunting, underweight, and wasting were 26%, 20%, and 18%, respectively. In multivariate analyses, systemic inflammation at 6 weeks of age was significantly associated with stunting (HR: 2.14, 95% CI: 1.23, 3.72; p = 0.002). Children with higher levels of IGF-1 at 6 weeks were less likely to become stunted (HR: 0.58, 95% CI: 0.37, 0.93; p for trend = 0.019); a similar trend was noted in children with higher levels of IGF-1 at 6 months of age (HR: 0.50, 95% CI: 0.22, 1.12; p for trend = 0.07). Systemic inflammation occurs as early as 6 weeks of age and is associated with the risk of future stunting among Tanzanian children.This research was funded by the National Institutes of Health (R01 HD048969, 2P30 DK040561, K24 DK104676-Dr. Duggan) and the Bill and Melinda Gates Foundation (OPP1066203-Dr. Duggan). (R01 HD048969 - National Institutes of Health; 2P30 DK040561 - National Institutes of Health; K24 DK104676 - National Institutes of Health; OPP1066203 - Bill and Melinda Gates Foundation)Accepted manuscrip
    • …
    corecore