553 research outputs found

    Software Defect Association Mining and Defect Correction Effort Prediction

    Get PDF
    Much current software defect prediction work concentrates on the number of defects remaining in software system. In this paper, we present association rule mining based methods to predict defect associations and defect-correction effort. This is to help developers detect software defects and assist project managers in allocating testing resources more effectively. We applied the proposed methods to the SEL defect data consisting of more than 200 projects over more than 15 years. The results show that for the defect association prediction, the accuracy is very high and the false negative rate is very low. Likewise for the defect-correction effort prediction, the accuracy for both defect isolation effort prediction and defect correction effort prediction are also high. We compared the defect-correction effort prediction method with other types of methods: PART, C4.5, and Na¨ıve Bayes and show that accuracy has been improved by at least 23%. We also evaluated the impact of support and confidence levels on prediction accuracy, false negative rate, false positive rate, and the number of rules. We found that higher support and confidence levels may not result in higher prediction accuracy, and a sufficient number of rules is a precondition for high prediction accuracy

    Data quality: Some comments on the NASA software defect datasets

    Get PDF
    Background-Self-evidently empirical analyses rely upon the quality of their data. Likewise, replications rely upon accurate reporting and using the same rather than similar versions of datasets. In recent years, there has been much interest in using machine learners to classify software modules into defect-prone and not defect-prone categories. The publicly available NASA datasets have been extensively used as part of this research. Objective-This short note investigates the extent to which published analyses based on the NASA defect datasets are meaningful and comparable. Method-We analyze the five studies published in the IEEE Transactions on Software Engineering since 2007 that have utilized these datasets and compare the two versions of the datasets currently in use. Results-We find important differences between the two versions of the datasets, implausible values in one dataset and generally insufficient detail documented on dataset preprocessing. Conclusions-It is recommended that researchers 1) indicate the provenance of the datasets they use, 2) report any preprocessing in sufficient detail to enable meaningful replication, and 3) invest effort in understanding the data prior to applying machine learners

    Orbital Angular Momentum Parton Distributions in Light-Front Dynamics

    Full text link
    We study the quark angular momentum distribution in the nucleon within a light-front covariant quark model. Special emphasis is put into the orbital angular momentum: a quantity which is very sensitive to the relativistic treatment of the spin in a light-front dynamical approach. Discrepancies with the predictions of the low-energy traditional quark models where relativistic spin effects are neglected, are visible also after perturbative evolution to higher momentum scales. Orbital angular momentum distributions and their contribution to the spin sum rule are calculated for different phenomenological mass operators and compared with the results of the MIT bag model.Comment: 14 pages; latex; 3 ps figure

    Transition of plasmodium sporozoites into liver stage-like forms is regulated by the RNA binding protein pumilio

    Get PDF
    Many eukaryotic developmental and cell fate decisions that are effected post-transcriptionally involve RNA binding proteins as regulators of translation of key mRNAs. In malaria parasites (Plasmodium spp.), the development of round, non-motile and replicating exo-erythrocytic liver stage forms from slender, motile and cell-cycle arrested sporozoites is believed to depend on environmental changes experienced during the transmission of the parasite from the mosquito vector to the vertebrate host. Here we identify a Plasmodium member of the RNA binding protein family PUF as a key regulator of this transformation. In the absence of Pumilio-2 (Puf2) sporozoites initiate EEF development inside mosquito salivary glands independently of the normal transmission-associated environmental cues. Puf2- sporozoites exhibit genome-wide transcriptional changes that result in loss of gliding motility, cell traversal ability and reduction in infectivity, and, moreover, trigger metamorphosis typical of early Plasmodium intra-hepatic development. These data demonstrate that Puf2 is a key player in regulating sporozoite developmental control, and imply that transformation of salivary gland-resident sporozoites into liver stage-like parasites is regulated by a post-transcriptional mechanism

    Motion-light parametric amplifier and entanglement distributor

    Get PDF
    We propose a scheme for entangling the motional mode of a trapped atom with a propagating light field via a cavity-mediated parametric interaction. We then show that if this light field is subsequently coupled to a second distant atom via a cavity-mediated linear-mixing interaction, it is possible to transfer the entanglement from the light beam to the motional mode of the second atom to create an EPR-type entangled state of the positions and momenta of two distantly-separated atoms.Comment: 9 pages, 8 figures, REVTe

    Slow Light Propagation in a Thin Optical Fiber via Electromagnetically Induced Transparency

    Get PDF
    We propose a novel configuration that utilizes electromagnetically induced transparency (EIT) to tailor a fiber mode propagating inside a thin optical fiber and coherently control its dispersion properties to drastically reduce the group velocity of the fiber mode. The key to this proposal is: the evanescent-like field of the thin fiber strongly couples with the surrounding active medium, so that the EIT condition is met by the medium. We show how the properties of the fiber mode is modified due to the EIT medium, both numerically and analytically. We demonstrate that the group velocity of the new modified fiber mode can be drastically reduced (approximately 44 m/sec) using the coherently prepared orthohydrogen doped in a matrix of parahydrogen crystal as the EIT medium.Comment: 10 pages in two column RevTex4, 6 Figure

    Understanding the performance and reliability of NLP tools: a comparison of four NLP tools predicting stroke phenotypes in radiology reports

    Get PDF
    BACKGROUND: Natural language processing (NLP) has the potential to automate the reading of radiology reports, but there is a need to demonstrate that NLP methods are adaptable and reliable for use in real-world clinical applications. METHODS: We tested the F1 score, precision, and recall to compare NLP tools on a cohort from a study on delirium using images and radiology reports from NHS Fife and a population-based cohort (Generation Scotland) that spans multiple National Health Service health boards. We compared four off-the-shelf rule-based and neural NLP tools (namely, EdIE-R, ALARM+, ESPRESSO, and Sem-EHR) and reported on their performance for three cerebrovascular phenotypes, namely, ischaemic stroke, small vessel disease (SVD), and atrophy. Clinical experts from the EdIE-R team defined phenotypes using labelling techniques developed in the development of EdIE-R, in conjunction with an expert researcher who read underlying images. RESULTS: EdIE-R obtained the highest F1 score in both cohorts for ischaemic stroke, ≥93%, followed by ALARM+, ≥87%. The F1 score of ESPRESSO was ≥74%, whilst that of Sem-EHR is ≥66%, although ESPRESSO had the highest precision in both cohorts, 90% and 98%. For F1 scores for SVD, EdIE-R scored ≥98% and ALARM+ ≥90%. ESPRESSO scored lowest with ≥77% and Sem-EHR ≥81%. In NHS Fife, F1 scores for atrophy by EdIE-R and ALARM+ were 99%, dropping in Generation Scotland to 96% for EdIE-R and 91% for ALARM+. Sem-EHR performed lowest for atrophy at 89% in NHS Fife and 73% in Generation Scotland. When comparing NLP tool output with brain image reads using F1 scores, ALARM+ scored 80%, outperforming EdIE-R at 66% in ischaemic stroke. For SVD, EdIE-R performed best, scoring 84%, with Sem-EHR 82%. For atrophy, EdIE-R and both ALARM+ versions were comparable at 80%. CONCLUSIONS: The four NLP tools show varying F1 (and precision/recall) scores across all three phenotypes, although more apparent for ischaemic stroke. If NLP tools are to be used in clinical settings, this cannot be performed "out of the box." It is essential to understand the context of their development to assess whether they are suitable for the task at hand or whether further training, re-training, or modification is required to adapt tools to the target task

    Random walk with barriers: Diffusion restricted by permeable membranes

    Full text link
    Restrictions to molecular motion by barriers (membranes) are ubiquitous in biological tissues, porous media and composite materials. A major challenge is to characterize the microstructure of a material or an organism nondestructively using a bulk transport measurement. Here we demonstrate how the long-range structural correlations introduced by permeable membranes give rise to distinct features of transport. We consider Brownian motion restricted by randomly placed and oriented permeable membranes and focus on the disorder-averaged diffusion propagator using a scattering approach. The renormalization group solution reveals a scaling behavior of the diffusion coefficient for large times, with a characteristically slow inverse square root time dependence. The predicted time dependence of the diffusion coefficient agrees well with Monte Carlo simulations in two dimensions. Our results can be used to identify permeable membranes as restrictions to transport in disordered materials and in biological tissues, and to quantify their permeability and surface area.Comment: 8 pages, 3 figures; origin of dispersion clarified, refs adde
    corecore