553 research outputs found
Software Defect Association Mining and Defect Correction Effort Prediction
Much current software defect prediction work concentrates on the number of defects remaining in software system. In this paper, we present association rule mining based methods to predict defect associations and defect-correction effort. This is to help developers detect software defects and assist project managers in allocating testing resources more effectively. We applied the proposed methods to the SEL defect data consisting of more than 200 projects over more than 15 years. The results show that for the defect association prediction, the accuracy is very high and the false negative rate is very low. Likewise for the defect-correction effort prediction, the accuracy for both defect isolation effort prediction and defect correction effort prediction are also high. We compared the defect-correction effort prediction method with other types of methods: PART, C4.5, and Na¨ıve Bayes and show that accuracy has been improved by at least 23%. We also evaluated the impact of support and confidence levels on prediction accuracy, false negative rate, false positive rate, and the number of rules. We found that higher support and confidence levels may not result in higher prediction accuracy, and a sufficient number of rules is a precondition for high prediction accuracy
Data quality: Some comments on the NASA software defect datasets
Background-Self-evidently empirical analyses rely upon the quality of their data. Likewise, replications rely upon accurate reporting and using the same rather than similar versions of datasets. In recent years, there has been much interest in using machine learners to classify software modules into defect-prone and not defect-prone categories. The publicly available NASA datasets have been extensively used as part of this research. Objective-This short note investigates the extent to which published analyses based on the NASA defect datasets are meaningful and comparable. Method-We analyze the five studies published in the IEEE Transactions on Software Engineering since 2007 that have utilized these datasets and compare the two versions of the datasets currently in use. Results-We find important differences between the two versions of the datasets, implausible values in one dataset and generally insufficient detail documented on dataset preprocessing. Conclusions-It is recommended that researchers 1) indicate the provenance of the datasets they use, 2) report any preprocessing in sufficient detail to enable meaningful replication, and 3) invest effort in understanding the data prior to applying machine learners
Orbital Angular Momentum Parton Distributions in Light-Front Dynamics
We study the quark angular momentum distribution in the nucleon within a
light-front covariant quark model. Special emphasis is put into the orbital
angular momentum: a quantity which is very sensitive to the relativistic
treatment of the spin in a light-front dynamical approach. Discrepancies with
the predictions of the low-energy traditional quark models where relativistic
spin effects are neglected, are visible also after perturbative evolution to
higher momentum scales. Orbital angular momentum distributions and their
contribution to the spin sum rule are calculated for different phenomenological
mass operators and compared with the results of the MIT bag model.Comment: 14 pages; latex; 3 ps figure
Transition of plasmodium sporozoites into liver stage-like forms is regulated by the RNA binding protein pumilio
Many eukaryotic developmental and cell fate decisions that are effected post-transcriptionally involve RNA binding proteins as regulators of translation of key mRNAs. In malaria parasites (Plasmodium spp.), the development of round, non-motile and replicating exo-erythrocytic liver stage forms from slender, motile and cell-cycle arrested sporozoites is believed to depend on environmental changes experienced during the transmission of the parasite from the mosquito vector to the vertebrate host. Here we identify a Plasmodium member of the RNA binding protein family PUF as a key regulator of this transformation. In the absence of Pumilio-2 (Puf2) sporozoites initiate EEF development inside mosquito salivary glands independently of the normal transmission-associated environmental cues. Puf2- sporozoites exhibit genome-wide transcriptional changes that result in loss of gliding motility, cell traversal ability and reduction in infectivity, and, moreover, trigger metamorphosis typical of early Plasmodium intra-hepatic development. These data demonstrate that Puf2 is a key player in regulating sporozoite developmental control, and imply that transformation of salivary gland-resident sporozoites into liver stage-like parasites is regulated by a post-transcriptional mechanism
Recommended from our members
Trilingual education in China: perspectives from a university programme for minority students
Attention to trilingual education programmes in China has tended to focus on basic education; there had been little attention to date on the higher education sector. The present paper will attempt to bridge this gap by exploring a Yi-English-Chinese (YEC) trilingual education programme through case studies of three Yi students. We will start by briefly exploring multilingualism and minority participation in higher education in China, contextual issues important for an understanding of trilingual education. We will then describe the methodology employed to collect data from the students before analysing the life events which steer them to the programme, the ways in which they position themselves in relation to the three languages, and the implications for the delivery of the programme and the learning outcomes of students
Motion-light parametric amplifier and entanglement distributor
We propose a scheme for entangling the motional mode of a trapped atom with a
propagating light field via a cavity-mediated parametric interaction. We then
show that if this light field is subsequently coupled to a second distant atom
via a cavity-mediated linear-mixing interaction, it is possible to transfer the
entanglement from the light beam to the motional mode of the second atom to
create an EPR-type entangled state of the positions and momenta of two
distantly-separated atoms.Comment: 9 pages, 8 figures, REVTe
Slow Light Propagation in a Thin Optical Fiber via Electromagnetically Induced Transparency
We propose a novel configuration that utilizes electromagnetically induced
transparency (EIT) to tailor a fiber mode propagating inside a thin optical
fiber and coherently control its dispersion properties to drastically reduce
the group velocity of the fiber mode. The key to this proposal is: the
evanescent-like field of the thin fiber strongly couples with the surrounding
active medium, so that the EIT condition is met by the medium. We show how the
properties of the fiber mode is modified due to the EIT medium, both
numerically and analytically. We demonstrate that the group velocity of the new
modified fiber mode can be drastically reduced (approximately 44 m/sec) using
the coherently prepared orthohydrogen doped in a matrix of parahydrogen crystal
as the EIT medium.Comment: 10 pages in two column RevTex4, 6 Figure
Understanding the performance and reliability of NLP tools: a comparison of four NLP tools predicting stroke phenotypes in radiology reports
BACKGROUND: Natural language processing (NLP) has the potential to automate the reading of radiology reports, but there is a need to demonstrate that NLP methods are adaptable and reliable for use in real-world clinical applications. METHODS: We tested the F1 score, precision, and recall to compare NLP tools on a cohort from a study on delirium using images and radiology reports from NHS Fife and a population-based cohort (Generation Scotland) that spans multiple National Health Service health boards. We compared four off-the-shelf rule-based and neural NLP tools (namely, EdIE-R, ALARM+, ESPRESSO, and Sem-EHR) and reported on their performance for three cerebrovascular phenotypes, namely, ischaemic stroke, small vessel disease (SVD), and atrophy. Clinical experts from the EdIE-R team defined phenotypes using labelling techniques developed in the development of EdIE-R, in conjunction with an expert researcher who read underlying images. RESULTS: EdIE-R obtained the highest F1 score in both cohorts for ischaemic stroke, ≥93%, followed by ALARM+, ≥87%. The F1 score of ESPRESSO was ≥74%, whilst that of Sem-EHR is ≥66%, although ESPRESSO had the highest precision in both cohorts, 90% and 98%. For F1 scores for SVD, EdIE-R scored ≥98% and ALARM+ ≥90%. ESPRESSO scored lowest with ≥77% and Sem-EHR ≥81%. In NHS Fife, F1 scores for atrophy by EdIE-R and ALARM+ were 99%, dropping in Generation Scotland to 96% for EdIE-R and 91% for ALARM+. Sem-EHR performed lowest for atrophy at 89% in NHS Fife and 73% in Generation Scotland. When comparing NLP tool output with brain image reads using F1 scores, ALARM+ scored 80%, outperforming EdIE-R at 66% in ischaemic stroke. For SVD, EdIE-R performed best, scoring 84%, with Sem-EHR 82%. For atrophy, EdIE-R and both ALARM+ versions were comparable at 80%. CONCLUSIONS: The four NLP tools show varying F1 (and precision/recall) scores across all three phenotypes, although more apparent for ischaemic stroke. If NLP tools are to be used in clinical settings, this cannot be performed "out of the box." It is essential to understand the context of their development to assess whether they are suitable for the task at hand or whether further training, re-training, or modification is required to adapt tools to the target task
Random walk with barriers: Diffusion restricted by permeable membranes
Restrictions to molecular motion by barriers (membranes) are ubiquitous in
biological tissues, porous media and composite materials. A major challenge is
to characterize the microstructure of a material or an organism
nondestructively using a bulk transport measurement. Here we demonstrate how
the long-range structural correlations introduced by permeable membranes give
rise to distinct features of transport. We consider Brownian motion restricted
by randomly placed and oriented permeable membranes and focus on the
disorder-averaged diffusion propagator using a scattering approach. The
renormalization group solution reveals a scaling behavior of the diffusion
coefficient for large times, with a characteristically slow inverse square root
time dependence. The predicted time dependence of the diffusion coefficient
agrees well with Monte Carlo simulations in two dimensions. Our results can be
used to identify permeable membranes as restrictions to transport in disordered
materials and in biological tissues, and to quantify their permeability and
surface area.Comment: 8 pages, 3 figures; origin of dispersion clarified, refs adde
Understanding the performance and reliability of NLP tools:A comparison of four NLP tools predicting stroke phenotypes in radiology reports
- …