93 research outputs found
Improving the tokenisation of identifier names
Identifier names are the main vehicle for semantic information during program comprehension. For tool-supported program comprehension tasks, including concept location and requirements traceability, identifier names need to be tokenised into their semantic constituents. In this paper we present an approach to the automated tokenisation of identifier names that improves on existing techniques in two ways. First, it improves the tokenisation accuracy for single-case identifier names and for identifier names containing digits, which existing techniques largely ignore. Second, performance gains over existing techniques are achieved using smaller oracles, making the approach easier to deploy.
Accuracy was evaluated by comparing our algorithm to manual tokenizations of 28,000 identifier names drawn from 60 well-known open source Java projects totalling 16.5 MSLOC. Moreover, the projects were used to perform a study of identifier tokenisation features (single case, camel case, use of digits, etc.) per object-oriented construct (class names, method names, local variable names, etc.), thus providing an insight into naming conventions in industrial-scale object-oriented code. Our tokenisation tool and datasets are publicly available
A highly magnified candidate for a young galaxy seen when the Universe was 500 Myrs old
The early Universe at redshift z\sim6-11 marks the reionization of the
intergalactic medium, following the formation of the first generation of stars.
However, those young galaxies at a cosmic age of \lesssim 500 million years
(Myr, at z \gtrsim 10) remain largely unexplored as they are at or beyond the
sensitivity limits of current large telescopes. Gravitational lensing by galaxy
clusters enables the detection of high-redshift galaxies that are fainter than
what otherwise could be found in the deepest images of the sky. We report the
discovery of an object found in the multi-band observations of the cluster
MACS1149+22 that has a high probability of being a gravitationally magnified
object from the early universe. The object is firmly detected (12 sigma) in the
two reddest bands of HST/WFC3, and not detected below 1.2 {\mu}m, matching the
characteristics of z\sim9 objects. We derive a robust photometric redshift of z
= 9.6 \pm 0.2, corresponding to a cosmic age of 490 \pm 15Myr (i.e., 3.6% of
the age of the Universe). The large number of bands used to derive the redshift
estimate make it one of the most accurate estimates ever obtained for such a
distant object. The significant magnification by cluster lensing (a factor of
\sim15) allows us to analyze the object's ultra-violet and optical luminosity
in its rest-frame, thus enabling us to constrain on its stellar mass,
star-formation rate and age. If the galaxy is indeed at such a large redshift,
then its age is less than 200 Myr (at the 95% confidence level), implying a
formation redshift of zf \lesssim 14. The object is the first z>9 candidate
that is bright enough for detailed spectroscopic studies with JWST,
demonstrating the unique potential of galaxy cluster fields for finding highly
magnified, intrinsically faint galaxies at the highest redshifts.Comment: Submitted to the Nature Journal. 39 Pages, 13 figure
Distinguishing Asthma Phenotypes Using Machine Learning Approaches.
Asthma is not a single disease, but an umbrella term for a number of distinct diseases, each of which are caused by a distinct underlying pathophysiological mechanism. These discrete disease entities are often labelled as asthma endotypes. The discovery of different asthma subtypes has moved from subjective approaches in which putative phenotypes are assigned by experts to data-driven ones which incorporate machine learning. This review focuses on the methodological developments of one such machine learning technique-latent class analysis-and how it has contributed to distinguishing asthma and wheezing subtypes in childhood. It also gives a clinical perspective, presenting the findings of studies from the past 5 years that used this approach. The identification of true asthma endotypes may be a crucial step towards understanding their distinct pathophysiological mechanisms, which could ultimately lead to more precise prevention strategies, identification of novel therapeutic targets and the development of effective personalized therapies
Randomised controlled trial of food elimination diet based on IgG antibodies for the prevention of migraine like headaches
<p>Abstract</p> <p>Background</p> <p>Research suggests that food intolerance may be a precipitating factor for migraine like headaches.</p> <p>Aim</p> <p>To evaluate the effectiveness of the ELISA (Enzyme Linked Immuno-Sorbent Assay) Test and subsequent dietary elimination advice for the prevention of migraine like headaches.</p> <p>Design</p> <p>Randomised controlled trial.</p> <p>Setting</p> <p>Community based volunteers in the UK.</p> <p>Participants</p> <p>Volunteers who met the inclusion criteria for migraine like headaches and had one or more food intolerance were included in the study. Participants received either a true diet (n = 84) or a sham diet (n = 83) sheet. Participants were advised to remove the intolerant foods from their diet for 12 weeks.</p> <p>Main outcome measures</p> <p>Number of headache days over a 12 week period (item A MIDAS questionnaire). Other measures includes the total MIDAS score and total HIT-6 score.</p> <p>Results</p> <p>The results indicated a small decrease in the number of migraine like headaches over 12 weeks, although this difference was not statistically significant (IRR 1.15 95% CI 0.94 to 1.41, p = 0.18). At the 4 week assessment, use of the ELISA test with subsequent diet elimination advice significantly reduced the number of migraine like headaches (IRR 1.23 95%CI 1.01 to 1.50, p = 0.04). The disability and impact on daily life of migraines were not significantly different between the true and sham diet groups.</p> <p>Conclusions</p> <p>Use of the ELISA test with subsequent diet elimination advice did not reduce the disability or impact on daily life of migraine like headaches or the number of migraine like headaches at 12 weeks but it did significantly reduce the number of migraine like headaches at 4 weeks.</p> <p>Trial registration number</p> <p>ISRCTN: <a href="http://www.controlled-trials.com/ISRTCN89559672">ISRTCN89559672</a></p
Impact of non-axillary sentinel node biopsy on staging and treatment of breast cancer patients
The purpose of this study was to evaluate the occurrence of lymphatic drainage to non-axillary sentinel nodes and to determine the implications of this phenomenon. A total of 549 breast cancer patients underwent lymphoscintigraphy after intratumoural injection of 99mTc-nanocolloid. The sentinel node was intraoperatively identified with the aid of intratumoural administered patent blue dye and a gamma-ray detection probe. Histopathological examination of sentinel nodes included step-sectioning at six levels and immunohistochemical staining. A sentinel node outside level I or II of the axilla was found in 149 patients (27%): internal mammary sentinel nodes in 86 patients, other non-axillary sentinel nodes in 44 and both internal mammary and other non-axillary sentinel nodes in nineteen patients. The intra-operative identification rate was 80%. Internal mammary metastases were found in seventeen patients and metastases in other non-axillary sentinel nodes in ten patients. Staging improved in 13% of patients with non-axillary sentinel lymph nodes and their treatment strategy was changed in 17%. A small proportion of clinically node negative breast cancer patients can be staged more precisely by biopsy of sentinel nodes outside level I and II of the axilla, resulting in additional decision criteria for postoperative regional or systemic therapy
Follicular fluid content and oocyte quality: from single biochemical markers to metabolomics
The assessment of oocyte quality in human in vitro fertilization (IVF) is getting increasing attention from embryologists. Oocyte selection and the identification of the best oocytes, in fact, would help to limit embryo overproduction and to improve the results of oocyte cryostorage programs. Follicular fluid (FF) is easily available during oocyte pick-up and theorically represents an optimal source on non-invasive biochemical predictors of oocyte quality. Unfortunately, however, the studies aiming to find a good molecular predictor of oocyte quality in FF were not able to identify substances that could be used as reliable markers of oocyte competence to fertilization, embryo development and pregnancy. In the last years, a well definite trend toward passing from the research of single molecular markers to more complex techniques that study all metabolites of FF has been observed. The metabolomic approach is a powerful tool to study biochemical predictors of oocyte quality in FF, but its application in this area is still at the beginning. This review provides an overview of the current knowledge about the biochemical predictors of oocyte quality in FF, describing both the results coming from studies on single biochemical markers and those deriving from the most recent studies of metabolomic
A magnified young galaxy from about 500 million years after the Big Bang
Re-ionization of the intergalactic medium occurred in the early Universe at redshift z ≈ 6-11, following the formation of the first generation of stars. Those young galaxies (where the bulk of stars formed) at a cosmic age of less than about 500 million years (z ≲ 10) remain largely unexplored because they are at or beyond the sensitivity limits of existing large telescopes. Understanding the properties of these galaxies is critical to identifying the source of the radiation that re-ionized the intergalactic medium. Gravitational lensing by galaxy clusters allows the detection of high-redshift galaxies fainter than what otherwise could be found in the deepest images of the sky. Here we report multiband observations of the cluster MACS J1149+2223 that have revealed (with high probability) a gravitationally magnified galaxy from the early Universe, at a redshift of z = 9.6 ± 0.2 (that is, a cosmic age of 490 ± 15 million years, or 3.6 per cent of the age of the Universe). We estimate that it formed less than 200 million years after the Big Bang (at the 95 per cent confidence level), implying a formation redshift of ≲14. Given the small sky area that our observations cover, faint galaxies seem to be abundant at such a young cosmic age, suggesting that they may be the dominant source for the early re-ionization of the intergalactic medium
- …