
    Reconstituting typeset Marriage Registers using simple software tools

    In a world of fully integrated software applications, which can seem daunting to develop and to maintain, it is sometimes useful to recall that a system of loosely linked software components can provide surprisingly powerful and flexible methods for software development. This paper describes a project which aims to retypeset a series of volumes from the Phillimore Marriage Registers, first published in England around the turn of the last century. The source material is plain text derived from running Optical Character Recognition (OCR) on a set of page scans taken from the original printed volumes. The regular, tabular structure of the Register pages allows us to automate the retypesetting process. The UNIX troff software and its tbl preprocessor are used for the typesetting itself, but a series of simple awk-based software tools, all of them parsers and code generators of one sort or another, is used to bring about the OCR-to-troff transformation. By re-parsing the generated troff codes it is possible to produce a surname index as a supplement to the retypeset volume. Moreover, this second-stage parsing has been invaluable in discovering subtle ‘typos’ in the automatically generated material. With small adjustments to this parser it would be possible to output the complete marriage entries in standard XML or GEDCOM notations.

    Cluster Based Term Weighting Model for Web Document Clustering

    The term weight is based on the frequency with which the term appears in a document. A term weighting scheme measures the importance of a term with respect to a document and a collection. A term with a higher weight is more important than a term with a lower weight. A document ranking model uses these term weights to find the rank of a document in a collection. We propose a cluster-based term weighting model based on the TF-IDF model. This model adds inter-cluster and intra-cluster frequency components, using the generated clusters as a reference, to improve the retrieval of relevant documents. These inter-cluster and intra-cluster frequency components are used for weighting the importance of a term in addition to the term and document frequency components.
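For orientation, the TF-IDF baseline that this abstract builds on can be sketched as below. This is a minimal illustration under stated assumptions: the function name `tf_idf`, the whitespace tokenizer, and the `tf * log(N / df)` variant are choices made here, not taken from the paper. The abstract does not give formulas for the inter-cluster and intra-cluster components, so they are not modeled.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df).

    Hypothetical helper for illustration only; it covers the plain TF-IDF
    baseline, not the cluster-based extension described in the abstract.
    """
    n_docs = len(docs)
    tokenized = [doc.lower().split() for doc in docs]
    # Document frequency: the number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        tf = Counter(tokens)
        weights.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return weights


docs = ["glass glass liquid", "glass network", "network cluster cluster"]
w = tf_idf(docs)
```

A term that occurs in every document receives weight zero under this variant, which is the usual motivation for adding further components such as the cluster frequencies the abstract mentions.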

    Quantum fluctuations can promote or inhibit glass formation

    The very nature of glass is somewhat mysterious: while relaxation times in glasses are of sufficient magnitude that large-scale motion on the atomic level is essentially as slow as it is in the crystalline state, the structure of glass appears barely different from that of the liquid that produced it. Quantum mechanical systems ranging from electron liquids to superfluid helium appear to form glasses, but as yet no unifying framework exists connecting classical and quantum regimes of vitrification. Here we develop new insights from theory and simulation into the quantum glass transition that surprisingly reveal distinct regions where quantum fluctuations can either promote or inhibit glass formation. Comment: Accepted for publication in Nature Physics. 22 pages, 3 figures, 1 table

    Asymmetric function theory

    The classical theory of symmetric functions has a central position in algebraic combinatorics, bridging aspects of representation theory, combinatorics, and enumerative geometry. More recently, this theory has been fruitfully extended to the larger ring of quasisymmetric functions, with corresponding applications. Here, we survey recent work extending this theory further to general asymmetric polynomials. Comment: 36 pages, 8 figures, 1 table. Written for the proceedings of the Schubert calculus conference in Guangzhou, Nov. 201

    Risk factors for presentation to hospital with severe anaemia in Tanzanian children: a case-control study.

    In malaria-endemic areas anaemia is a usually silent condition that nevertheless places a considerable burden on health services. Cases of severe anaemia often require hospitalization and blood transfusions. The objective of this study was to assess risk factors for admission with anaemia to facilitate the design of anaemia control programmes. We conducted a prospective case-control study of children aged 2-59 months admitted to a district hospital in southern Tanzania. There were 216 cases of severe anaemia [packed cell volume (PCV) < 25%] and 234 age-matched controls (PCV ≥ 25%). Most cases [55.6% (n = 120)] were < 1 year of age. Anaemia was significantly associated with the educational level of parents, type of accommodation, health-seeking behaviour, the child's nutritional status and recent and current medical history. Of these, the single most important factor was Plasmodium falciparum parasitaemia [OR 4.3, 95% confidence interval (CI) 2.9-6.5, P < 0.001]. Multivariate analysis showed that increased recent health expenditure [OR 2.2 (95% CI 1.3-3.9), P = 0.005], malnutrition [OR 2.4 (95% CI 1.3-4.3), P < 0.001], living > 10 km from the hospital [OR 3.0 (95% CI 1.9-4.9), P < 0.001], a history of previous blood transfusion [OR 3.8 (95% CI 1.7-9.1), P < 0.001] and P. falciparum parasitaemia [OR 9.5 (95% CI 4.3-21.3), P < 0.001] were independently related to risk of being admitted with anaemia. These findings are considered in terms of the pathophysiological pathway leading to anaemia. The concentration of anaemia in infants and problems of access to health services and adequate case management underline the need for targeted preventive strategies for anaemia control.

    Latent cluster analysis of ALS phenotypes identifies prognostically differing groups

    BACKGROUND: Amyotrophic lateral sclerosis (ALS) is a degenerative disease predominantly affecting motor neurons and manifesting as several different phenotypes. Whether these phenotypes correspond to different underlying disease processes is unknown. We used latent cluster analysis to identify groupings of clinical variables in an objective and unbiased way to improve phenotyping for clinical and research purposes. METHODS: Latent class cluster analysis was applied to a large database consisting of 1467 records of people with ALS, using discrete variables which can be readily determined at the first clinic appointment. The model was tested for clinical relevance by survival analysis of the phenotypic groupings using the Kaplan-Meier method. RESULTS: The best model generated five distinct phenotypic classes that strongly predicted survival (p<0.0001). Eight variables were used for the latent class analysis, but a good estimate of the classification could be obtained using just two variables: site of first symptoms (bulbar or limb) and time from symptom onset to diagnosis (p<0.00001). CONCLUSION: The five phenotypic classes identified using latent cluster analysis can predict prognosis. They could be used to stratify patients recruited into clinical trials and to generate more homogeneous disease groups for genetic, proteomic and risk factor research.

    Activity of the DNA minor groove cross-linking agent SG2000 (SJG-136) against canine tumours

    BACKGROUND: Cancer is the leading cause of death in older dogs and its prevalence is increasing. There is clearly a need to develop more effective anti-cancer drugs in dogs. SG2000 (SJG-136) is a sequence selective DNA minor groove cross-linking agent. Based on its in vitro potency, the spectrum of in vivo and clinical activity against human tumours, and its tolerability in human patients, SG2000 has potential as a novel therapeutic against spontaneously occurring canine malignancies. RESULTS: In vitro cytotoxicity was assessed using SRB and MTT assays, and in vivo activity was assessed using canine tumour xenografts. DNA interstrand cross-linking (ICL) was determined using a modification of the single cell gel electrophoresis (comet) assay. Effects on cell cycle distribution were assessed by flow cytometry and measurement of γ-H2AX by immunofluorescence and immunohistochemistry. SG2000 had a multi-log differential cytotoxic profile against a panel of 12 canine tumour cell lines representing a range of common tumour types in dogs. In the CMeC-1 melanoma cell line, DNA ICLs increased linearly with dose following a 1 h treatment. Peak ICL was achieved within 1 h and no removal was observed over 48 h. A relationship between DNA ICL formation and cytotoxicity was observed across cell lines. The formation of γ-H2AX foci was slow, becoming evident after 4 h and reaching a peak at 24 h. SG2000 exhibited significant anti-tumour activity against two canine melanoma tumour models in vivo. Anti-tumour activity was observed at 0.15 and 0.3 mg/kg given i.v. either once, or weekly x 3. Dose-dependent DNA ICL was observed in tumours (and to a lower level in peripheral blood mononuclear cells) at 2 h and persisted at 24 h. ICL increased following the second and third doses in a repeated dose schedule. At 24 h, dose dependent γ-H2AX foci were more numerous than at 2 h, and greater in tumours than in peripheral blood mononuclear cells. 
SG2000-induced H2AX phosphorylation measured by immunohistochemistry showed good correspondence with, but less sensitivity than, measurement of foci. CONCLUSIONS: SG2000 displayed potent activity in vitro against canine cancer cell lines as a result of the formation and persistence of DNA ICLs. SG2000 also had significant in vivo antitumour activity against canine melanoma xenografts, and the comet and γ-H2AX foci methods were relevant pharmacodynamic assays. The clinical testing of SG2000 against spontaneous canine cancer is warranted. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12917-015-0534-2) contains supplementary material, which is available to authorized users.

    The interplay of microscopic and mesoscopic structure in complex networks

    Not all nodes in a network are created equal. Differences and similarities exist at both individual node and group levels. Disentangling single node from group properties is crucial for network modeling and structural inference. Based on unbiased generative probabilistic exponential random graph models and employing distributive message passing techniques, we present an efficient algorithm that allows one to separate the contributions of individual nodes and groups of nodes to the network structure. This leads to improved detection accuracy of latent class structure in real world data sets compared to models that focus on group structure alone. Furthermore, the inclusion of hitherto neglected group specific effects in models used to assess the statistical significance of small subgraph (motif) distributions in networks may be sufficient to explain most of the observed statistics. We show the predictive power of such generative models in forecasting putative gene-disease associations in the Online Mendelian Inheritance in Man (OMIM) database. The approach is suitable for both directed and undirected uni-partite as well as for bipartite networks

    Comparison of serious inhaler technique errors made by device-naïve patients using three different dry powder inhalers: a randomised, crossover, open-label study

    Background: Serious inhaler technique errors can impair drug delivery to the lungs. This randomised, crossover, open-label study evaluated the proportion of patients making predefined serious errors with Pulmojet compared with Diskus and Turbohaler dry powder inhalers. Methods: Patients ≥18 years old with asthma and/or COPD who were current users of an inhaler but naïve to the study devices were assigned to inhaler technique assessment on Pulmojet and either Diskus or Turbohaler in a randomised order. Patients inhaled through empty devices after reading the patient information leaflet. If serious errors potentially affecting dose delivery were recorded, they repeated the inhalations after watching a training video. Inhaler technique was assessed by a trained nurse observer and an electronic inhalation profile recorder. Results: Baseline patient characteristics were similar between randomisation arms for the Pulmojet-Diskus (n = 277) and Pulmojet-Turbohaler (n = 144) comparisons. Non-inferiority in the proportions of patients recording no nurse-observed serious errors was demonstrated for both Pulmojet versus Diskus and Pulmojet versus Turbohaler; therefore, superiority was tested. Patients were significantly less likely to make ≥1 nurse-observed serious error using Pulmojet compared with Diskus (odds ratio, 0.31; 95% CI, 0.19–0.51) or Pulmojet compared with Turbohaler (0.23; 0.12–0.44) after reading the patient information leaflet with additional video instruction, if required. Conclusions: These results suggest Pulmojet is easier to learn to use correctly than the Turbohaler or Diskus for current inhaler users switching to a new dry powder inhaler.