
    Linear, Deterministic, and Order-Invariant Initialization Methods for the K-Means Clustering Algorithm

    Over the past five decades, k-means has become the clustering algorithm of choice in many application domains, primarily due to its simplicity, time/space efficiency, and invariance to the ordering of the data points. Unfortunately, the algorithm's sensitivity to the initial selection of the cluster centers remains its most serious drawback. Numerous initialization methods have been proposed to address this drawback. Many of these methods, however, have time complexity superlinear in the number of data points, which makes them impractical for large data sets. On the other hand, linear methods are often random and/or sensitive to the ordering of the data points. These methods are generally unreliable in that the quality of their results is unpredictable. Therefore, it is common practice to perform multiple runs of such methods and take the output of the run that produces the best results. Such a practice, however, greatly increases the computational requirements of the otherwise highly efficient k-means algorithm. In this chapter, we investigate the empirical performance of six linear, deterministic (non-random), and order-invariant k-means initialization methods on a large and diverse collection of data sets from the UCI Machine Learning Repository. The results demonstrate that two relatively unknown hierarchical initialization methods due to Su and Dy outperform the remaining four methods with respect to two objective effectiveness criteria. In addition, a recent method due to Erisoglu et al. performs surprisingly poorly.
    Comment: 21 pages, 2 figures, 5 tables, Partitional Clustering Algorithms (Springer, 2014). arXiv admin note: substantial text overlap with arXiv:1304.7465, arXiv:1209.196
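The multiple-run practice that the abstract describes can be illustrated with a minimal sketch. It is not the chapter's code and does not implement any of the six deterministic methods studied there. It assumes plain Lloyd-style k-means with randomly sampled initial centers, and the helper names (`sq_dist`, `kmeans`, `best_of_restarts`) and the sum of squared errors (SSE) as the selection criterion are illustrative choices.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, rng, max_iter=100):
    """One k-means run from k randomly sampled initial centers (assumed init)."""
    centers = rng.sample(points, k)
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda i: sq_dist(p, centers[i]))
            clusters[j].append(p)
        # Update step: move each center to the mean of its cluster;
        # an empty cluster keeps its previous center.
        new_centers = [
            tuple(sum(c) / len(cl) for c in zip(*cl)) if cl else centers[i]
            for i, cl in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    sse = sum(min(sq_dist(p, c) for c in centers) for p in points)
    return centers, sse

def best_of_restarts(points, k, n_runs=10, seed=0):
    """Repeat random-init k-means n_runs times and keep the lowest-SSE run."""
    rng = random.Random(seed)
    return min((kmeans(points, k, rng) for _ in range(n_runs)),
               key=lambda run: run[1])
```

Each extra restart costs one more full k-means run, which is the added expense the abstract attributes to this practice and which a deterministic initialization avoids.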

    SrCo1−xTixO3−δ perovskites as excellent catalysts for fast degradation of water contaminants in neutral and alkaline solutions

    Perovskite-like oxides SrCo1−xTixO3−δ (SCTx, x = 0.1, 0.2, 0.4, 0.6) were used as heterogeneous catalysts to activate peroxymonosulfate (PMS) for phenol degradation over a wide pH range, exhibiting more rapid phenol oxidation than Co3O4 and TiO2. The SCT0.4/PMS system showed high activity at increased initial pH, achieving optimized performance at pH ≥ 7 in terms of total organic carbon removal, minimum Co leaching and good catalytic stability. Kinetic studies showed that phenol oxidation on the SCT0.4/PMS system followed pseudo-zero-order kinetics, and that the rate decreased with increasing initial phenol concentration and with decreasing PMS amount, catalyst loading and solution temperature. Quenching tests using ethanol and tert-butyl alcohol indicated that sulfate and hydroxyl radicals were involved in phenol oxidation. This investigation suggests promising heterogeneous catalysts for organic oxidation with PMS, marking a breakthrough in overcoming the barriers of metal leaching, acidic pH, and low efficiency of heterogeneous catalysis.

    Warmer Weather Linked to Tick Attack and Emergence of Severe Rickettsioses

    The impact of climate on the vector behaviour of the worldwide dog tick Rhipicephalus sanguineus is a cause of concern. This tick is a vector for life-threatening organisms including Rickettsia rickettsii, the agent of Rocky Mountain spotted fever, R. conorii, the agent of Mediterranean spotted fever, and the ubiquitous emerging pathogen R. massiliae. A focus of spotted fever was investigated in France in May 2007. Blood and tissue samples from two patients were tested. An entomological survey was organised together with a study of climatic conditions. An experimental model was designed to test the affinity of Rh. sanguineus for biting humans under variable temperature conditions. Serological and/or molecular tools confirmed that one patient was infected by R. conorii, whereas the other was infected by R. massiliae. Dense populations of Rh. sanguineus were found. They were infected with new genotypes of clonal populations of either R. conorii (24/133; 18%) or R. massiliae (13/133; 10%). April 2007 was the warmest since 1950, with summer-like temperatures. We show herein that the human affinity of Rh. sanguineus was increased in warmer temperatures. In addition to the originality of these cases (ophthalmic involvements, the second reported case of R. massiliae infection), we provide evidence that this cluster of cases was related to a warming-mediated increase in the aggressiveness of Rh. sanguineus, leading to increased human attacks. From a global perspective, we predict that as a result of globalisation and warming, more pathogens transmitted by the brown dog tick may emerge in the future.

    Developing and testing an instrument for identifying performance incentives in the Greek health care sector

    BACKGROUND: In the era of cost containment, managers are constantly pursuing increased organizational performance and productivity by aiming at the obvious target, i.e. the workforce. The health care sector, in which production processes are more complicated compared to other industries, is not an exception. In light of recent legislation in Greece in which efficiency improvement and achievement of specific performance targets are identified as indisputable health system goals, the purpose of this study was to develop a reliable and valid instrument for investigating the attitudes of Greek physicians, nurses and administrative personnel towards job-related aspects, and the extent to which these motivate them to improve performance and increase productivity. METHODS: A methodological exploratory design was employed in three phases: a) content development and assessment, which resulted in a 28-item instrument, b) pilot testing (N = 74) and c) field testing (N = 353). Internal consistency reliability was tested via Cronbach's alpha coefficient and factor analysis was used to identify the underlying constructs. Tests of scaling assumptions, according to the Multitrait-Multimethod Matrix, were used to confirm the hypothesized component structure. RESULTS: Four components, referring to intrinsic individual needs and external job-related aspects, were revealed and explained 59.61% of the variability. They were subsequently labeled: job attributes, remuneration, co-workers and achievement. Nine items not meeting item-scale criteria were removed, resulting in a 19-item instrument. Scale reliability ranged from 0.782 to 0.901 and internal item consistency and discriminant validity criteria were satisfied. CONCLUSION: Overall, the instrument appears to be a promising tool for hospital administrations in their attempt to identify job-related factors that motivate their employees. The psychometric properties were good and warrant administration to a larger sample of employees in the Greek healthcare system.
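The internal consistency test named in METHODS, Cronbach's alpha, can be sketched as follows. This is an illustration of the standard formula, alpha = k/(k − 1) · (1 − Σ item variances / variance of total scores), and not the study's analysis code; the function name `cronbach_alpha` and the toy scores are assumptions.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Sample variance of each item (one column per item).
    item_vars = [variance(col) for col in zip(*scores)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Toy data (hypothetical): three respondents answering three perfectly
# consistent items, so alpha comes out at its maximum of 1.0.
toy_scores = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
toy_alpha = cronbach_alpha(toy_scores)
```

Values closer to 1 indicate that the items of a scale vary together; the abstract reports scale reliabilities between 0.782 and 0.901.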

    Deciphering the Code for Retroviral Integration Target Site Selection

    Upon cell invasion, retroviruses generate a DNA copy of their RNA genome and integrate retroviral cDNA within host chromosomal DNA. Integration occurs throughout the host cell genome, but target site selection is not random. Each subgroup of retrovirus is distinguished from the others by attraction to particular features on chromosomes. Despite extensive efforts to identify host factors that interact with retrovirion components or chromosome features predictive of integration, little is known about how integration sites are selected. We attempted to identify markers predictive of retroviral integration by exploiting Precision-Recall methods for extracting information from highly skewed datasets to derive robust and discriminating measures of association. ChIPSeq datasets for more than 60 factors were compared with 14 retroviral integration datasets. When compared with MLV, PERV or XMRV integration sites, strong association was observed with STAT1, acetylation of H3 and H4 at several positions, and methylation of H2AZ, H3K4, and K9. By combining peaks from ChIPSeq datasets, a supermarker was identified that localized within 2 kb of 75% of MLV proviruses and detected differences in integration preferences among different cell types. The supermarker predicted the likelihood of integration within specific chromosomal regions in a cell-type specific manner, yielding probabilities for integration into proto-oncogene LMO2 identical to experimentally determined values. The supermarker thus identifies chromosomal features highly favored for retroviral integration, provides clues to the mechanism by which retrovirus integration sites are selected, and offers a tool for predicting cell-type specific proto-oncogene activation by retroviruses.

    The immunopathology of canine vector-borne diseases

    The canine vector-borne infectious diseases (CVBDs) are an emerging problem in veterinary medicine and the zoonotic potential of many of these agents is a significant consideration for human health. The successful diagnosis, treatment and prevention of these infections depend on a firm understanding of the underlying immunopathology of the diseases, in which there are unique tripartite interactions between the microorganism, the vector and the host immune system. Although significant advances have been made in the areas of molecular speciation and the epidemiology of these infections and their vectors, basic knowledge of the pathology and immunology of the diseases has lagged behind. This review summarizes recent studies of the pathology and host immune response in the major CVBDs (leishmaniosis, babesiosis, ehrlichiosis, hepatozoonosis, anaplasmosis, bartonellosis and borreliosis). The ultimate application of such immunological investigation is the development of effective vaccines. The current commercially available vaccines for canine leishmaniosis, babesiosis and borreliosis are reviewed.