80 research outputs found

    AHXR 101.01: Patient Care in Radiology

    Get PDF

    Analysis of Agglomerative Clustering

    Full text link
    The diameter kk-clustering problem is the problem of partitioning a finite subset of Rd\mathbb{R}^d into kk subsets called clusters such that the maximum diameter of the clusters is minimized. One early clustering algorithm that computes a hierarchy of approximate solutions to this problem (for all values of kk) is the agglomerative clustering algorithm with the complete linkage strategy. For decades, this algorithm has been widely used by practitioners. However, it is not well studied theoretically. In this paper, we analyze the agglomerative complete linkage clustering algorithm. Assuming that the dimension dd is a constant, we show that for any kk the solution computed by this algorithm is an O(logk)O(\log k)-approximation to the diameter kk-clustering problem. Our analysis does not only hold for the Euclidean distance but for any metric that is based on a norm. Furthermore, we analyze the closely related kk-center and discrete kk-center problem. For the corresponding agglomerative algorithms, we deduce an approximation factor of O(logk)O(\log k) as well.Comment: A preliminary version of this article appeared in Proceedings of the 28th International Symposium on Theoretical Aspects of Computer Science (STACS '11), March 2011, pp. 308-319. This article also appeared in Algorithmica. The final publication is available at http://link.springer.com/article/10.1007/s00453-012-9717-

    Phytogeographical patterns of dry forests sensu stricto in northern Minas Gerais State, Brazil

    Get PDF
    The Deciduous Complex that occurs in northern Minas Gerais State, Brazil, raises questions about the floristic affinities of these formations in relation to neighboring phytogeographical domains. Little is known about the identity of the seasonal forest formations that comprise this complex, or about its relationships to abiotic components, such as soils, topography and climate. This study aimed to recognize the patterns of floristic similarity of all studied fragments of dry forest of northern Minas Gerais with soil and climate attributes, based on the available database. Cluster analysis indicated the existence of two floristic groups that had clear associations with either the Koppen's BSh (semi-arid) or Aw (seasonal tropical) climates. Likewise, the subdivisions of these groups showed clear associations with the dominant soil classes in the region. The Red-Yellow Latosol is the dominant soil classes in the BSh climatic domain, seconded by alluvial areas associated with Fluvic Neosols. The Aw domain comprised a much varied set of soils: Nitosols, Argisols, Cambisols and Litholic Neosols, most derived from the Bambuí limestone/slate formation. The ecotonal nature of northern Minas Gerais State provides a complex interaction between the flora of neighboring phytogeographical domains. This, allied to pedogeomorphological factors, allowed a better understanding of the effects of late Quaternary climate changes for the Deciduous Complex evolution. We conclude that the Latosols under present-day semi-arid climates (BSh) are relicts of former wetter climates, during which humid forest (semideciduous) expansion took place. Later, these semideciduous forests were subjected to a much drier climate, when selection for deciduousness led to the present-days Deciduous Complex scenario

    A Survey of Combinatorial Methods for Phylogenetic Networks

    Get PDF
    The evolutionary history of a set of species is usually described by a rooted phylogenetic tree. Although it is generally undisputed that bifurcating speciation events and descent with modifications are major forces of evolution, there is a growing belief that reticulate events also have a role to play. Phylogenetic networks provide an alternative to phylogenetic trees and may be more suitable for data sets where evolution involves significant amounts of reticulate events, such as hybridization, horizontal gene transfer, or recombination. In this article, we give an introduction to the topic of phylogenetic networks, very briefly describing the fundamental concepts and summarizing some of the most important combinatorial methods that are available for their computation

    Genomic Species Are Ecological Species as Revealed by Comparative Genomics in Agrobacterium tumefaciens

    Get PDF
    The definition of bacterial species is based on genomic similarities, giving rise to the operational concept of genomic species, but the reasons of the occurrence of differentiated genomic species remain largely unknown. We used the Agrobacterium tumefaciens species complex and particularly the genomic species presently called genomovar G8, which includes the sequenced strain C58, to test the hypothesis of genomic species having specific ecological adaptations possibly involved in the speciation process. We analyzed the gene repertoire specific to G8 to identify potential adaptive genes. By hybridizing 25 strains of A. tumefaciens on DNA microarrays spanning the C58 genome, we highlighted the presence and absence of genes homologous to C58 in the taxon. We found 196 genes specific to genomovar G8 that were mostly clustered into seven genomic islands on the C58 genome—one on the circular chromosome and six on the linear chromosome—suggesting higher plasticity and a major adaptive role of the latter. Clusters encoded putative functional units, four of which had been verified experimentally. The combination of G8-specific functions defines a hypothetical species primary niche for G8 related to commensal interaction with a host plant. This supports that the G8 ancestor was able to exploit a new ecological niche, maybe initiating ecological isolation and thus speciation. Searching genomic data for synapomorphic traits is a powerful way to describe bacterial species. This procedure allowed us to find such phenotypic traits specific to genomovar G8 and thus propose a Latin binomial, Agrobacterium fabrum, for this bona fide genomic species

    Width of Gene Expression Profile Drives Alternative Splicing

    Get PDF
    Alternative splicing generates an enormous amount of functional and proteomic diversity in metazoan organisms. This process is probably central to the macromolecular and cellular complexity of higher eukaryotes. While most studies have focused on the molecular mechanism triggering and controlling alternative splicing, as well as on its incidence in different species, its maintenance and evolution within populations has been little investigated. Here, we propose to address these questions by comparing the structural characteristics as well as the functional and transcriptional profiles of genes with monomorphic or polymorphic splicing, referred to as MS and PS genes, respectively. We find that MS and PS genes differ particularly in the number of tissues and cell types where they are expressed.We find a striking deficit of PS genes on the sex chromosomes, particularly on the Y chromosome where it is shown not to be due to the observed lower breadth of expression of genes on that chromosome. The development of a simple model of evolution of cis-regulated alternative splicing leads to predictions in agreement with these observations. It further predicts the conditions for the emergence and the maintenance of cis-regulated alternative splicing, which are both favored by the tissue specific expression of splicing variants. We finally propose that the width of the gene expression profile is an essential factor for the acquisition of new transcript isoforms that could later be maintained by a new form of balancing selection
    corecore