80 research outputs found
Bostonia: The Boston University Alumni Magazine. Volume 10
Founded in 1900, Bostonia magazine is Boston University's main alumni publication, which covers alumni and student life, as well as university activities, events, and programs
Analysis of Agglomerative Clustering
The diameter k-clustering problem is the problem of partitioning a finite
subset of R^d into k subsets called clusters such that the maximum
diameter of the clusters is minimized. One early clustering algorithm that
computes a hierarchy of approximate solutions to this problem (for all values
of k) is the agglomerative clustering algorithm with the complete linkage
strategy. For decades, this algorithm has been widely used by practitioners.
However, it is not well studied theoretically. In this paper, we analyze the
agglomerative complete linkage clustering algorithm. Assuming that the
dimension d is a constant, we show that for any k the solution computed by
this algorithm is an O(log k)-approximation to the diameter k-clustering
problem. Our analysis holds not only for the Euclidean distance but for any
metric that is based on a norm. Furthermore, we analyze the closely related
k-center and discrete k-center problems. For the corresponding agglomerative
algorithms, we deduce an approximation factor of O(log k) as well.
Comment: A preliminary version of this article appeared in Proceedings of the
28th International Symposium on Theoretical Aspects of Computer Science
(STACS '11), March 2011, pp. 308-319. This article also appeared in
Algorithmica. The final publication is available at
http://link.springer.com/article/10.1007/s00453-012-9717-
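The agglomerative complete linkage procedure described in the abstract above can be illustrated with a minimal sketch. It is not taken from the paper. It assumes Euclidean points given as tuples, uses the diameter of the merged cluster as the merge criterion (one common way to state complete linkage), and stops at a target number of clusters, here called k. The function names `diameter` and `complete_linkage` are hypothetical, and the naive search over all pairs is chosen for readability, not speed.

```python
from itertools import combinations
from math import dist


def diameter(cluster):
    """Largest pairwise Euclidean distance in a cluster (0.0 for a single point)."""
    return max((dist(p, q) for p, q in combinations(cluster, 2)), default=0.0)


def complete_linkage(points, k):
    """Greedily merge clusters until only k remain.

    Assumption: at each step we merge the pair of clusters whose union
    has the smallest diameter. Ties go to the first pair found.
    """
    # Start with every point in its own cluster.
    clusters = [[p] for p in points]
    while len(clusters) > k:
        # Find the pair (i, j), i < j, whose union has minimum diameter.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: diameter(clusters[ij[0]] + clusters[ij[1]]),
        )
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return clusters


if __name__ == "__main__":
    pts = [(0, 0), (0, 1), (10, 0), (10, 1)]
    result = complete_linkage(pts, 2)
    print(result)
    print(max(diameter(c) for c in result))
```

Recording the clusters after every merge, rather than stopping at k, would give the full hierarchy the abstract refers to, with one clustering for each possible number of clusters.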
Phytogeographical patterns of dry forests sensu stricto in northern Minas Gerais State, Brazil
The Deciduous Complex that occurs in northern Minas Gerais State, Brazil, raises questions about the floristic affinities of these formations in relation to neighboring phytogeographical domains. Little is known about the identity of the seasonal forest formations that comprise this complex, or about its relationships to abiotic components, such as soils, topography and climate. This study aimed to relate the patterns of floristic similarity among all studied fragments of dry forest of northern Minas Gerais to soil and climate attributes, based on the available database. Cluster analysis indicated the existence of two floristic groups that had clear associations with either the Köppen's BSh (semi-arid) or Aw (seasonal tropical) climates. Likewise, the subdivisions of these groups showed clear associations with the dominant soil classes in the region. The Red-Yellow Latosol is the dominant soil class in the BSh climatic domain, seconded by alluvial areas associated with Fluvic Neosols. The Aw domain comprised a much more varied set of soils: Nitosols, Argisols, Cambisols and Litholic Neosols, most derived from the Bambuí limestone/slate formation. The ecotonal nature of northern Minas Gerais State provides a complex interaction between the flora of neighboring phytogeographical domains. This, allied to pedogeomorphological factors, allowed a better understanding of the effects of late Quaternary climate changes on the evolution of the Deciduous Complex. We conclude that the Latosols under present-day semi-arid climates (BSh) are relicts of former wetter climates, during which humid forest (semideciduous) expansion took place. Later, these semideciduous forests were subjected to a much drier climate, when selection for deciduousness led to the present-day Deciduous Complex scenario
High molecular diversity of the fungus Guignardia citricarpa and Guignardia mangiferae and new primers for the diagnosis of the citrus black spot
A Survey of Combinatorial Methods for Phylogenetic Networks
The evolutionary history of a set of species is usually described by a rooted phylogenetic tree. Although it is generally undisputed that bifurcating speciation events and descent with modifications are major forces of evolution, there is a growing belief that reticulate events also have a role to play. Phylogenetic networks provide an alternative to phylogenetic trees and may be more suitable for data sets where evolution involves significant amounts of reticulate events, such as hybridization, horizontal gene transfer, or recombination. In this article, we give an introduction to the topic of phylogenetic networks, very briefly describing the fundamental concepts and summarizing some of the most important combinatorial methods that are available for their computation
Genomic Species Are Ecological Species as Revealed by Comparative Genomics in Agrobacterium tumefaciens
The definition of bacterial species is based on genomic similarities, giving rise to the operational concept of genomic species, but the reasons for the occurrence of differentiated genomic species remain largely unknown. We used the Agrobacterium tumefaciens species complex and particularly the genomic species presently called genomovar G8, which includes the sequenced strain C58, to test the hypothesis that genomic species have specific ecological adaptations possibly involved in the speciation process. We analyzed the gene repertoire specific to G8 to identify potential adaptive genes. By hybridizing 25 strains of A. tumefaciens on DNA microarrays spanning the C58 genome, we highlighted the presence and absence of genes homologous to C58 in the taxon. We found 196 genes specific to genomovar G8 that were mostly clustered into seven genomic islands on the C58 genome (one on the circular chromosome and six on the linear chromosome), suggesting higher plasticity and a major adaptive role of the latter. Clusters encoded putative functional units, four of which had been verified experimentally. The combination of G8-specific functions defines a hypothetical species primary niche for G8 related to commensal interaction with a host plant. This supports the hypothesis that the G8 ancestor was able to exploit a new ecological niche, maybe initiating ecological isolation and thus speciation. Searching genomic data for synapomorphic traits is a powerful way to describe bacterial species. This procedure allowed us to find such phenotypic traits specific to genomovar G8 and thus propose a Latin binomial, Agrobacterium fabrum, for this bona fide genomic species
Identifying practical indicators of biodiversity for stand-level management of plantation forests
Width of Gene Expression Profile Drives Alternative Splicing
Alternative splicing generates an enormous amount of functional and proteomic diversity in metazoan organisms. This process is probably central to the macromolecular and cellular complexity of higher eukaryotes. While most studies have focused on the molecular mechanism triggering and controlling alternative splicing, as well as on its incidence in different species, its maintenance and evolution within populations have been little investigated. Here, we propose to address these questions by comparing the structural characteristics as well as the functional and transcriptional profiles of genes with monomorphic or polymorphic splicing, referred to as MS and PS genes, respectively. We find that MS and PS genes differ particularly in the number of tissues and cell types where they are expressed. We find a striking deficit of PS genes on the sex chromosomes, particularly on the Y chromosome where it is shown not to be due to the observed lower breadth of expression of genes on that chromosome. The development of a simple model of evolution of cis-regulated alternative splicing leads to predictions in agreement with these observations. It further predicts the conditions for the emergence and the maintenance of cis-regulated alternative splicing, which are both favored by the tissue-specific expression of splicing variants. We finally propose that the width of the gene expression profile is an essential factor for the acquisition of new transcript isoforms that could later be maintained by a new form of balancing selection