26 research outputs found
A Differentiation-Based Phylogeny of Cancer Subtypes
Histopathological classification of human tumors relies in part on the degree of differentiation of the tumor sample. To date, there is no objective systematic method to categorize tumor subtypes by maturation. In this paper, we introduce a novel computational algorithm to rank tumor subtypes according to the dissimilarity of their gene expression from that of stem cells and fully differentiated tissue, and thereby construct a phylogenetic tree of cancer. We validate our methodology with expression data of leukemia, breast cancer and liposarcoma subtypes and then apply it to a broader group of sarcomas. This ranking of tumor subtypes resulting from the application of our methodology allows the identification of genes correlated with differentiation and may help to identify novel therapeutic targets. Our algorithm represents the first phylogeny-based tool to analyze the differentiation status of human tumors
Domain architecture evolution of pattern-recognition receptors
In animals, the innate immune system is the first line of defense against invading microorganisms, and the pattern-recognition receptors (PRRs) are the key components of this system, detecting microbial invasion and initiating innate immune defenses. Two families of PRRs, the intracellular NOD-like receptors (NLRs) and the transmembrane Toll-like receptors (TLRs), are of particular interest because of their roles in a number of diseases. Understanding the evolutionary history of these families and their pattern of evolutionary changes may lead to new insights into the functioning of this critical system. We found that the evolution of both NLR and TLR families included massive species-specific expansions and domain shuffling in various lineages, which resulted in the same domain architectures evolving independently within different lineages in a process that fits the definition of parallel evolution. This observation illustrates both the dynamics of the innate immune system and the effects of “combinatorially constrained” evolution, where existence of the limited numbers of functionally relevant domains constrains the choices of domain architectures for new members in the family, resulting in the emergence of independently evolved proteins with identical domain architectures, often mistaken for orthologs
Population-based study of genetic variation in individuals with autism spectrum disorders from Croatia
<p>Abstract</p> <p>Background</p> <p>Genome-wide studies on autism spectrum disorders (ASDs) have mostly focused on large-scale population samples, but examination of rare variations in isolated populations may provide additional insights into the disease pathogenesis.</p> <p>Methods</p> <p>As a first step in the genetic analysis of ASD in Croatia, we characterized genetic variation in a sample of 103 subjects with ASD and 203 control individuals, who were genotyped using the Illumina HumanHap550 BeadChip. We analyzed the genetic diversity of the Croatian population and its relationship to other populations, the degree of relatedness via Runs of Homozygosity (ROHs), and the distribution of large (>500 Kb) copy number variations.</p> <p>Results</p> <p>Combining the Croatian cohort with several previously published populations in the FastME analysis (an alternative to Neighbor Joining) revealed that Croatian subjects cluster, as expected, with Southern Europeans; in addition, individuals from the same geographic region within Europe cluster together. Whereas Croatian subjects could be separated from a sample of healthy control subjects of European origin from North America, Croatian ASD cases and controls are well mixed. A comparison of runs of homozygosity indicated that the number and the median length of regions of homozygosity are higher for ASD subjects than for controls (p = 6 × 10<sup>-3</sup>). Furthermore, analysis of copy number variants found a higher frequency of large chromosomal rearrangements (>2 Mb) in ASD cases (5/103) than in ethnically matched control subjects (1/197, p = 0.019).</p> <p>Conclusions</p> <p>Our findings illustrate the remarkable utility of high-density genotype data for subjects from a limited geographic area in dissecting genetic heterogeneity with respect to population and disease related variation.</p
EEF2 Analysis Challenges the Monophyly of Archaeplastida and Chromalveolata
BACKGROUND: Classification of eukaryotes provides a fundamental phylogenetic framework for ecological, medical, and industrial research. In recent years eukaryotes have been classified into six major supergroups: Amoebozoa, Archaeplastida, Chromalveolata, Excavata, Opisthokonta, and Rhizaria. According to this supergroup classification, Archaeplastida and Chromalveolata each arose from a single plastid-generating endosymbiotic event involving a cyanobacterium (Archaeplastida) or red alga (Chromalveolata). Although the plastids within members of the Archaeplastida and Chromalveolata share some features, no nucleocytoplasmic synapomorphies supporting these supergroups are currently known. METHODOLOGY/PRINCIPAL FINDINGS: This study was designed to test the validity of the Archaeplastida and Chromalveolata through the analysis of nucleus-encoded eukaryotic translation elongation factor 2 (EEF2) and cytosolic heat-shock protein of 70 kDa (HSP70) sequences generated from the glaucophyte Cyanophora paradoxa, the cryptophytes Goniomonas truncata and Guillardia theta, the katablepharid Leucocryptos marina, the rhizarian Thaumatomonas sp. and the green alga Mesostigma viride. The HSP70 phylogeny was largely unresolved except for certain well-established groups. In contrast, EEF2 phylogeny recovered many well-established eukaryotic groups and, most interestingly, revealed a well-supported clade composed of cryptophytes, katablepharids, haptophytes, rhodophytes, and Viridiplantae (green algae and land plants). This clade is further supported by the presence of a two amino acid signature within EEF2, which appears to have arisen from amino acid replacement before the common origin of these eukaryotic groups. CONCLUSIONS/SIGNIFICANCE: Our EEF2 analysis strongly refutes the monophyly of the Archaeplastida and the Chromalveolata, adding to a growing body of evidence that limits the utility of these supergroups. In view of EEF2 phylogeny and other morphological evidence, we discuss the possibility of an alternative eukaryotic supergroup
Estimating phylogenies from molecular data
Phylogenetic estimation from aligned DNA, RNA or amino acid sequences has attracted more and more attention in recent years due to its importance in analysis of many fine-scale genetic data. Nowadays, its application fields range from medical research to drug discovery, to epidemiology, to systematics and population dynamics. Estimating phylogenies involves solving an optimization problem, called the phylogenetic estimation problem (PEP), whose versions depend on the criterion used to select a phylogeny among plausible alternatives. This chapter offers an overview of PEP and discuss the most important versions that occur in the literature.SCOPUS: ch.binfo:eu-repo/semantics/publishe
Evidence of selection upon genomic GC-content in bacteria
The genomic GC-content of bacteria varies dramatically, from less than 20% to more than 70%. This variation is generally ascribed to differences in the pattern of mutation between bacteria. Here we test this hypothesis by examining patterns of synonymous polymorphism using datasets from 149 bacterial species. We find a large excess of synonymous GC->AT mutations over AT->GC mutations segregating in all but the most AT-rich bacteria, across a broad range of phylogenetically diverse species. We show that the excess of GC->AT mutations is inconsistent with mutation bias, since it would imply that most GC-rich bacteria are declining in GC-content; such a pattern would be unsustainable. We also show that the patterns are probably not due to translational selection or biased gene conversion, because optimal codons tend to be AT-rich, and the excess of GC->AT SNPs is observed in datasets with no evidence of recombination. We therefore conclude that there is selection to increase synonymous GC-content in many species. Since synonymous GC-content is highly correlated to genomic GC-content, we further conclude that there is selection on genomic base composition in many bacteria