115 research outputs found

    PhyloPattern: regular expressions to identify complex patterns in phylogenetic trees

    Background: To effectively apply evolutionary concepts in genome-scale studies, large numbers of phylogenetic trees have to be automatically analysed, at a level approaching human expertise. Complex architectures must be recognized within the trees, so that associated information can be extracted. Results: Here, we present a new software library, PhyloPattern, for automating tree manipulations and analysis. PhyloPattern includes three main modules, which address essential tasks in high-throughput phylogenetic tree analysis: node annotation, pattern matching, and tree comparison. PhyloPattern thus allows the programmer to focus on: i) the use of predefined or user-defined annotation functions to perform immediate or deferred evaluation of node properties, ii) the search for user-defined patterns in large phylogenetic trees, iii) the pairwise comparison of trees by dynamically generating patterns from one tree and applying them to the other. Conclusion: PhyloPattern greatly simplifies and accelerates the work of the computer scientist in the evolutionary biology field. The library has been used to automatically identify phylogenetic evidence for domain shuffling or gene loss events in the evolutionary histories of protein sequences. However, any workflow that relies on phylogenetic tree analysis could be automated with PhyloPattern.

    Evaluation of cell-free DNA approaches for multi-cancer early detection

    In the Circulating Cell-free Genome Atlas (NCT02889978) substudy 1, we evaluate several approaches for a circulating cell-free DNA (cfDNA)-based multi-cancer early detection (MCED) test by defining clinical limit of detection (LOD) based on circulating tumor allele fraction (cTAF), enabling performance comparisons. Among 10 machine-learning classifiers trained on the same samples and independently validated, when evaluated at 98% specificity, those using whole-genome (WG) methylation, single nucleotide variants with paired white blood cell background removal, and combined scores from classifiers evaluated in this study show the highest cancer signal detection sensitivities. Compared with clinical stage and tumor type, cTAF is a more significant predictor of classifier performance and may more closely reflect tumor biology. Clinical LODs mirror relative sensitivities for all approaches. The WG methylation feature best predicts cancer signal origin. WG methylation is the most promising technology for MCED and informs development of a targeted methylation MCED test

    Evolutionary origin of peptidoglycan recognition proteins in vertebrate innate immune system

    Background: Innate immunity is the ancient defense system of multicellular organisms against microbial infection. The basis of this first line of defense resides in the recognition of unique motifs conserved in microorganisms, and absent in the host. Peptidoglycans, structural components of bacterial cell walls, are recognized by Peptidoglycan Recognition Proteins (PGRPs). PGRPs are present in both vertebrates and invertebrates. Although some evidence for similarities and differences in function and structure between them has been found, their evolutionary history and phylogenetic relationship have remained unclear. Such studies have been severely hampered by the great extent of sequence divergence among vertebrate and invertebrate PGRPs. Here we investigate the birth and death processes of PGRPs to elucidate their origin and diversity. Results: We found that (i) four rounds of gene duplication and a single domain duplication have generated the major variety of present vertebrate PGRPs, while in invertebrates more than ten times the number of duplications are required to explain the repertoire of present PGRPs, and (ii) the death of genes in vertebrates appears to be almost null whereas in invertebrates it is frequent. Conclusion: These results suggest that the emergence of new PGRP genes may have an impact on the availability of the repertoire and its function against pathogens. These striking differences in PGRP evolution of vertebrates and invertebrates should reflect the differences in the role of their innate immunity. Insights into the origin of PGRP genes will pave the way to understanding the evolution of the interaction between host and pathogens and to the development of new treatments for immune diseases that involve proteins related to the recognition of self and non-self.

    Age- and region-specific hepatitis B prevalence in Turkey estimated using generalized linear mixed models: a systematic review

    Toy M, Önder FO, Wörmann T, et al. Age- and region-specific hepatitis B prevalence in Turkey estimated using generalized linear mixed models: a systematic review. BMC Infectious Diseases. 2011;11(1):337. BACKGROUND: To provide a clear picture of the current hepatitis B situation, the authors performed a systematic review to estimate the age- and region-specific prevalence of chronic hepatitis B (CHB) in Turkey. METHODS: A total of 339 studies with original data on the prevalence of hepatitis B surface antigen (HBsAg) in Turkey and published between 1999 and 2009 were identified through a search of electronic databases, by reviewing citations, and by writing to authors. After a critical assessment, the authors included 129 studies, divided into categories: 'age-specific'; 'region-specific'; and 'specific population group'. To account for the differences among the studies, a generalized linear mixed model was used to estimate the overall prevalence across all age groups and regions. For specific population groups, the authors calculated the weighted mean prevalence. RESULTS: The estimated overall population prevalence was 4.57 (95% confidence interval (CI): 3.58, 5.76), and the estimated total number of CHB cases was about 3.3 million. The outcomes of the age-specific groups varied from 2.84 (95% CI: 2.60, 3.10) for the 0-14-year-olds to 6.36 (95% CI: 5.83, 6.90) in the 25-34-year-old group. CONCLUSION: There are large age-group and regional differences in CHB prevalence in Turkey, where CHB remains a serious health problem

    Early evolution of the LIM homeobox gene family

    Background: LIM homeobox (Lhx) transcription factors are unique to the animal lineage and have patterning roles during embryonic development in flies, nematodes and vertebrates, with a conserved role in specifying neuronal identity. Though genes of this family have been reported in a sponge and a cnidarian, the expression patterns and functions of the Lhx family during development in non-bilaterian phyla are not known

    Molecular evolution of the LNX gene family

    Background: LNX (Ligand of Numb Protein-X) proteins typically contain an amino-terminal RING domain adjacent to either two or four PDZ domains - a domain architecture that is unique to the LNX family. LNX proteins function as E3 ubiquitin ligases and their domain organisation suggests that their ubiquitin ligase activity may be targeted to specific substrates or subcellular locations by PDZ domain-mediated interactions. Indeed, numerous interaction partners for LNX proteins have been identified, but the in vivo functions of most family members remain largely unclear. Results: To gain insights into their function we examined the phylogenetic origins and evolution of the LNX gene family. We find that a LNX1/LNX2-like gene arose in an early metazoan lineage by gene duplication and fusion events that combined a RING domain with four PDZ domains. These PDZ domains are closely related to the four carboxy-terminal domains from multiple PDZ domain containing protein-1 (MUPP1). Duplication of the LNX1/LNX2-like gene and subsequent loss of PDZ domains appears to have generated a gene encoding a LNX3/LNX4-like protein, with just two PDZ domains. This protein has novel carboxy-terminal sequences that include a potential modular LNX3 homology domain. The two ancestral LNX genes are present in some, but not all, invertebrate lineages. They were, however, maintained in the vertebrate lineage, with further duplication events giving rise to five LNX family members in most mammals. In addition, we identify novel interactions of LNX1 and LNX2 with three known MUPP1 ligands using yeast two-hybrid assays. This demonstrates conservation of binding specificity between LNX and MUPP1 PDZ domains. Conclusions: The LNX gene family has an early metazoan origin with a LNX1/LNX2-like protein likely giving rise to a LNX3/LNX4-like protein through the loss of PDZ domains. The absence of LNX orthologs in some lineages indicates that LNX proteins are not essential in invertebrates. In contrast, the maintenance of both ancestral LNX genes in the vertebrate lineage suggests the acquisition of essential vertebrate-specific functions. The revelation that the LNX PDZ domains are phylogenetically related to domains in MUPP1, and have common binding specificities, suggests that LNX and MUPP1 may have similarities in their cellular functions.

    RNA-Seq Mapping and Detection of Gene Fusions with a Suffix Array Algorithm

    High-throughput RNA sequencing enables quantification of transcripts (both known and novel), exon/exon junctions and fusions of exons from different genes. Discovery of gene fusions, particularly those expressed with low abundance, is a challenge with short- and medium-length sequencing reads. To address this challenge, we implemented an RNA-Seq mapping pipeline within the LifeScope software. We introduced new features including filter and junction mapping, annotation-aided pairing rescue and accurate mapping quality values. We combined this pipeline with a Suffix Array Spliced Read (SASR) aligner to detect chimeric transcripts. Performing paired-end RNA-Seq of the breast cancer cell line MCF-7 using the SOLiD system, we called 40 gene fusions among over 120,000 splicing junctions. We validated 36 of these 40 fusions with TaqMan assays, of which 25 were expressed in MCF-7 but not the Human Brain Reference. An intra-chromosomal gene fusion involving the estrogen receptor alpha gene ESR1, and another involving the RPS6KB1 (Ribosomal protein S6 kinase beta-1) were recurrently expressed in a number of breast tumor cell lines and a clinical tumor sample

    Quantifying the Effects of Elastic Collisions and Non-Covalent Binding on Glutamate Receptor Trafficking in the Post-Synaptic Density

    One mechanism of information storage in neurons is believed to be determined by the strength of synaptic contacts. The strength of an excitatory synapse is partially due to the concentration of a particular type of ionotropic glutamate receptor (AMPAR) in the post-synaptic density (PSD). AMPAR concentration in the PSD has to be plastic, to allow the storage of new memories; but it also has to be stable to preserve important information. Although much is known about the molecular identity of synapses, the biophysical mechanisms by which AMPAR can enter, leave and remain in the synapse are unclear. We used Monte Carlo simulations to determine the influence of PSD structure and activity in maintaining homeostatic concentrations of AMPARs in the synapse. We found that the high concentration and excluded volume caused by PSD molecules result in molecular crowding. Diffusion of AMPAR in the PSD under such conditions is anomalous. Anomalous diffusion of AMPAR results in retention of these receptors inside the PSD for periods ranging from minutes to several hours in the absence of strong binding of receptors to PSD molecules. Trapping of receptors in the PSD by crowding effects was very sensitive to the concentration of PSD molecules, showing a switch-like behavior for retention of receptors. Non-covalent binding of AMPAR to anchored PSD molecules allowed the synapse to become well-mixed, resulting in normal diffusion of AMPAR. Binding also allowed the exchange of receptors in and out of the PSD. We propose that molecular crowding is an important biophysical mechanism to maintain homeostatic synaptic concentrations of AMPARs in the PSD without the need for energetically expensive biochemical reactions. In this context, binding of AMPAR with PSD molecules could collaborate with crowding to maintain synaptic homeostasis but could also allow synaptic plasticity by increasing the exchange of these receptors with the surrounding extra-synaptic membrane