170 research outputs found

    Transposon insertion mapping with PIMMS, Pragmatic Insertional Mutation Mapping System

    Get PDF
    The PIMMS (Pragmatic Insertional Mutation Mapping System) pipeline has beendeveloped for simple conditionally essential genome discovery experiments in bacteria.Capable of using raw sequence data files alongside a FASTA sequence of thereference genome and GFF file, PIMMS will generate a tabulated output of each codingsequence with corresponding mapped insertions accompanied with normalized resultsenabling streamlined analysis. This allows for a quick assay of the genome to identifyconditionally essential genes on a standar d desktop computer prioritizing results forfurther investigation

    Identification of DNA methylation biomarkers from Infinium arrays

    Get PDF
    Epigenetic modifications of DNA, such as cytosine methylation are differentially abundant in diseases such as cancer. A goal for clinical research is finding sites that are differentially methylated between groups of samples to act as potential biomarkers for disease outcome. However, clinical samples are often limited in availability, represent a heterogeneous collection of cells or are of uncertain clinical class. Array based methods for identification of methylation provide a cost effective method to survey a proportion of the methylome at single base resolution. The Illumina Infinium array has become a popular and reliable high throughput method in this field and are proving useful in the identification of biomarkers for disease. Here, we compare a commonly used statistical test with a new intuitive and flexible computational approach to quickly detect differentially methylated sites. The method rapidly identifies and ranks candidate lists with greatest inter-group variability whilst controlling for intra-group variability. Intuitive and biologically relevant filters can be imposed to quickly identify sites and genes of interest

    Duplicated Paralogous Genes Subject to Positive Selection in the Genome of Trypanosoma brucei

    Get PDF
    Background Whole genome studies have highlighted duplicated genes as important substrates for adaptive evolution. We have investigated adaptive evolution in this class of genes in the human parasite Trypanosoma brucei, as indicated by the ratio of non-synonymous (amino-acid changing) to synonymous (amino acid retaining) nucleotide substitution rates. Methodology/Principal Findings We have identified duplicated genes that are most rapidly evolving in this important human parasite. This is the first attempt to investigate adaptive evolution in this species at the codon level. We identify 109 genes within 23 clusters of paralogous gene expansions to be subject to positive selection. Conclusions/Significance Genes identified include surface antigens in both the mammalian and insect host life cycle stage suggesting that competitive interaction is not solely with the adaptive immune system of the mammalian host. Also surface transporters related to drug resistance and genes related to developmental progression are detected. We discuss how adaptive evolution of these genes may highlight lineage specific processes essential for parasite survival. We also discuss the implications of adaptive evolution of these targets for parasite biology and control

    Triaging informative cis-regulatory elements for the combinatorial control of temporal gene expression during Plasmodium falciparum intraerythrocytic development

    Get PDF
    Background: Over 2700 genes are subject to stage-specific regulation during the intraerythrocytic development of the human malaria parasite Plasmodium falciparum. Bioinformatic analyses have identified a large number of over-represented motifs in the 5′ flanking regions of these genes that may act as cis-acting factors in the promoter-based control of temporal expression. Triaging these lists to provide candidates most likely to play a role in regulating temporal expression is challenging, but important if we are to effectively design in vitro studies to validate this role. Methods: We report here the application of a repeated search of variations of 5′ flanking sequences from P. falciparum using the Finding Informative Regulatory Elements (FIRE) algorithm. Results: Our approach repeatedly found a short-list of high scoring DNA motifs, for which cognate specific transcription factors were available, that appear to be typically associated with upregulation of mRNA accumulation during the first half of intraerythrocytic development. Conclusions: We propose these cis-trans interactions may provide a combinatorial promoter-based control of gene expression to complement more global mechanisms of gene regulation that can account for temporal control during the second half of intraerythrocytic development

    The Human Postsynaptic Density Shares Conserved Elements with Proteomes of Unicellular Eukaryotes and Prokaryotes

    Get PDF
    The animal nervous system processes information from the environment and mediates learning and memory using molecular signaling pathways in the postsynaptic terminal of synapses. Postsynaptic neurotransmitter receptors assemble to form multiprotein complexes that drive signal transduction pathways to downstream cell biological processes. Studies of mouse and Drosophila postsynaptic proteins have identified key roles in synaptic physiology and behavior for a wide range of proteins including receptors, scaffolds, enzymes, structural, translational, and transcriptional regulators. Comparative proteomic and genomic studies identified components of the postsynaptic proteome conserved in eukaryotes and early metazoans. We extend these studies, and examine the conservation of genes and domains found in the human postsynaptic density with those across the three superkingdoms, archaeal, bacteria, and eukaryota. A conserved set of proteins essential for basic cellular functions were conserved across the three superkingdoms, whereas synaptic structural and many signaling molecules were specific to the eukaryote lineage. Genes involved with metabolism and environmental signaling in Escherichia coli including the chemotactic and ArcAB Two-Component signal transduction systems shared homologous genes in the mammalian postsynaptic proteome. These data suggest conservation between prokaryotes and mammalian synapses of signaling mechanisms from receptors to transcriptional responses, a process essential to learning and memory in vertebrates. A number of human postsynaptic proteins with homologs in prokaryotes are mutated in human genetic diseases with nervous system pathology. These data also indicate that structural and signaling proteins characteristic of postsynaptic complexes arose in the eukaryotic lineage and rapidly expanded following the emergence of the metazoa, and provide an insight into the early evolution of synaptic mechanisms and conserved mechanisms of learning and memory

    Diverse Spatial, Temporal, and Sexual Expression of Recently Duplicated Androgen-Binding Protein Genes in \u3ci\u3eMus musculus\u3c/i\u3e

    Get PDF
    Background The genes for salivary androgen-binding protein (ABP) subunits have been evolving rapidly in ancestors of the house mouse Mus musculus, as evidenced both by recent and extensive gene duplication and by high ratios of nonsynonymous to synonymous nucleotide substitution rates. This makes ABP an appropriate model system with which to investigate how recent adaptive evolution of paralogous genes results in functional innovation (neofunctionalization). Results It was our goal to find evidence for the expression of as many of the Abp paralogues in the mouse genome as possible. We observed expression of six Abpa paralogues and five Abpbg paralogues in ten glands and other organs located predominantly in the head and neck (olfactory lobe of the brain, three salivary glands, lacrimal gland, Harderian gland, vomeronasal organ, and major olfactory epithelium). These Abp paralogues differed dramatically in their specific expression in these different glands and in their sexual dimorphism of expression. We also studied the appearance of expression in both late-stage embryos and postnatal animals prior to puberty and found significantly different timing of the onset of expression among the various paralogues. Conclusion The multiple changes in the spatial expression profile of these genes resulting in various combinations of expression in glands and other organs in the head and face of the mouse strongly suggest that neofunctionalization of these genes, driven by adaptive evolution, has occurred following duplication. The extensive diversification in expression of this family of proteins provides two lines of evidence for a pheromonal role for ABP: 1) different patterns of Abpa/Abpbg expression in different glands; and 2) sexual dimorphism in the expression of the paralogues in a subset of those glands. These expression patterns differ dramatically among various glands that are located almost exclusively in the head and neck, where the sensory organs are located. Since mice are nocturnal, it is expected that they will make extensive use of olfactory as opposed to visual cues. The glands expressing Abp paralogues produce secretions (lacrimal and salivary) or detect odors (MOE and VNO) and thus it appears highly likely that ABP proteins play a role in olfactory communication

    Gene Expression in the Microfilariae of Brugia pahangi

    Get PDF
    Filarial nematodes have a complex developmental life cycle involving mammalian and mosquito hosts. The microfilariae (mf) which circulate in the blood stream are developmentally blocked prior to transfer to the mosquito. The switch between the different environments must be accompanied by changes in gene expression to allow development within the new host. The aim of this study was to isolate mf genes from the mf of B. pahangi that are differentially expressed within the two hosts. To this end a mammalian-derived mf cDNA library was constructed and differentially screened using mammalian and mosquito-derived mf cDNA probes. Due to the difficulty of obtaining in vivo parasites from the mosquito, the cDNA probes were produced by RT-PCR from mf culture in vitro under either mammalian or mosquito like conditions. Nine independent cDNA clones were isolated, of which five hybridised more strongly to the mammalian-derived mf probe and four to the vector-derived mf probe. Analysis of the cDNAs nucleotide sequence revealed that five of the clones were homologous to ribosomal protein mRNAs previously characterised from other species, one cDNA corresponded to the B. pahangi heat shock protein 90 mRNA and three cDNAs represented novel genes of unknown function. A more detailed molecular analysis was conducted on two of the cDNAs of unknown function, Bp-vmc-2 (B. pahangi-vector-derived mf cDNA-2) and Bp-mmc- 1 (B. pahangi-mammalian-derived mf cDNA-1). Analysis of the expression of the mRNAs by semi-quantitative RT-PCR showed that Bp-vmc-2 was expressed at varying levels throughout the life-cycle of B. pahangi. The mRNA abundance of Bp- vmc-2 increased as the parasite matured in the mosquito host and reached peak expression in infective L3 parasites. In contrast Bp-mmc-1 was exclusive to the mf stage and was not expressed by mf developing in utero. An antiserum raised to the recombinant protein localised MMC-1 throughout the body of the mf, but it was not present on the surface of the sheath or cuticle of the parasite. Western blotting using MMC-1 antiserum, reacted specifically with mf extracts, confirming the specificity of temporal expression. Analysis of immune responses to the recombinant protein showed that significant amounts of IL-5 were produced by T-cells in response to stimulation with MMC-1. Human serum from patients infected with B. malayi recognised MMC-1 and the predominant reactive immunoglobulin subclasses were IgGl and IgG3, which have been associated with disease pathology in other studies

    A consensus approach to vertebrate de novo transcriptome assembly from RNA-seq data: assembly of the duck (Anas platyrhynchos) transcriptome

    Get PDF
    For vertebrate organisms where a reference genome is not available, de novo transcriptome assembly enables a cost effective insight into the identification of tissue specific or differentially expressed genes and variation of the coding part of the genome. However, since there are a number of different tools and parameters that can be used to reconstruct transcripts, it is difficult to determine an optimal method. Here we suggest a pipeline based on (1) assessing the performance of three different assembly tools (2) using both single and multiple k-mer (MK) approaches (3) examining the influence of the number of reads used in the assembly (4) merging assemblies from different tools. We use an example dataset from the vertebrate Anas platyrhynchos domestica (Pekin duck). We find that taking a subset of data enables a robust assembly to be produced by multiple methods without the need for very high memory capacity. The use of reads mapped back to transcripts (RMBT) and CEGMA (Core Eukaryotic Genes Mapping Approach) provides useful metrics to determine the completeness of assembly obtained. For this dataset the use of MK in the assembly generated a more complete assembly as measured by greater number of RMBT and CEGMA score. Merged single k-mer assemblies are generally smaller but consist of longer transcripts, suggesting an assembly consisting of fewer fragmented transcripts. We suggest that the use of a subset of reads during assembly allows the relatively rapid investigation of assembly characteristics and can guide the user to the most appropriate transcriptome for particular downstream use. Transcriptomes generated by the compared assembly methods and the final merged assembly are freely available for download at http://dx.doi.org/10.6084/m9.figshare.1032613

    Comparison of Clustering Methods for Investigation of Genome-Wide Methylation Array Data

    Get PDF
    The use of genome-wide methylation arrays has proved very informative to investigate both clinical and biological questions in human epigenomics. The use of clustering methods either for exploration of these data or to compare to an a priori grouping, e.g., normal versus disease allows assessment of groupings of data without user bias. However no consensus on the methods to use for clustering of methylation array approaches has been reached. To determine the most appropriate clustering method for analysis of illumina array methylation data, a collection of data sets was simulated and used to compare clustering methods. Both hierarchical clustering and non-hierarchical clustering methods (k-means, k-medoids, and fuzzy clustering algorithms) were compared using a range of distance and linkage methods. As no single method consistently outperformed others across different simulations, we propose a method to capture the best clustering outcome based on an additional measure, the silhouette width. This approach produced a consistently higher cluster accuracy compared to using any one method in isolation
    corecore