83 research outputs found

    Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Probability based statistical learning methods such as mutual information and Bayesian networks have emerged as a major category of tools for reverse engineering mechanistic relationships from quantitative biological data. In this work we introduce a new statistical learning strategy, MI3 that addresses three common issues in previous methods simultaneously: (1) handling of continuous variables, (2) detection of more complex three-way relationships and (3) better differentiation of causal versus confounding relationships. With these improvements, we provide a more realistic representation of the underlying biological system.</p> <p>Results</p> <p>We test the MI3 algorithm using both synthetic and experimental data. In the synthetic data experiment, MI3 achieved an absolute sensitivity/precision of 0.77/0.83 and a relative sensitivity/precision both of 0.99. In addition, MI3 significantly outperformed the control methods, including Bayesian networks, classical two-way mutual information and a discrete version of MI3. We then used MI3 and control methods to infer a regulatory network centered at the MYC transcription factor from a published microarray dataset. Models selected by MI3 were numerically and biologically distinct from those selected by control methods. Unlike control methods, MI3 effectively differentiated true causal models from confounding models. MI3 recovered major MYC cofactors, and revealed major mechanisms involved in MYC dependent transcriptional regulation, which are strongly supported by literature. The MI3 network showed that limited sets of regulatory mechanisms are employed repeatedly to control the expression of large number of genes.</p> <p>Conclusion</p> <p>Overall, our work demonstrates that MI3 outperforms the frequently used control methods, and provides a powerful method for inferring mechanistic relationships underlying biological and other complex systems. The MI3 method is implemented in R in the "mi3" package, available under the GNU GPL from <url>http://sysbio.engin.umich.edu/~luow/downloads.php</url> and from the R package archive CRAN.</p

    Spermatogenesis drives rapid gene creation and masculinization of the X chromosome in stalk-eyed flies (Diopsidae)

    Get PDF
    Throughout their evolutionary history, genomes acquire new genetic material that facilitates phenotypic innovation and diversification. Developmental processes associated with reproduction are particularly likely to involve novel genes. Abundant gene creation impacts the evolution of chromosomal gene content and general regulatory mechanisms such as dosage compensation. Numerous studies in model organisms have found complex and, at times contradictory, relationships among these genomic attributes highlighting the need to examine these patterns in other systems characterized by abundant sexual selection. Therefore, we examined the association among novel gene creation, tissue-specific gene expression, and chromosomal gene content within stalk-eyed flies. Flies in this family are characterized by strong sexual selection and the presence of a newly evolved X chromosome. We generated RNA-seq transcriptome data from the testes for three species within the family and from seven additional tissues in the highly dimorphic species, Teleopsis dalmanni. Analysis of dipteran gene orthology reveals dramatic testes-specific gene creation in stalk-eyed flies, involving numerous gene families that are highly conserved in other insect groups. Identification of X-linked genes for the three species indicates that the X chromosome arose prior to the diversification of the family. The most striking feature of this X chromosome is that it is highly masculinized, containing nearly twice as many testes-specific genes as expected based on its size. All the major processes that may drive differential sex chromosome gene content—creation of genes with male-specific expression, development of male-specific expression from pre-existing genes, and movement of genes with male-specific expression—are elevated on the X chromosome of T. dalmanni. This masculinization occurs despite evidence that testes expressed genes do not achieve the same levels of gene expression on the X chromosome as they do on the autosomes. © The Author 2016

    Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

    Get PDF
    BACKGROUND: Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR) index can be generated to map repetitive regions in genomic sequences. RESULTS: We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC) clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. CONCLUSION: An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences) regions in uncharacterised genomic sequences. The restriction that a particular MDR index can not be used across species is outweighed by the low costs of Illumina/Solexa sequencing which makes any chosen genome accessible for whole-genome sequence sampling

    Transcriptome Deep-Sequencing and Clustering of Expressed Isoforms from Favia Corals

    Full text link
    Background: Genomic and transcriptomic sequence data are essential tools for tackling ecological problems. Using an approach that combines next-generation sequencing, de novo transcriptome assembly, gene annotation and synthetic gene construction, we identify and cluster the protein families from Favia corals from the northern Red Sea. Results: We obtained 80 million 75 bp paired-end cDNA reads from two Favia adult samples collected at 65 m (Fav1, Fav2) on the Illumina GA platform, and generated two de novo assemblies using ABySS and CAP3. After removing redundancy and filtering out low quality reads, our transcriptome datasets contained 58,268 (Fav1) and 62,469 (Fav2) contigs longer than 100 bp, with N50 values of 1,665 bp and 1,439 bp, respectively. Using the proteome of the sea anemone Nematostella vectensis as a reference, we were able to annotate almost 20% of each dataset using reciprocal homology searches. Homologous clustering of these annotated transcripts allowed us to divide them into 7,186 (Fav1) and 6,862 (Fav2) homologous transcript clusters (E-value ≀ 2e-30). Functional annotation categories were assigned to homologous clusters using the functional annotation of Nematostella vectensis. General annotation of the assembled transcripts was improved 1-3% using the Acropora digitifera proteome. In addition, we screened these transcript isoform clusters for fluorescent proteins (FPs) homologs and identified seven potential FP homologs in Fav1, and four in Fav2. These transcripts were validated as bona fide FP transcripts via robust fluorescence heterologous expression. Annotation of the assembled contigs revealed that 1.34% and 1.61% (in Fav1 and Fav2, respectively) of the total assembled contigs likely originated from the corals’ algal symbiont, Symbiodinium spp. Conclusions: Here we present a study to identify the homologous transcript isoform clusters from the transcriptome of Favia corals using a far-related reference proteome. Furthermore, the symbiont-derived transcripts were isolated from the datasets and their contribution quantified. This is the first annotated transcriptome of the genus Favia, a major increase in genomics resources available in this important family of corals

    Transcriptome Deep-Sequencing and Clustering of Expressed Isoforms from Favia Corals

    Get PDF
    Background: Genomic and transcriptomic sequence data are essential tools for tackling ecological problems. Using an approach that combines next-generation sequencing, de novo transcriptome assembly, gene annotation and synthetic gene construction, we identify and cluster the protein families from Favia corals from the northern Red Sea. Results: We obtained 80 million 75 bp paired-end cDNA reads from two Favia adult samples collected at 65 m (Fav1, Fav2) on the Illumina GA platform, and generated two de novo assemblies using ABySS and CAP3. After removing redundancy and filtering out low quality reads, our transcriptome datasets contained 58,268 (Fav1) and 62,469 (Fav2) contigs longer than 100 bp, with N50 values of 1,665 bp and 1,439 bp, respectively. Using the proteome of the sea anemone Nematostella vectensis as a reference, we were able to annotate almost 20% of each dataset using reciprocal homology searches. Homologous clustering of these annotated transcripts allowed us to divide them into 7,186 (Fav1) and 6,862 (Fav2) homologous transcript clusters (E-value ≀ 2e-30). Functional annotation categories were assigned to homologous clusters using the functional annotation of Nematostella vectensis. General annotation of the assembled transcripts was improved 1-3% using the Acropora digitifera proteome. In addition, we screened these transcript isoform clusters for fluorescent proteins (FPs) homologs and identified seven potential FP homologs in Fav1, and four in Fav2. These transcripts were validated as bona fide FP transcripts via robust fluorescence heterologous expression. Annotation of the assembled contigs revealed that 1.34% and 1.61% (in Fav1 and Fav2, respectively) of the total assembled contigs likely originated from the corals’ algal symbiont, Symbiodinium spp. Conclusions: Here we present a study to identify the homologous transcript isoform clusters from the transcriptome of Favia corals using a far-related reference proteome. Furthermore, the symbiont-derived transcripts were isolated from the datasets and their contribution quantified. This is the first annotated transcriptome of the genus Favia, a major increase in genomics resources available in this important family of corals

    The mitogenome of the bed bug Cimex lectularius (Hemiptera: Cimicidae)

    Full text link
    We report the extraction of a bed bug mitogenome from high-throughput sequencing projects originally focused on the nuclear genome of Cimex lectularius. The assembled mitogenome has a similar AT nucleotide composition bias found in other insects. Phylogenetic analysis of all protein-coding genes indicates that C. lectularius is clearly a member of a paraphyletic Cimicomorpha clade within the Order Hemiptera

    Antimicrobial sensing coupled with cell membrane remodeling mediates antibiotic resistance and virulence in Enterococcus faecalis.

    Get PDF
    Bacteria have developed several evolutionary strategies to protect their cell membranes (CMs) from the attack of antibiotics and antimicrobial peptides (AMPs) produced by the innate immune system, including remodeling of phospholipid content and localization. Multidrug-resistant Enterococcus faecalis, an opportunistic human pathogen, evolves resistance to the lipopeptide daptomycin and AMPs by diverting the antibiotic away from critical septal targets using CM anionic phospholipid redistribution. The LiaFSR stress response system regulates this CM remodeling via the LiaR response regulator by a previously unknown mechanism. Here, we characterize a LiaR-regulated protein, LiaX, that senses daptomycin or AMPs and triggers protective CM remodeling. LiaX is surface exposed, and in daptomycin-resistant clinical strains, both LiaX and the N-terminal domain alone are released into the extracellular milieu. The N-terminal domain of LiaX binds daptomycin and AMPs (such as human LL-37) and functions as an extracellular sentinel that activates the cell envelope stress response. The C-terminal domain of LiaX plays a role in inhibiting the LiaFSR system, and when this domain is absent, it leads to activation of anionic phospholipid redistribution. Strains that exhibit LiaX-mediated CM remodeling and AMP resistance show enhanced virulence in the Caenorhabditis elegans model, an effect that is abolished in animals lacking an innate immune pathway crucial for producing AMPs. In conclusion, we report a mechanism of antibiotic and AMP resistance that couples bacterial stress sensing to major changes in CM architecture, ultimately also affecting host-pathogen interactions

    Mobile-genetic-element-encoded hypertolerance to copper protects Staphylococcus aureus from killing by host phagocytes

    Get PDF
    M.Z. and J.A.G. were supported by funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement 634588. K.J.W. was supported by a Sir Henry Dale Fellowship funded by the Wellcome Trust and the Royal Society (098375/Z/12/Z). G.P.R. was funded by a CAPES Science Without Borders scholarship (BEX 2445/13-1). M.T.G.H. is funded by the Chief Scientist Office through the Scottish Infection Research Network, a part of the SHAIPI consortium (grant SIRN/10).Pathogens are exposed to toxic levels of copper during infection, and copper tolerance may be a general virulence mechanism used by bacteria to resist host defenses. In support of this, inactivation of copper exporter genes has been found to reduce the virulence of bacterial pathogens in vivo. Here we investigate the role of copper hypertolerance in methicillin-resistant Staphylococcus aureus (MRSA). We show that a copper hypertolerance operon (copB-mco), carried on a mobile genetic element (MGE), is prevalent in a collection of invasive S. aureus strains and more widely among clonal complex 22, 30, and 398 strains. The copB and mco genes encode a copper efflux pump and a multicopper oxidase, respectively. Isogenic mutants lacking copB or mco had impaired growth in subinhibitory concentrations of copper. Transfer of a copB-mco-carrying plasmid to a naive clinical isolate resulted in a gain of copper hypertolerance and enhanced bacterial survival inside primed macrophages. The copB and mco genes were upregulated within infected macrophages, and their expression was dependent on the copper-sensitive operon repressor CsoR. Isogenic copB and mco mutants were impaired in their ability to persist intracellularly in macrophages and were less resistant to phagocytic killing in human blood than the parent strain. The importance of copper-regulated genes in resistance to phagocytic killing was further elaborated using mutants expressing a copper-insensitive variant of CsoR. Our findings suggest that the gain of mobile genetic elements carrying copper hypertolerance genes contributes to the evolution of virulent strains of S. aureus that are better equipped to resist killing by host immune cells.Publisher PDFPeer reviewe
    • 

    corecore