28,442 research outputs found

    Systematic identification of functional plant modules through the integration of complementary data sources

    Get PDF
    A major challenge is to unravel how genes interact and are regulated to exert specific biological functions. The integration of genome-wide functional genomics data, followed by the construction of gene networks, provides a powerful approach to identify functional gene modules. Large-scale expression data, functional gene annotations, experimental protein-protein interactions, and transcription factor-target interactions were integrated to delineate modules in Arabidopsis (Arabidopsis thaliana). The different experimental input data sets showed little overlap, demonstrating the advantage of combining multiple data types to study gene function and regulation. In the set of 1,563 modules covering 13,142 genes, most modules displayed strong coexpression, but functional and cis-regulatory coherence was less prevalent. Highly connected hub genes showed a significant enrichment toward embryo lethality and evidence for cross talk between different biological processes. Comparative analysis revealed that 58% of the modules showed conserved coexpression across multiple plants. Using module-based functional predictions, 5,562 genes were annotated, and an evaluation experiment disclosed that, based on 197 recently experimentally characterized genes, 38.1% of these functions could be inferred through the module context. Examples of confirmed genes of unknown function related to cell wall biogenesis, xylem and phloem pattern formation, cell cycle, hormone stimulus, and circadian rhythm highlight the potential to identify new gene functions. The module-based predictions offer new biological hypotheses for functionally unknown genes in Arabidopsis (1,701 genes) and six other plant species (43,621 genes). Furthermore, the inferred modules provide new insights into the conservation of coexpression and coregulation as well as a starting point for comparative functional annotation

    MorphDB : prioritizing genes for specialized metabolism pathways and gene ontology categories in plants

    Get PDF
    Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest

    Transcriptome of the dead: characterisation of immune genes and marker development from necropsy samples in a free-ranging marine mammal

    Get PDF
    Background Transcriptomes are powerful resources, providing a window on the expressed portion of the genome that can be generated rapidly and at low cost for virtually any organism. However, because many genes have tissue-specific expression patterns, developing a complete transcriptome usually requires a 'discovery pool' of individuals to be sacrificed in order to harvest mRNA from as many different types of tissue as possible. This hinders transcriptome development in large, charismatic and endangered species, many of which stand the most to gain from such approaches. To circumvent this problem in a model pinniped species, we 454 sequenced cDNA from testis, heart, spleen, intestine, kidney and lung tissues obtained from nine adult male Antarctic fur seals (Arctocephalus gazella) that died of natural causes at Bird Island, South Georgia. Results After applying stringent quality control criteria based on length and annotation, we obtained 12,397 contigs which, in combination with 454 data previously obtained from skin, gave a total of 23,096 unique contigs. Homology was found to 77.0% of dog (Canis lupus familiaris) transcripts, suggesting that the combined assembly represents a substantial proportion of this species' transcriptome. Moreover, only 0.5% of transcripts revealed sequence similarity to bacteria, implying minimal contamination, and the percentage of transcripts involved in cell death was low at 2.6%. Transcripts with immune-related annotations were almost five-fold enriched relative to skin and represented 13.2% of all spleen-specific contigs. By reference to the dog, we also identified transcripts revealing homology to five class I, ten class II and three class III genes of the Major Histocompatibility Complex and derived the putative genomic distribution of 17,121 contigs, 2,119 in silico mined microsatellites and 9,382 single nucleotide polymorphisms. Conclusions Our findings suggest that transcriptome development based on samples collected post mortem may greatly facilitate genomic studies, not only of marine mammals but also more generally of species that are of conservation concern

    Using giant scarlet runner bean embryos to uncover regulatory networks controlling suspensor gene activity.

    Get PDF
    One of the major unsolved issues in plant development is understanding the regulatory networks that control the differential gene activity that is required for the specification and development of the two major embryonic regions, the embryo proper and suspensor. Historically, the giant embryo of scarlet runner bean (SRB), Phaseolus coccineus, has been used as a model system to investigate the physiological events that occur early in embryogenesis-focusing on the question of what role the suspensor region plays. A major feature distinguishing SRB embryos from those of other plants is a highly enlarged suspensor containing at least 200 cells that synthesize growth regulators required for subsequent embryonic development. Recent studies have exploited the giant size of the SRB embryo to micro-dissect the embryo proper and suspensor regions in order to use genomics-based approaches to identify regulatory genes that may be involved in controlling suspensor and embryo proper differentiation, as well as the cellular processes that may be unique to each embryonic region. Here we review the current genomics resources that make SRB embryos a compelling model system for studying the early events required to program embryo development

    Ancient Pbx-Hox signatures define hundreds of vertebrate developmental enhancers

    Get PDF
    Background: Gene regulation through cis-regulatory elements plays a crucial role in development and disease. A major aim of the post-genomic era is to be able to read the function of cis-regulatory elements through scrutiny of their DNA sequence. Whilst comparative genomics approaches have identified thousands of putative regulatory elements, our knowledge of their mechanism of action is poor and very little progress has been made in systematically de-coding them. Results: Here, we identify ancient functional signatures within vertebrate conserved non-coding elements (CNEs) through a combination of phylogenetic footprinting and functional assay, using genomic sequence from the sea lamprey as a reference. We uncover a striking enrichment within vertebrate CNEs for conserved binding-site motifs of the Pbx-Hox hetero-dimer. We further show that these predict reporter gene expression in a segment specific manner in the hindbrain and pharyngeal arches during zebrafish development. Conclusions: These findings evoke an evolutionary scenario in which many CNEs evolved early in the vertebrate lineage to co-ordinate Hox-dependent gene-regulatory interactions that pattern the vertebrate head. In a broader context, our evolutionary analyses reveal that CNEs are composed of tightly linked transcription-factor binding-sites (TFBSs), which can be systematically identified through phylogenetic footprinting approaches. By placing a large number of ancient vertebrate CNEs into a developmental context, our findings promise to have a significant impact on efforts toward de-coding gene-regulatory elements that underlie vertebrate development, and will facilitate building general models of regulatory element evolution

    Conserved noncoding sequences highlight shared components of regulatory networks in dicotyledonous plants

    Get PDF
    Conserved noncoding sequences (CNSs) in DNA are reliable pointers to regulatory elements controlling gene expression. Using a comparative genomics approach with four dicotyledonous plant species (Arabidopsis thaliana, papaya [Carica papaya], poplar [Populus trichocarpa], and grape [Vitis vinifera]), we detected hundreds of CNSs upstream of Arabidopsis genes. Distinct positioning, length, and enrichment for transcription factor binding sites suggest these CNSs play a functional role in transcriptional regulation. The enrichment of transcription factors within the set of genes associated with CNS is consistent with the hypothesis that together they form part of a conserved transcriptional network whose function is to regulate other transcription factors and control development. We identified a set of promoters where regulatory mechanisms are likely to be shared between the model organism Arabidopsis and other dicots, providing areas of focus for further research

    How to understand the cell by breaking it: network analysis of gene perturbation screens

    Get PDF
    Modern high-throughput gene perturbation screens are key technologies at the forefront of genetic research. Combined with rich phenotypic descriptors they enable researchers to observe detailed cellular reactions to experimental perturbations on a genome-wide scale. This review surveys the current state-of-the-art in analyzing perturbation screens from a network point of view. We describe approaches to make the step from the parts list to the wiring diagram by using phenotypes for network inference and integrating them with complementary data sources. The first part of the review describes methods to analyze one- or low-dimensional phenotypes like viability or reporter activity; the second part concentrates on high-dimensional phenotypes showing global changes in cell morphology, transcriptome or proteome.Comment: Review based on ISMB 2009 tutorial; after two rounds of revisio
    • …
    corecore