22 research outputs found

    Combining comparative genomics with de novo motif discovery to identify human transcription factor DNA-binding motifs

    Get PDF
    BACKGROUND: As more and more genomes are sequenced, comparative genomics approaches provide a methodology for identifying conserved regulatory elements that may be involved in gene regulations. RESULTS: We developed a novel method to combine comparative genomics with de novo motif discovery to identify human transcription factor binding motifs that are overrepresented and conserved in the upstream regions of a set of co-regulated genes. The method is validated by analyzing a well-characterized muscle specific gene set, and the results showed that our approach performed better than the existing programs in terms of sensitivity and prediction rate. CONCLUSION: The newly developed method can be used to extract regulatory signals in co-regulated genes, which can be derived from the microarray clustering analysis

    Identification of gene modules associated with low temperatures response in Bambara groundnut by network-based analysis

    Get PDF
    Bambara groundnut (Vigna subterranea (L.) Verdc.) is an African legume and is a promising underutilized crop with good seed nutritional values. Low temperature stress in a number of African countries at night, such as Botswana, can effect the growth and development of bambara groundnut, leading to losses in potential crop yield. Therefore, in this study we developed a computational pipeline to identify and analyze the genes and gene modules associated with low temperature stress responses in bambara groundnut using the cross-species microarray technique (as bambara groundnut has no microarray chip) coupled with network-based analysis. Analyses of the bambara groundnut transcriptome using cross-species gene expression data resulted in the identification of 375 and 659 differentially expressed genes (p<0.01) under the sub-optimal (23°C) and very sub-optimal (18°C) temperatures, respectively, of which 110 genes are commonly shared between the two stress conditions. The construction of a Highest Reciprocal Rank-based gene co-expression network, followed by its partition using a Heuristic Cluster Chiseling Algorithm resulted in 6 and 7 gene modules in sub-optimal and very sub-optimal temperature stresses being identified, respectively. Modules of sub-optimal temperature stress are principally enriched with carbohydrate and lipid metabolic processes, while most of the modules of very sub-optimal temperature stress are significantly enriched with responses to stimuli and various metabolic processes. Several transcription factors (from MYB, NAC, WRKY, WHIRLY & GATA classes) that may regulate the downstream genes involved in response to stimulus in order for the plant to withstand very sub-optimal temperature stress were highlighted. The identified gene modules could be useful in breeding for low-temperature stress tolerant bambara groundnut varieties

    Genome-Wide Identification and Analysis of Grape Aldehyde Dehydrogenase (ALDH) Gene Superfamily

    Get PDF
    The completion of the grape genome sequencing project has paved the way for novel gene discovery and functional analysis. Aldehyde dehydrogenases (ALDHs) comprise a gene superfamily encoding NAD(P)(+)-dependent enzymes that catalyze the irreversible oxidation of a wide range of endogenous and exogenous aromatic and aliphatic aldehydes. Although ALDHs have been systematically investigated in several plant species including Arabidopsis and rice, our knowledge concerning the ALDH genes, their evolutionary relationship and expression patterns in grape has been limited.A total of 23 ALDH genes were identified in the grape genome and grouped into ten families according to the unified nomenclature system developed by the ALDH Gene Nomenclature Committee (AGNC). Members within the same grape ALDH families possess nearly identical exon-intron structures. Evolutionary analysis indicates that both segmental and tandem duplication events have contributed significantly to the expansion of grape ALDH genes. Phylogenetic analysis of ALDH protein sequences from seven plant species indicates that grape ALDHs are more closely related to those of Arabidopsis. In addition, synteny analysis between grape and Arabidopsis shows that homologs of a number of grape ALDHs are found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the speciation of the grape and Arabidopsis. Microarray gene expression analysis revealed large number of grape ALDH genes responsive to drought or salt stress. Furthermore, we found a number of ALDH genes showed significantly changed expressions in responses to infection with different pathogens and during grape berry development, suggesting novel roles of ALDH genes in plant-pathogen interactions and berry development.The genome-wide identification, evolutionary and expression analysis of grape ALDH genes should facilitate research in this gene family and provide new insights regarding their evolution history and functional roles in plant stress tolerance

    Population differentiation in allele frequencies of obesity-associated SNPs

    No full text
    Abstract Background Obesity is emerging as a global health problem, with more than one-third of the world’s adult population being overweight or obese. In this study, we investigated worldwide population differentiation in allele frequencies of obesity-associated SNPs (single nucleotide polymorphisms). Results We collected a total of 225 obesity-associated SNPs from a public database. Their population-level allele frequencies were derived based on the genotype data from 1000 Genomes Project (phase 3). We used hypergeometric model to assess whether the effect allele at a given SNP is significantly enriched or depleted in each of the 26 populations surveyed in the 1000 Genomes Project with respect to the overall pooled population. Our results indicate that 195 out of 225 SNPs (86.7%) possess effect alleles significantly enriched or depleted in at least one of the 26 populations. Populations within the same continental group exhibit similar allele enrichment/depletion patterns whereas inter-continental populations show distinct patterns. Among the 225 SNPs, 15 SNPs cluster in the first intron region of the FTO gene, which is a major gene associated with body-mass index (BMI) and fat mass. African populations exhibit much smaller blocks of LD (linkage disequilibrium) among these15 SNPs while European and Asian populations have larger blocks. To estimate the cumulative effect of all variants associated with obesity, we developed the personal composite genetic risk score for obesity. Our results indicate that the East Asian populations have the lowest averages of the composite risk scores, whereas three European populations have the highest averages. In addition, the population-level average of composite genetic risk scores is significantly correlated (R2 = 0.35, P = 0.0060) with obesity prevalence. Conclusions We have detected substantial population differentiation in allele frequencies of obesity-associated SNPs. The results will help elucidate the genetic basis which may contribute to population disparities in obesity prevalence
    corecore