38 research outputs found

    Coordinately Co-opted Multiple Transposable Elements Constitute an Enhancer for wnt5a Expression in the Mammalian Secondary Palate

    Acquisition of cis-regulatory elements is a major driving force of evolution, and there are several examples of developmental enhancers derived from transposable elements (TEs). However, it remains unclear whether one enhancer element could have been produced via cooperation among multiple, yet distinct, TEs during evolution. Here we show that an evolutionarily conserved genomic region named AS3_9 comprises three TEs (AmnSINE1, X6b_DNA and MER117), inserted side-by-side, and functions as a distal enhancer for wnt5a expression during morphogenesis of the mammalian secondary palate. Functional analysis of each TE revealed step-by-step retroposition/transposition and co-option together with acquisition of a binding site for Msx1 for its full enhancer function during mammalian evolution. The present study provides a new perspective suggesting that a huge variety of TEs, in combination, could have accelerated the diversity of cis-regulatory elements involved in morphological evolution

    KEGG for representation and analysis of molecular networks involving diseases and drugs

    Most human diseases are complex multi-factorial diseases resulting from the combination of various genetic and environmental factors. In the KEGG database resource (http://www.genome.jp/kegg/), diseases are viewed as perturbed states of the molecular system, and drugs as perturbants to the molecular system. Disease information is computerized in two forms: pathway maps and gene/molecule lists. The KEGG PATHWAY database contains pathway maps for the molecular systems in both normal and perturbed states. In the KEGG DISEASE database, each disease is represented by a list of known disease genes, any known environmental factors at the molecular level, diagnostic markers and therapeutic drugs, which may reflect the underlying molecular system. The KEGG DRUG database contains chemical structures and/or chemical components of all drugs in Japan, including crude drugs and TCM (Traditional Chinese Medicine) formulas, and drugs in the USA and Europe. This database also captures knowledge about two types of molecular networks: the interaction network with target molecules, metabolizing enzymes, other drugs, etc. and the chemical structure transformation network in the history of drug development. The new disease/drug information resource named KEGG MEDICUS can be used as a reference knowledge base for computational analysis of molecular networks, especially, by integrating large-scale experimental datasets

    A Mammalian Conserved Element Derived from SINE Displays Enhancer Properties Recapitulating Satb2 Expression in Early-Born Callosal Projection Neurons

    Short interspersed repetitive elements (SINEs) are highly repeated sequences that account for a significant proportion of many eukaryotic genomes and are usually considered “junk DNA”. However, we previously discovered that many AmnSINE1 loci are evolutionarily conserved across mammalian genomes, suggesting that they may have acquired significant functions involved in controlling mammalian-specific traits. Notably, we identified the AS021 SINE locus, located 390 kbp upstream of Satb2. Using transgenic mice, we showed that this SINE displays specific enhancer activity in the developing cerebral cortex. The transcription factor Satb2 is expressed by cortical neurons extending axons through the corpus callosum and is a determinant of callosal versus subcortical projection. Mouse mutants reveal a crucial function for Satb2 in corpus callosum formation. In this study, we compared the enhancer activity of the AS021 locus with Satb2 expression during telencephalic development in the mouse. First, we showed that the AS021 enhancer is specifically activated in early-born Satb2+ neurons. Second, we demonstrated that the activity of the AS021 enhancer recapitulates the expression of Satb2 at later embryonic and postnatal stages in deep-layer but not superficial-layer neurons, suggesting the possibility that the expression of Satb2 in these two subpopulations of cortical neurons is under genetically distinct transcriptional control. Third, we showed that the AS021 enhancer is activated in neurons projecting through the corpus callosum, as described for Satb2+ neurons. Notably, AS021 drives specific expression in axons crossing through the ventral (TAG1−/NPY+) portion of the corpus callosum, confirming that it is active in a subpopulation of callosal neurons. These data suggest that exaptation of the AS021 SINE locus might be involved in enhancement of Satb2 expression, leading to the establishment of interhemispheric communication via the corpus callosum, a eutherian-specific brain structure.

    Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

    The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. 
In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology

    The KEGG databases and tools facilitating omics analysis: latest developments involving human diseases and pharmaceuticals.

    In this chapter, we demonstrate the usability of the KEGG (Kyoto Encyclopedia of Genes and Genomes) databases and tools, focusing especially on the visualization of omics data. The desktop application KegArray and many Web-based tools are tightly integrated with the KEGG knowledgebase, which helps visualize and interpret large amounts of data derived from high-throughput measurement techniques, including microarray, metagenome, and metabolome analyses. Recently developed resources for human disease, drug, and plant research are also mentioned.

    GDB (Genome Data Base). The construction of database in Human Genome Project.


    LSD1-LIKE1-Mediated H3K4me2 Demethylation Is Required for Homologous Recombination Repair

    Homologous recombination is a key process for maintaining genome integrity and diversity. In eukaryotes, the nucleosome structure of chromatin inhibits the progression of homologous recombination. The DNA repair and recombination protein RAD54 alters the chromatin structure via nucleosome sliding to enable homology searches. For homologous recombination to progress, appropriate recruitment and dissociation of RAD54 is required at the site of homologous recombination; however, little is known about the mechanism regulating RAD54 dynamics in chromatin. Here, we reveal that the histone demethylase LYSINE-SPECIFIC DEMETHYLASE1-LIKE 1 (LDL1) regulates the dissociation of RAD54 at damaged sites during homologous recombination repair in the somatic cells of Arabidopsis (Arabidopsis thaliana). Depletion of LDL1 leads to an overaccumulation of RAD54 at damaged sites with DNA double-strand breaks. Moreover, RAD54 accumulates at damaged sites by recognizing histone H3 Lys 4 di-methylation (H3K4me2); the frequency of the interaction between RAD54 and H3K4me2 increased in the ldl1 mutant with DNA double-strand breaks. We propose that LDL1 removes RAD54 at damaged sites by demethylating H3K4me2 during homologous recombination repair and thereby maintains genome stability in Arabidopsis.