13 research outputs found
Structural and Functional Genomics in Semi-Autonomous Organelles: Composition and Origin of Proteomes of Chloroplasts and Mitochondria and Related Transcriptomics
Mitochondria and chloroplasts are semi-autonomous organelles that have arisen through an endosymbiotic event and, over evolutionary time, have donated most of their genome to the nucleus of the host cell. Due to this transfer of genetic material, the expression of many proteins of the organellar proteomes, now synthesized in the cytosol and re-targeted to the organelles, came under control of the nucleus. Subsequently, means of communication between organelles and nucleus must exist, enabling the organelles to take influence on the nuclear gene expression. This thesis focused on structural and functional genomics in mitochondria and chloroplasts, addressing questions related to the composition, origin and evolution of the organelles, as well as, chloroplast-to-nucleus signaling. The accuracy of five different predictors for the detection of N-terminal targeting peptides was evaluated employing test sets consisting of proteins with experimentally proven subcellular localization, and found to be substantially lower than reported before. Combinations of the predictors showed to be more accurate than any of the predictors alone and were subsequently used to estimate the size and composition of the organellar proteomes. A prediction of the mitochondrial proteomes for ten species was performed and revealed that functional mitochondria harbor from a few hundred to more than 3,000 gene products. A core set of conserved mitochondrial proteins could be identified whose functions are mostly related to transport and metabolism, and -- if mutated -- are frequently associated with disease in humans. In collaboration with W. Martin (Universität Düsseldorf) and co-workers, the cyanobacterial heritage of the Arabidopsis genome was estimated by phylogenetic inferrence and about 4,800 genes (or 18% of the genome) were shown to have been acquired from the prokaryotic ancestor. In both flowering plants, A. thaliana and O. sativa, about 7% of the whole proteome were predicted to be targeted to the chloroplast, with close to 600 of those proteins shared by both species and most likely to be derived from cyanobacteria. The functions of this subset are mainly related to metabolism and energy. In both organelles, species-specific proteins were detected indicating a functional diversification. Even though in A. thaliana the cTP-featuring proteins are predominantly of prokaryotic origin (more than 50%), this indicates that post-endosymbiotic relocations of proteins from/to the chloroplast occurred by altered targeting. These findings were also confirmed for mitochondria. A differential-expression analysis of the nuclear chloroplast transcriptome under 35 environmental and genetic conditions was performed. It revealed, that most of those conditions elicit only three main classes of transcriptome response. Two of these classes, probably involving GUN-type plastid signaling, are characterized by alterations, in opposite directions, in the expression of largely overlapping sets of genes. Thus these findings, suggest the existence of a regulatory, binary master-switch
NUMTs in Sequenced Eukaryotic Genomes
Mitochondrial DNA sequences are frequently transferred to the nucleus giving rise to the so-called nuclear mitochondrial DNA (NUMT). Analysis of 13 eukaryotic species with sequenced mitochondrial and nuclear genomes reveals a large interspecific variation of NUMT number and size. Copy number ranges from none or few copies in Anopheles, Caenorhabditis, Plasmodium, Drosophila, and Fugu to more than 500 in human, rice, and Arabidopsis. The average size is between 62 (baker’s yeast) and 647 bps (Neurospora), respectively. A correlation between the abundance of NUMTs and the size of the nuclear or the mitochondrial genomes, or of the nuclear gene density, is not evident. Other factors, such as the number and/or stability of mitochondria in the germline, or species-specific mechanisms controlling accumulation/loss of nuclear DNA, might be responsible for the interspecific diversity in NUMT accumulation
NUPTs in Sequenced Eukaryotes and Their Genomic Organization in Relation to NUMTs
NUPTs (nuclear plastid DNA) derive from plastid-to-nucleus DNA transfer and exist in various plant species. Experimental data imply that the DNA transfer is an ongoing, highly frequent process, but for the interspecific diversity of NUPTs, no clear explanation exists. Here, an inventory of NUPTs in the four sequenced plastid-bearing species and their genomic organization is presented. Large genomes with a predicted low gene density contain more NUPTs. In Chlamydomonas and Plasmodium, DNA transfer occurred but was limited, probably because of the presence of only one plastid per cell. In Arabidopsis and rice, NUPTs are frequently organized as clusters. Tight clusters can contain both NUPTs and NUMTs (nuclear mitochondrial DNA), indicating that preNUPTs and preNUMTs might have concatamerized before integration. The composition of such a hypothetical preNUPT-preNUMT pool seems to be variable, as implied by substantially different NUPTs:NUMTs ratios in different species. Loose clusters can span several dozens of kbps of nuclear DNA, and they contain markedly more NUPTs or NUMTs than expected from a random genomic distribution of nuclear organellar DNA. The level of sequence similarity between NUPTs/NUMTs and plastid/mitochondrial DNA correlates with the size of the integrant. This implies that original insertions are large and decay over evolutionary time into smaller fragments with diverging sequences. We suggest that tight and loose clusters represent intermediates of this decay process
Mode of Amplification and Reorganization of Resistance Genes During Recent Arabidopsis thaliana Evolution
The NBS-LRR (nucleotide-binding site plus leucine-rich repeat) genes represent the major class of disease resistance genes in flowering plants and comprise 166 genes in the ecotype Col-0 of Arabidopsis thaliana. NBS-LRR genes are organized in single-gene loci, clusters, and superclusters. Phylogenetic analysis reveals nine monophyletic clades and a few phylogenetic orphans. Most clusters contain only genes from the same phylogenetic lineage, reflecting their origin from the exchange of sequence blocks as a result of intralocus recombination. Multiple duplications increased the number of NBS-LRR genes in the progenitors of Arabidopsis, suggesting that the present complexity in Col-0 may derive from as few as 17 progenitors. The combination of physical and phylogenetic analyses of the NBS-LRR genes makes it possible to detect relatively recent gene rearrangements, which increased the number of NBS-LRR genes by about 50, but which are almost never associated with large segmental duplications. The identification of 10 heterogeneous clusters containing members from different clades demonstrates that sequence sampling between different resistance gene loci and clades has occurred. Such events may have taken place early during flowering plant evolution, but they generated modules that have been duplicated and remobilized also more recently
A genome phylogeny for mitochondria among alpha-proteobacteria and a predominantly eubacterial ancestry of yeast nuclear genes
Analyses of 55 individual and 31 concatenated protein data sets encoded in Reclinomonas americana and Marchantia polymorpha mitochondrial genomes revealed that current methods for constructing phylogenetic trees are insufficiently sensitive (or artifact-insensitive) to ascertain the sister of mitochondria among the current sample of eight alpha-proteobacterial genomes using mitochondrially-encoded proteins. However, Rhodospirillum rubrum came as close to mitochondria as any alpha-proteobacterium investigated. This prompted a search for methods to directly compare eukaryotic genomes to their prokaryotic counterparts to investigate the origin of the mitochondrion and its host from the standpoint of nuclear genes. We examined pairwise amino acid sequence identity in comparisons of 6,214 nuclear protein-coding genes from Saccharomyces cerevisiae to 177,117 proteins encoded in sequenced genomes from 45 eubacteria and 15 archaebacteria. The results reveal that approximately 75% of yeast genes having homologues among the present prokaryotic sample share greater amino acid sequence identity to eubacterial than to archaebacterial homologues. At high stringency comparisons, only the eubacterial component of the yeast genome is detectable. Our findings indicate that at the levels of overall amino acid sequence identity and gene content, yeast shares a sister-group relationship with eubacteria, not with archaebacteria, in contrast to the current phylogenetic paradigm based on ribosomal RNA. Among eubacteria and archaebacteria, proteobacterial and methanogen genomes, respectively, shared more similarity with the yeast genome than other prokaryotic genomes surveyed
Generation and evolutionary fate of insertions of organelle DNA in the nuclear genomes of flowering plants
Nuclear genomes are exposed to a continuous influx of DNA from mitochondria and plastids. We have characterized the structure of ∼750 kb of organelle DNA, distributed among 13 loci, in the nuclear genomes of Arabidopsis and rice. These segments are large and migrated to the nucleus quite recently, allowing us to reconstruct their evolution. Two general types of nuclear insertions coexist; one is characterized by long sequence stretches that are colinear with organelle DNA, the other type consists of mosaics of organelle DNA, often derived from both plastids and mitochondria. The levels of sequence divergence of the two types exclude their common descent, implying that at least two independent modes of DNA transfer from organelle to nucleus operate. The post-integration fate of organelle DNA is characterized by a predominance of transition mutations, associated with the gradual amelioration of the integrated sequence to the nucleotide composition of the host chromosome. Deletion of organelle DNA at these loci is essentially balanced by insertions of nonorganelle DNA. Deletions are associated with the removal of DNA between perfect repeats, indicating that they originate by replication slippage
GST-PRIME: a genome-wide primer design software for the generation of gene sequence tags
The availability of sequenced genomes has generated a need for experimental approaches that allow the simultaneous analysis of large, or even complete, sets of genes. To facilitate such analyses, we have developed GST-PRIME, a software package for retrieving and assembling gene sequences, even from complex genomes, using the NCBI public database, and then designing sets of primer pairs for use in gene amplification. Primers were designed by the program for the direct amplification of gene sequence tags (GSTs) from either genomic DNA or cDNA. Test runs of GST-PRIME on 2000 randomly selected Arabidopsis and Drosophila genes demonstrate that 93 and 88% of resulting GSTs, respectively, fulfilled imposed length criteria. GST-PRIME primer pairs were tested on a set of 1900 Arabidopsis genes coding for chloroplast-targeted proteins: 95% of the primer pairs used in PCRs with genomic DNA generated the correct amplicons. GST-PRIME can thus be reliably used for large-scale or specific amplification of intron-containing genes of multicellular eukaryotes
Covariations in the nuclear chloroplast transcriptome reveal a regulatory master-switch
The evolution of the endosymbiotic progenitor into the chloroplast organelle was associated with the transfer of numerous chloroplast genes into the nucleus. Hence, inter-organellar signalling, and the co-ordinated expression of sets of nuclear genes, was set up to control the metabolic and developmental status of the chloroplast. Here, we show by the differential-expression analysis of 3,292 genes, that most of the 35 environmental and genetic conditions tested, including plastid signalling mutations, elicit only three main classes of response from the nuclear chloroplast transcriptome. Two classes, probably involving GUN (genomes uncoupled)-type plastid signalling, are characterized by alterations, in opposite directions, in the expression of largely overlapping sets of genes