400 research outputs found
Law of Genome Evolution Direction : Coding Information Quantity Grows
The problem of the directionality of genome evolution is studied. Based on
the analysis of C-value paradox and the evolution of genome size we propose
that the function-coding information quantity of a genome always grows in the
course of evolution through sequence duplication, expansion of code, and gene
transfer from outside. The function-coding information quantity of a genome
consists of two parts, p-coding information quantity which encodes functional
protein and n-coding information quantity which encodes other functional
elements except amino acid sequence. The evidences on the evolutionary law
about the function-coding information quantity are listed. The needs of
function is the motive force for the expansion of coding information quantity
and the information quantity expansion is the way to make functional innovation
and extension for a species. So, the increase of coding information quantity of
a genome is a measure of the acquired new function and it determines the
directionality of genome evolution.Comment: 16 page
MIPS: analysis and annotation of genome information in 2007
The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. To cope with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de)
Enrichment analysis of Alu elements with different spatial chromatin proximity in the human genome
Transposable elements (TEs) have no longer been totally considered as “junk DNA” for quite a time since the continual discoveries of their multifunctional roles in eukaryote genomes. As one of the most important and abundant TEs that still active in human genome, Alu, a SINE family, has demonstrated its indispensable regulatory functions at sequence level, but its spatial roles are still unclear. Technologies based on 3C(chromosomeconformation capture) have revealed the mysterious three-dimensional structure of chromatin, and make it possible to study the distal chromatin interaction in the genome. To find the role TE
playing in distal regulation in human genome, we compiled the new released Hi-C data, TE annotation, histone marker annotations, and the genome-wide methylation data to operate correlation analysis, and found that the density of Alu elements showed a strong positive correlation with the level of chromatin interactions (hESC: r=0.9, P<2.2×1016; IMR90 fibroblasts: r = 0.94, P < 2.2 × 1016) and also have a significant positive correlation withsomeremote functional DNA elements like enhancers and promoters (Enhancer: hESC: r=0.997, P=2.3×10−4; IMR90: r=0.934, P=2×10−2; Promoter: hESC: r = 0.995, P = 3.8 × 10−4; IMR90: r = 0.996, P = 3.2 × 10−4). Further investigation involving GC content and methylation status showed the GC content of Alu covered sequences shared a similar pattern with that of the overall sequence, suggesting that Alu elements also function as the GC nucleotide and CpG site provider. In all, our results suggest that the Alu elements may act as an alternative parameter to evaluate the Hi-C data, which is confirmed by the correlation analysis of Alu elements and histone markers. Moreover, the GC-rich Alu sequence can bring high GC content and methylation flexibility to the regions with more distal chromatin contact, regulating the transcription of tissue-specific genes
Mechanisms controlling anaemia in Trypanosoma congolense infected mice.
Trypanosoma congolense are extracellular protozoan parasites of the blood stream of artiodactyls and are one of the main constraints on cattle production in Africa. In cattle, anaemia is the key feature of disease and persists after parasitaemia has declined to low or undetectable levels, but treatment to clear the parasites usually resolves the anaemia. The progress of anaemia after Trypanosoma congolense infection was followed in three mouse strains. Anaemia developed rapidly in all three strains until the peak of the first wave of parasitaemia. This was followed by a second phase, characterized by slower progress to severe anaemia in C57BL/6, by slow recovery in surviving A/J and a rapid recovery in BALB/c. There was no association between parasitaemia and severity of anaemia. Furthermore, functional T lymphocytes are not required for the induction of anaemia, since suppression of T cell activity with Cyclosporin A had neither an effect on the course of infection nor on anaemia. Expression of genes involved in erythropoiesis and iron metabolism was followed in spleen, liver and kidney tissues in the three strains of mice using microarrays. There was no evidence for a response to erythropoietin, consistent with anaemia of chronic disease, which is erythropoietin insensitive. However, the expression of transcription factors and genes involved in erythropoiesis and haemolysis did correlate with the expression of the inflammatory cytokines Il6 and Ifng. The innate immune response appears to be the major contributor to the inflammation associated with anaemia since suppression of T cells with CsA had no observable effect. Several transcription factors regulating haematopoiesis, Tal1, Gata1, Zfpm1 and Klf1 were expressed at consistently lower levels in C57BL/6 mice suggesting that these mice have a lower haematopoietic capacity and therefore less ability to recover from haemolysis induced anaemia after infection
Exon deletions and intragenic insertions are not rare in ataxia with oculomotor apraxia 2
<p>Abstract</p> <p>Background</p> <p>The autosomal recessively inherited ataxia with oculomotor apraxia 2 (AOA2) is a neurodegenerative disorder characterized by juvenile or adolescent age of onset, gait ataxia, cerebellar atrophy, axonal sensorimotor neuropathy, oculomotor apraxia, and elevated serum AFP levels. AOA2 is caused by mutations within the senataxin gene (<it>SETX</it>). The majority of known mutations are nonsense, missense, and splice site mutations, as well as small deletions and insertions.</p> <p>Methods</p> <p>To detect mutations in patients showing a clinical phenotype consistent with AOA2, the coding region including splice sites of the <it>SETX </it>gene was sequenced and dosage analyses for all exons were performed on genomic DNA. The sequence of cDNA fragments of alternative transcripts isolated after RT-PCR was determined.</p> <p>Results</p> <p>Sequence analyses of the <it>SETX </it>gene in four patients revealed a heterozygous nonsense mutation or a 4 bp deletion in three cases. In another patient, PCR amplification of exon 11 to 15 dropped out. Dosage analyses and breakpoint localisation yielded a 1.3 kb LINE1 insertion in exon 12 (patient P1) and a 6.1 kb deletion between intron 11 and intron 14 (patient P2) in addition to the heterozygous nonsense mutation R1606X. Patient P3 was compound heterozygous for a 4 bp deletion in exon 10 and a 20.7 kb deletion between intron 10 and 15. This deletion was present in a homozygous state in patient P4.</p> <p>Conclusion</p> <p>Our findings indicate that gross mutations seem to be a frequent cause of AOA2 and reveal the importance of additional copy number analysis for routine diagnostics.</p
Chromosomal-level assembly of the Asian Seabass genome using long sequence reads and multi-layered scaffolding
We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species' native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics
RISCI - Repeat Induced Sequence Changes Identifier: a comprehensive, comparative genomics-based, in silico subtractive hybridization pipeline to identify repeat induced sequence changes in closely related genomes
<p>Abstract</p> <p>Background -</p> <p>The availability of multiple whole genome sequences has facilitated <it>in silico </it>identification of fixed and polymorphic transposable elements (TE). Whereas polymorphic loci serve as makers for phylogenetic and forensic analysis, fixed species-specific transposon insertions, when compared to orthologous loci in other closely related species, may give insights into their evolutionary significance. Besides, TE insertions are not isolated events and are frequently associated with subtle sequence changes concurrent with insertion or post insertion. These include duplication of target site, 3' and 5' flank transduction, deletion of the target locus, 5' truncation or partial deletion and inversion of the transposon, and post insertion changes like inter or intra element recombination, disruption etc. Although such changes have been studied independently, no automated platform to identify differential transposon insertions and the associated array of sequence changes in genomes of the same or closely related species is available till date. To this end, we have designed RISCI - 'Repeat Induced Sequence Changes Identifier' - a comprehensive, comparative genomics-based, <it>in silico </it>subtractive hybridization pipeline to identify differential transposon insertions and associated sequence changes using specific alignment signatures, which may then be examined for their downstream effects.</p> <p>Results -</p> <p>We showcase the utility of RISCI by comparing full length and truncated L1HS and AluYa5 retrotransposons in the reference human genome with the chimpanzee genome and the alternate human assemblies (Celera and HuRef). Comparison of the reference human genome with alternate human assemblies using RISCI predicts 14 novel polymorphisms in full length L1HS, 24 in truncated L1HS and 140 novel polymorphisms in AluYa5 insertions, besides several insertion and post insertion changes. We present comparison with two previous studies to show that RISCI predictions are broadly in agreement with earlier reports. We also demonstrate its versatility by comparing various strains of <it>Mycobacterium tuberculosis </it>for IS 6100 insertion polymorphism.</p> <p>Conclusions -</p> <p>RISCI combines comparative genomics with subtractive hybridization, inferring changes only when exclusive to one of the two genomes being compared. The pipeline is generic and may be applied to most transposons and to any two or more genomes sharing high sequence similarity. Such comparisons, when performed on a larger scale, may pull out a few critical events, which may have seeded the divergence between the two species under comparison.</p
The association of Alu repeats with the generation of potential AU-rich elements (ARE) at 3' untranslated regions.
BACKGROUND: A significant portion (about 8% in the human genome) of mammalian mRNA sequences contains AU (Adenine and Uracil) rich elements or AREs at their 3' untranslated regions (UTR). These mRNA sequences are usually stable. However, an increasing number of observations have been made of unstable species, possibly depending on certain elements such as Alu repeats. ARE motifs are repeats of the tetramer AUUU and a monomer A at the end of the repeats ((AUUU)(n)A). The importance of AREs in biology is that they make certain mRNA unstable. Proto-oncogene, such as c-fos, c-myc, and c-jun in humans, are associated with AREs. Although it has been known that the increased number of ARE motifs caused the decrease of the half-life of mRNA containing ARE repeats, the exact mechanism is as of yet unknown. We analyzed the occurrences of AREs and Alu and propose a possible mechanism for how human mRNA could acquire and keep AREs at its 3' UTR originating from Alu repeats. RESULTS: Interspersed in the human genome, Alu repeats occupy 5% of the 3' UTR of mRNA sequences. Alu has poly-adenine (poly-A) regions at its end, which lead to poly-thymine (poly-T) regions at the end of its complementary Alu. It has been found that AREs are present at the poly-T regions. From the 3' UTR of the NCBI's reference mRNA sequence database, we found nearly 40% (38.5%) of ARE (Class I) were associated with Alu sequences (Table 1) within one mismatch allowance in ARE sequences. Other ARE classes had statistically significant associations as well. This is far from a random occurrence given their limited quantity. At each ARE class, random distribution was simulated 1,000 times, and it was shown that there is a special relationship between ARE patterns and the Alu repeats. CONCLUSION: AREs are mediating sequence elements affecting the stabilization or degradation of mRNA at the 3' untranslated regions. However, AREs' mechanism and origins are unknown. We report that Alu is a source of ARE. We found that half of the longest AREs were derived from the poly-T regions of the complementary Alu
Transposon Excision from an Atypical Site: A Mechanism of Evolution of Novel Transposable Elements
The role of transposable elements in sculpting the genome is well appreciated but remains poorly understood. Some organisms, such as humans, do not have active transposons; however, transposable elements were presumably active in their ancestral genomes. Of specific interest is whether the DNA surrounding the sites of transposon excision become recombinogenic, thus bringing about homologous recombination. Previous studies in maize and Drosophila have provided conflicting evidence on whether transposon excision is correlated with homologous recombination. Here we take advantage of an atypical Dissociation (Ds) element, a maize transposon that can be mobilized by the Ac transposase gene in Arabidopsis thaliana, to address questions on the mechanism of Ds excision. This atypical Ds element contains an adjacent 598 base pairs (bp) inverted repeat; the element was allowed to excise by the introduction of an unlinked Ac transposase source through mating. Footprints at the excision site suggest a micro-homology mediated non-homologous end joining reminiscent of V(D)J recombination involving the formation of intra-helix 3′ to 5′ trans-esterification as an intermediate, a mechanism consistent with previous observations in maize, Antirrhinum and in certain insects. The proposed mechanism suggests that the broken chromosome at the excision site should not allow recombinational interaction with the homologous chromosome, and that the linked inverted repeat should also be mobilizable. To test the first prediction, we measured recombination of flanking chromosomal arms selected for the excision of Ds. In congruence with the model, Ds excision did not influence crossover recombination. Furthermore, evidence for correlated movement of the adjacent inverted repeat sequence is presented; its origin and movement suggest a novel mechanism for the evolution of repeated elements. Taken together these results suggest that the movement of transposable elements themselves may not directly influence linkage. Possibility remains, however, for novel repeated DNA sequences produced as a consequence of transposon movement to influence crossover in subsequent generations
- …