38 research outputs found

    Cross-species protein sequence and gene structure prediction with fine-tuned Webscipio 2.0 and Scipio

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Obtaining transcripts of homologs of closely related organisms and retrieving the reconstructed exon-intron patterns of the genes is a very important process during the analysis of the evolution of a protein family and the comparative analysis of the exon-intron structure of a certain gene from different species. Due to the ever-increasing speed of genome sequencing, the gap to genome annotation is growing. Thus, tools for the correct prediction and reconstruction of genes in related organisms become more and more important. The tool Scipio, which can also be used via the graphical interface WebScipio, performs significant hit processing of the output of the Blat program to account for sequencing errors, missing sequence, and fragmented genome assemblies. However, Scipio has so far been limited to high sequence similarity and unable to reconstruct short exons.</p> <p>Results</p> <p>Scipio and WebScipio have fundamentally been extended to better reconstruct very short exons and intron splice sites and to be better suited for cross-species gene structure predictions. The Needleman-Wunsch algorithm has been implemented for the search for short parts of the query sequence that were not recognized by Blat. Those regions might either be short exons, divergent sequence at intron splice sites, or very divergent exons. We have shown the benefit and use of new parameters with several protein examples from completely different protein families in searches against species from several kingdoms of the eukaryotes. The performance of the new Scipio version has been tested in comparison with several similar tools.</p> <p>Conclusions</p> <p>With the new version of Scipio very short exons, terminal and internal, of even just one amino acid can correctly be reconstructed. Scipio is also able to correctly predict almost all genes in cross-species searches even if the ancestors of the species separated more than 100 Myr ago and if the protein sequence identity is below 80%. For our test cases Scipio outperforms all other software tested. WebScipio has been restructured and provides easy access to the genome assemblies of about 640 eukaryotic species. Scipio and WebScipio are freely accessible at <url>http://www.webscipio.org</url>.</p

    diArk 2.0 provides detailed analyses of the ever increasing eukaryotic genome sequencing data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Nowadays, the sequencing of even the largest mammalian genomes has become a question of days with current next-generation sequencing methods. It comes as no surprise that dozens of genome assemblies are released per months now. Since the number of next-generation sequencing machines increases worldwide and new major sequencing plans are announced, a further increase in the speed of releasing genome assemblies is expected. Thus it becomes increasingly important to get an overview as well as detailed information about available sequenced genomes. The different sequencing and assembly methods have specific characteristics that need to be known to evaluate the various genome assemblies before performing subsequent analyses.</p> <p>Results</p> <p>diArk has been developed to provide fast and easy access to all sequenced eukaryotic genomes worldwide. Currently, diArk 2.0 contains information about more than 880 species and more than 2350 genome assembly files. Many meta-data like sequencing and read-assembly methods, sequencing coverage, GC-content, extended lists of alternatively used scientific names and common species names, and various kinds of statistics are provided. To intuitively approach the data the web interface makes extensive usage of modern web techniques. A number of search modules and result views facilitate finding and judging the data of interest. Subscribing to the RSS feed is the easiest way to stay up-to-date with the latest genome data.</p> <p>Conclusions</p> <p>diArk 2.0 is the most up-to-date database of sequenced eukaryotic genomes compared to databases like GOLD, NCBI Genome, NHGRI, and ISC. It is different in that only those projects are stored for which genome assembly data or considerable amounts of cDNA data are available. Projects in planning stage or in the process of being sequenced are not included. The user can easily search through the provided data and directly access the genome assembly files of the sequenced genome of interest. diArk 2.0 is available at <url>http://www.diark.org</url>.</p

    Evolution of the eukaryotic ARP2/3 activators of the WASP family: WASP, WAVE, WASH, and WHAMM, and the proposed new family members WAWH and WAML

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>WASP family proteins stimulate the actin-nucleating activity of the ARP2/3 complex. They include members of the well-known WASP and WAVE/Scar proteins, and the recently identified WASH and WHAMM proteins. WASP family proteins contain family specific N-terminal domains followed by proline-rich regions and C-terminal VCA domains that harbour the ARP2/3-activating regions.</p> <p>Results</p> <p>To reveal the evolution of ARP2/3 activation by WASP family proteins we performed a "holistic" analysis by manually assembling and annotating all homologs in most of the eukaryotic genomes available. We have identified two new families: the WAML proteins (WASP and MIM like), which combine the membrane-deforming and actin bundling functions of the IMD domains with the ARP2/3-activating VCA regions, and the WAWH protein (WASP without WH1 domain) that have been identified in amoebae, Apusozoa, and the anole lizard. Surprisingly, with one exception we did not identify any alternative splice forms for WASP family proteins, which is in strong contrast to other actin-binding proteins like Ena/VASP, MIM, or NHS proteins that share domains with WASP proteins.</p> <p>Conclusions</p> <p>Our analysis showed that the last common ancestor of the eukaryotes must have contained a homolog of WASP, WAVE, and WASH. Specific families have subsequently been lost in many taxa like the WASPs in plants, algae, Stramenopiles, and Euglenozoa, and the WASH proteins in fungi. The WHAMM proteins are metazoa specific and have most probably been invented by the Eumetazoa. The diversity of WASP family proteins has strongly been increased by many species- and taxon-specific gene duplications and multimerisations. All data is freely accessible via <url>http://www.cymobase.org</url>.</p

    Comparative genomic analysis of the arthropod muscle myosin heavy chain genes allows ancestral gene reconstruction and reveals a new type of 'partially' processed pseudogene

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Alternative splicing of mutually exclusive exons is an important mechanism for increasing protein diversity in eukaryotes. The insect <it>Mhc </it>(myosin heavy chain) gene produces all different muscle myosins as a result of alternative splicing in contrast to most other organisms of the Metazoa lineage, that have a family of muscle genes with each gene coding for a protein specialized for a functional niche.</p> <p>Results</p> <p>The muscle myosin heavy chain genes of 22 species of the Arthropoda ranging from the waterflea to wasp and <it>Drosophila </it>have been annotated. The analysis of the gene structures allowed the reconstruction of an ancient muscle myosin heavy chain gene and showed that during evolution of the arthropods introns have mainly been lost in these genes although intron gain might have happened in a few cases. Surprisingly, the genome of <it>Aedes aegypti </it>contains another and that of <it>Culex pipiens quinquefasciatus </it>two further muscle myosin heavy chain genes, called <it>Mhc3 </it>and <it>Mhc4</it>, that contain only one variant of the corresponding alternative exons of the <it>Mhc1 </it>gene. <it>Mhc3 </it>transcription in <it>Aedes aegypti </it>is documented by EST data. <it>Mhc3 </it>and <it>Mhc4 </it>inserted in the <it>Aedes </it>and <it>Culex </it>genomes either by gene duplication followed by the loss of all but one variant of the alternative exons, or by incorporation of a transcript of which all other variants have been spliced out retaining the exon-intron structure. The second and more likely possibility represents a new type of a 'partially' processed pseudogene.</p> <p>Conclusion</p> <p>Based on the comparative genomic analysis of the alternatively spliced arthropod muscle myosin heavy chain genes we propose that the splicing process operates sequentially on the transcript. The process consists of the splicing of the mutually exclusive exons until one exon out of the cluster remains while retaining surrounding intronic sequence. In a second step splicing of introns takes place. A related mechanism could be responsible for the splicing of other genes containing mutually exclusive exons.</p

    The UNC-45 Chaperone Is Critical for Establishing Myosin-Based Myofibrillar Organization and Cardiac Contractility in the Drosophila Heart Model

    Get PDF
    UNC-45 is a UCS (UNC-45/CRO1/She4P) class chaperone necessary for myosin folding and/or accumulation, but its requirement for maintaining cardiac contractility has not been explored. Given the prevalence of myosin mutations in eliciting cardiomyopathy, chaperones like UNC-45 are likely to be equally critical in provoking or modulating myosin-associated cardiomyopathy. Here, we used the Drosophila heart model to examine its role in cardiac physiology, in conjunction with RNAi-mediated gene silencing specifically in the heart in vivo. Analysis of cardiac physiology was carried out using high-speed video recording in conjunction with movement analysis algorithms. unc-45 knockdown resulted in severely compromised cardiac function in adults as evidenced by prolonged diastolic and systolic intervals, and increased incidence of arrhythmias and extreme dilation; the latter was accompanied by a significant reduction in muscle contractility. Structural analysis showed reduced myofibrils, myofibrillar disarray, and greatly decreased cardiac myosin accumulation. Cardiac unc-45 silencing also dramatically reduced life-span. In contrast, third instar larval and young pupal hearts showed mild cardiac abnormalities, as severe cardiac defects only developed during metamorphosis. Furthermore, cardiac unc-45 silencing in the adult heart (after metamorphosis) led to less severe phenotypes. This suggests that UNC-45 is mostly required for myosin accumulation/folding during remodeling of the forming adult heart. The cardiac defects, myosin deficit and decreased life-span in flies upon heart-specific unc-45 knockdown were significantly rescued by UNC-45 over-expression. Our results are the first to demonstrate a cardiac-specific requirement of a chaperone in Drosophila, suggestive of a critical role of UNC-45 in cardiomyopathies, including those associated with unfolded proteins in the failing human heart. The dilated cardiomyopathy phenotype associated with UNC-45 deficiency is mimicked by myosin knockdown suggesting that UNC-45 plays a crucial role in stabilizing myosin and possibly preventing human cardiomyopathies associated with functional deficiencies of myosin

    The Functions of Myosin II and Myosin V Homologs in Tip Growth and Septation in Aspergillus nidulans

    Get PDF
    Because of the industrial and medical importance of members of the fungal genus Aspergillus, there is considerable interest in the functions of cytoskeletal components in growth and secretion in these organisms. We have analyzed the genome of Aspergillus nidulans and found that there are two previously unstudied myosin genes, a myosin II homolog, myoB (product = MyoB) and a myosin V homolog, myoE (product = MyoE). Deletions of either cause significant growth defects. MyoB localizes in strings that coalesce into contractile rings at forming septa. It is critical for septation and normal deposition of chitin but not for hyphal extension. MyoE localizes to the Spitzenkörper and to moving puncta in the cytoplasm. Time-lapse imaging of SynA, a v-SNARE, reveals that in myoE deletion strains vesicles no longer localize to the Spitzenkörper. Tip morphology is slightly abnormal and branching occurs more frequently than in controls. Tip extension is slower than in controls, but because hyphal diameter is greater, growth (increase in volume/time) is only slightly reduced. Concentration of vesicles into the Spitzenkörper before incorporation into the plasma membrane is, thus, not required for hyphal growth but facilitates faster tip extension and a more normal hyphal shape

    Skipping of Exons by Premature Termination of Transcription and Alternative Splicing within Intron-5 of the Sheep SCF Gene: A Novel Splice Variant

    Get PDF
    Stem cell factor (SCF) is a growth factor, essential for haemopoiesis, mast cell development and melanogenesis. In the hematopoietic microenvironment (HM), SCF is produced either as a membrane-bound (−) or soluble (+) forms. Skin expression of SCF stimulates melanocyte migration, proliferation, differentiation, and survival. We report for the first time, a novel mRNA splice variant of SCF from the skin of white merino sheep via cloning and sequencing. Reverse transcriptase (RT)-PCR and molecular prediction revealed two different cDNA products of SCF. Full-length cDNA libraries were enriched by the method of rapid amplification of cDNA ends (RACE-PCR). Nucleotide sequencing and molecular prediction revealed that the primary 1519 base pair (bp) cDNA encodes a precursor protein of 274 amino acids (aa), commonly known as ‘soluble’ isoform. In contrast, the shorter (835 and/or 725 bp) cDNA was found to be a ‘novel’ mRNA splice variant. It contains an open reading frame (ORF) corresponding to a truncated protein of 181 aa (vs 245 aa) with an unique C-terminus lacking the primary proteolytic segment (28 aa) right after the D175G site which is necessary to produce ‘soluble’ form of SCF. This alternative splice (AS) variant was explained by the complete nucleotide sequencing of splice junction covering exon 5-intron (5)-exon 6 (948 bp) with a premature termination codon (PTC) whereby exons 6 to 9/10 are skipped (Cassette Exon, CE 6–9/10). We also demonstrated that the Northern blot analysis at transcript level is mediated via an intron-5 splicing event. Our data refine the structure of SCF gene; clarify the presence (+) and/or absence (−) of primary proteolytic-cleavage site specific SCF splice variants. This work provides a basis for understanding the functional role and regulation of SCF in hair follicle melanogenesis in sheep beyond what was known in mice, humans and other mammals

    Molecular features of the UNC-45 chaperone critical for binding and folding muscle myosin

    Get PDF
    Myosin is a motor protein that is essential for a variety of processes ranging from intracellular transport to muscle contraction. Folding and assembly of myosin relies on a specific chaperone, UNC-45. To address its substrate-targeting mechanism, we reconstitute the interplay between Caenorhabditis elegans UNC-45 and muscle myosin MHC-B in insect cells. In addition to providing a cellular chaperone assay, the established system enabled us to produce large amounts of functional muscle myosin, as evidenced by a biochemical and structural characterization, and to directly monitor substrate binding to UNC-45. Data from in vitro and cellular chaperone assays, together with crystal structures of binding-deficient UNC-45 mutants, highlight the importance of utilizing a flexible myosin-binding domain. This so-called UCS domain can adopt discrete conformations to efficiently bind and fold substrate. Moreover, our data uncover the molecular basis of temperature-sensitive UNC-45 mutations underlying one of the most prominent motility defects in C. elegans

    Genomics and phylogeny of motor proteins: tools and analyses.

    No full text
    corecore