132 research outputs found

    The GC Skew Index: A Measure of Genomic Compositional Asymmetry and the Degree of Replicational Selection

    Get PDF
    Circular bacterial chromosomes have highly polarized nucleotide composition in the two replichores, and this genomic strand asymmetry can be visualized using GC skew graphs. Here we propose and discuss the GC skew index (GCSI) for the quantification of genomic compositional skew, which combines a normalized measure of fast Fourier transform to capture the shape of the skew graph and Euclidean distance between the two vertices in a cumulative skew graph to represent the degree of skew. We calculated GCSI for all available bacterial genomes, and GCSI correlated well with the visibility of GC skew. This novel index is useful for estimating confidence levels for the prediction of replication origin and terminus by methods based on GC skew and for measuring the strength of replicational selection in a genome

    Selection Effects on the Positioning of Genes and Gene Structures from the Interplay of Replication and Transcription in Bacterial Genomes

    Get PDF
    Bacterial chromosomes are partly shaped by the functional requirements for efficient replication, which lead to strand bias as commonly characterized by the excess of guanines over cytosines in the leading strand. Gene structures are also highly organized within bacterial genomes as a result of such functional constraints, displaying characteristic positioning and structuring along the genome. Here we analyze the gene structures in completely sequenced bacterial chromosomes to observe the positional constraints on gene orientation, length, and codon usage with regard to the positions of replication origin and terminus. Selection on these gene features is different in regions surrounding the terminus of replication from the rest of the genome, but the selection could be either positive or negative depending on the species, and these positional effects are partly attributed to the A-T enrichment near the terminus. Characteristic gene structuring relative to the position of replication origin and terminus is commonly observed among most bacterial species with circular chromosomes, and therefore we argue that the highly organized gene positioning as well as the strand bias should be considered for genomics studies of bacteria

    KBWS: an EMBOSS associated package for accessing bioinformatics web services

    Get PDF
    The availability of bioinformatics web-based services is rapidly proliferating, for their interoperability and ease of use. The next challenge is in the integration of these services in the form of workflows, and several projects are already underway, standardizing the syntax, semantics, and user interfaces. In order to deploy the advantages of web services with locally installed tools, here we describe a collection of proxy client tools for 42 major bioinformatics web services in the form of European Molecular Biology Open Software Suite (EMBOSS) UNIX command-line tools. EMBOSS provides sophisticated means for discoverability and interoperability for hundreds of tools, and our package, named the Keio Bioinformatics Web Service (KBWS), adds functionalities of local and multiple alignment of sequences, phylogenetic analyses, and prediction of cellular localization of proteins and RNA secondary structures. This software implemented in C is available under GPL from http://www.g-language.org/kbws/ and GitHub repository http://github.com/cory-ko/KBWS. Users can utilize the SOAP services implemented in Perl directly via WSDL file at http://soap.g-language.org/kbws.wsdl (RPC Encoded) and http://soap.g-language.org/kbws_dl.wsdl (Document/literal)

    Validation of Bacterial Replication Termination Models Using Simulation of Genomic Mutations

    Get PDF
    In bacterial circular chromosomes and most plasmids, the replication is known to be terminated when either of the following occurs: the forks progressing in opposite directions meet at the distal end of the chromosome or the replication forks become trapped by Tus proteins bound to Ter sites. Most bacterial genomes have various polarities in their genomic structures. The most notable feature is polar genomic compositional asymmetry of the bases G and C in the leading and lagging strands, called GC skew. This asymmetry is caused by replication-associated mutation bias, and this “footprint" of the replication machinery suggests that, in contrast to the two known mechanisms, replication termination occurs near the chromosome dimer resolution site dif. To understand this difference between the known replication machinery and genomic compositional bias, we undertook a simulation study of genomic mutations, and we report here how different replication termination models contribute to the generation of replication-related genomic compositional asymmetry. Contrary to naive expectations, our results show that a single finite termination site at dif or at the GC skew shift point is not sufficient to reconstruct the genomic compositional bias as observed in published sequences. The results also show that the known replication mechanisms are sufficient to explain the position of the GC skew shift point

    New Tardigrade Opsins and Differential Expression Analyses Show Ontogenic Variation in Light Perception

    Get PDF
    Abstract Opsins are light-sensitive proteins involved in many photoreceptive processes, including, but not limited to, vision and regulation of circadian rhythms. Arthropod (e.g., insects, spiders, centipedes, and crabs) opsins have been extensively researched, but the relationships and function of opsins found in lineages that are evolutionarily closely related to the arthropods remains unclear. Multiple, independent, opsin duplications are known in Tardigrada (the water bears), evidencing that protostome opsin duplications are not limited to the Arthropoda. However, the relationships, function, and expression of these new opsins are still unknown. Here, we use two tardigrade transcriptomes with deep coverage to greatly expand our knowledge of the diversity of tardigrade opsins. We reconstruct the phylogenetic relationships of the tardigrade opsins and investigate their ontogenetic expression. We found that while tardigrades have multiple opsins that evolved from lineage-specific duplications of well-understood arthropod opsins, their expression levels change during ontogeny such that most of these opsins are not co-temporally expressed. Co-temporal expression of multiple opsins underpins color vision in Arthropoda and Vertebrata. Our results clearly show duplications of both rhabdomeric and ciliary opsins within Tardigrada, forming clades specific to both the Heterotardigrada and Eutardigrada in addition to multiple independent duplications within genera. However, lack of co-temporal, ontogenetic, expression suggests that while tardigrades possess multiple opsins, they are unlikely to be able to distinguish color

    Quantitative analysis of replication-related mutation and selection pressures in bacterial chromosomes and plasmids using generalised GC skew index

    Get PDF
    BACKGROUND: Due to their bi-directional replication machinery starting from a single finite origin, bacterial genomes show characteristic nucleotide compositional bias between the two replichores, which can be visualised through GC skew or (C-G)/(C+G). Although this polarisation is used for computational prediction of replication origins in many bacterial genomes, the degree of GC skew visibility varies widely among different species, necessitating a quantitative measurement of GC skew strength in order to provide confidence measures for GC skew-based predictions of replication origins. RESULTS: Here we discuss a quantitative index for the measurement of GC skew strength, named the generalised GC skew index (gGCSI), which is applicable to genomes of any length, including bacterial chromosomes and plasmids. We demonstrate that gGCSI is independent of the window size and can thus be used to compare genomes with different sizes, such as bacterial chromosomes and plasmids. It can suggest the existence of different replication mechanisms in archaea and of rolling-circle replication in plasmids. Correlation of gGCSI values between plasmids and their corresponding host chromosomes suggests that within the same strain, these replicons have reproduced using the same replication machinery and thus exhibit similar strengths of replication strand skew. CONCLUSIONS: gGCSI can be applied to genomes of any length and thus allows comparative study of replication-related mutation and selection pressures in genomes of different lengths such as bacterial chromosomes and plasmids. Using gGCSI, we showed that replication-related mutation or selection pressure is similar for replicons with similar machinery

    Spider silk self-assembly via modular liquid-liquid phase separation and nanofibrillation

    Get PDF
    クモ糸の階層構造を初めて再現 --シルクタンパク質の液液相分離による階層構造形成--. 京都大学プレスリリース. 2020-11-06.How does the spider spin its self-assembled silk?. 京都大学プレスリリース. 2020-12-01.Spider silk fiber rapidly assembles from spidroin protein in soluble state via an incompletely understood mechanism. Here, we present an integrated model for silk formation that incorporates the effects of multiple chemical and physical gradients on the different spidroin functional domains. Central to the process is liquid-liquid phase separation (LLPS) that occurs in response to multivalent anions such as phosphate, mediated by the carboxyl-terminal and repetitive domains. Acidification coupled with LLPS triggers the swift self-assembly of nanofibril networks, facilitated by dimerization of the amino-terminal domain, and leads to a liquid-to-solid phase transition. Mechanical stress applied to the fibril structures yields macroscopic fibers with hierarchical organization and enriched for β-sheet conformations. Studies using native silk gland material corroborate our findings on spidroin phase separation. Our results suggest an intriguing parallel between silk assembly and other LLPS-mediated mechanisms, such as found in intracellular membraneless organelles and protein aggregation disorders

    A web server for interactive and zoomable Chaos Game Representation images

    Get PDF
    Chaos Game Representation (CGR) is a generalized scale-independent Markov transition table, which is useful for the visualization and comparative study of genomic signature, or for the study of characteristic sequence motifs. However, in order to fully utilize the scale-independent properties of CGR, it should be accessible through scale-independent user interface instead of static images. Here we describe a web server and Perl library for generating zoomable CGR images utilizing Google Maps API, which is also easily searchable for specific motifs. The web server is freely accessible at , and the Perl library as well as the source code is distributed with the G-language Genome Analysis Environment under GNU General Public License

    Pathway Projector: Web-Based Zoomable Pathway Browser Using KEGG Atlas and Google Maps API

    Get PDF
    BACKGROUND: Biochemical pathways provide an essential context for understanding comprehensive experimental data and the systematic workings of a cell. Therefore, the availability of online pathway browsers will facilitate post-genomic research, just as genome browsers have contributed to genomics. Many pathway maps have been provided online as part of public pathway databases. Most of these maps, however, function as the gateway interface to a specific database, and the comprehensiveness of their represented entities, data mapping capabilities, and user interfaces are not always sufficient for generic usage. METHODOLOGY/PRINCIPAL FINDINGS: We have identified five central requirements for a pathway browser: (1) availability of large integrated maps showing genes, enzymes, and metabolites; (2) comprehensive search features and data access; (3) data mapping for transcriptomic, proteomic, and metabolomic experiments, as well as the ability to edit and annotate pathway maps; (4) easy exchange of pathway data; and (5) intuitive user experience without the requirement for installation and regular maintenance. According to these requirements, we have evaluated existing pathway databases and tools and implemented a web-based pathway browser named Pathway Projector as a solution. CONCLUSIONS/SIGNIFICANCE: Pathway Projector provides integrated pathway maps that are based upon the KEGG Atlas, with the addition of nodes for genes and enzymes, and is implemented as a scalable, zoomable map utilizing the Google Maps API. Users can search pathway-related data using keywords, molecular weights, nucleotide sequences, and amino acid sequences, or as possible routes between compounds. In addition, experimental data from transcriptomic, proteomic, and metabolomic analyses can be readily mapped. Pathway Projector is freely available for academic users at (http://www.g-language.org/PathwayProjector/)
    corecore