197 research outputs found

    Relationship between the Composition of Flavonoids and Flower Colors Variation in Tropical Water Lily (Nymphaea) Cultivars

    Water lily, a member of the Nymphaeaceae family, is a symbol of Buddhism and Brahmanism in India. Although research on its flower color variation and the underlying formation mechanism is limited, water lily has a background of blue flowers and displays an exceptionally wide diversity of flower colors in nature, from purple, red, and blue to yellow. In this study, 34 flavonoids were identified among 35 tropical cultivars by high-performance liquid chromatography (HPLC) with photodiode array detection (DAD) and electrospray ionization mass spectrometry (ESI-MS). Among them, four anthocyanins: delphinidin 3-O-rhamnosyl-5-O-galactoside (Dp3Rh5Ga), delphinidin 3-O-(2″-O-galloyl-6″-O-oxalyl-rhamnoside) (Dp3galloyl-oxalylRh), delphinidin 3-O-(6″-O-acetyl-β-glucopyranoside) (Dp3acetylG) and cyanidin 3-O-(2″-O-galloyl-galactopyranoside)-5-O-rhamnoside (Cy3galloylGa5Rh), one chalcone: chalcononaringenin 2′-O-galactoside (Chal2′Ga), and twelve flavonols: myricetin 7-O-rhamnosyl-(1→2)-rhamnoside (My7RhRh), quercetin 7-O-galactosyl-(1→2)-rhamnoside (Qu7GaRh), quercetin 7-O-galactoside (Qu7Ga), kaempferol 7-O-galactosyl-(1→2)-rhamnoside (Km7GaRh), myricetin 3-O-galactoside (My3Ga), kaempferol 7-O-galloylgalactosyl-(1→2)-rhamnoside (Km7galloylGaRh), myricetin 3-O-galloylrhamnoside (My3galloylRh), kaempferol 3-O-galactoside (Km3Ga), isorhamnetin 7-O-galactoside (Is7Ga), isorhamnetin 7-O-xyloside (Is7Xy), kaempferol 3-O-(3″-acetylrhamnoside) (Km3-3″acetylRh) and quercetin 3-O-acetylgalactoside (Qu3acetylGa) were identified in the petals of tropical water lily for the first time. A multivariate analysis was used to explore the relationship between pigments and flower color. Cultivars in which delphinidin 3-galactoside (Dp3Ga) was detected were amaranth, and those in which delphinidin 3′-galactoside (Dp3′Ga) was detected were blue. However, the derivatives of delphinidin and cyanidin were more complicated in the red group. No anthocyanins were detected in the white and yellow groups. A possible flavonoid biosynthesis pathway for tropical water lily was also proposed. These results will help to elucidate the mechanism by which flower colors form and evolve, and provide a theoretical basis for outcross breeding and for developing health care products from this plant.

    MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue-residue contacts

    BACKGROUND: Multiple sequence alignment (MSA) is a basic tool for bioinformatics research and analysis. It is used in almost all bioinformatics tasks, such as protein structure modeling, gene and protein function prediction, DNA motif recognition, and phylogenetic analysis. Therefore, improving the accuracy of multiple sequence alignment is important for advancing many bioinformatics fields. RESULTS: We designed and developed a new method, MSACompro, to synergistically incorporate predicted secondary structure, relative solvent accessibility, and residue-residue contact information into the currently most accurate posterior probability-based MSA methods to improve the accuracy of multiple sequence alignments. The method differs from multiple sequence alignment methods (e.g. 3D-Coffee) that use the tertiary structure information of some sequences, since the structural information in our method is predicted entirely from sequences. To the best of our knowledge, applying predicted relative solvent accessibility and contact maps to multiple sequence alignment is novel. Rigorous benchmarking of our method on the standard benchmarks (i.e. BAliBASE, SABmark and OXBENCH) clearly demonstrated that incorporating predicted protein structural information improves multiple sequence alignment accuracy over the leading multiple protein sequence alignment tools that do not use this information, such as MSAProbs, ProbCons, Probalign, T-coffee, MAFFT and MUSCLE. The performance of the method is comparable to, with slightly lower scores than, the state-of-the-art method PROMALS, which uses structural features and additional homologous sequences. CONCLUSION: MSACompro is an efficient and reliable multiple protein sequence alignment tool that can effectively incorporate predicted protein structural information into multiple sequence alignment. The software is available at http://sysbio.rnet.missouri.edu/multicom_toolbox/.

    Gene fusions and gene duplications: relevance to genomic annotation and functional analysis

    BACKGROUND: Escherichia coli, a model organism, provides information for the annotation of other genomes. Our analysis of its genome has shown that proteins encoded by fused genes need special attention. Such composite (multimodular) proteins consist of two or more components (modules) encoding distinct functions. Multimodular proteins have been found to complicate both annotation and the generation of sequence-similar groups. Previous work overstated the number of multimodular proteins in E. coli. This work corrects the identification of modules by including sequence information from proteins in 50 sequenced microbial genomes. RESULTS: Multimodular E. coli K-12 proteins were identified from sequence similarities between their component modules and non-fused proteins in 50 genomes and from the literature. We found 109 multimodular proteins in E. coli containing either two or three modules. Most modules had standalone sequence relatives in other genomes. The separated modules together with all the single (un-fused) proteins constitute the sum of all unimodular proteins of E. coli. Pairwise sequence relationships among all E. coli unimodular proteins generated 490 sequence-similar, paralogous groups. Groups ranged in size from 92 to 2 members and had varying degrees of relatedness among their members. Some E. coli enzyme groups were compared to homologs in other bacterial genomes. CONCLUSION: The deleterious effects of multimodular proteins on annotation and on the formation of groups of paralogs are emphasized. To improve annotation results, all multimodular proteins in an organism should be detected and, when known, each function should be connected with its location in the sequence of the protein. When transferring functions by sequence similarity, alignment locations must be noted, particularly when alignments cover only part of the sequences, in order to enable transfer of the correct function. Separating multimodular proteins into module units makes it possible to generate protein groups related by both sequence and function, avoiding the mixing of unrelated sequences. Organisms differ in the sizes of their groups of sequence-related proteins. A sample comparison of orthologs to selected E. coli paralogous groups correlates with known physiological and taxonomic relationships between the organisms.

    A General Approach for Predicting the Filtration of Soft and Permeable Colloids: The Milk Example

    Membrane filtration operations (ultra-, microfiltration) are now extensively used for concentrating or separating an ever-growing variety of colloidal dispersions. However, the phenomena that determine the efficiency of these operations are not yet fully understood. This is especially the case when dealing with colloids that are soft, deformable, and permeable. In this paper, we propose a methodology for building a model that is able to predict the performance (flux, concentration profiles) of the filtration of such objects in relation to the operating conditions. This is done by focusing on the case of milk filtration, all experiments being performed with dispersions of milk casein micelles, which are a sort of "natural" colloidal microgel. Using this example, we develop the general idea that a filtration model can always be built for a given colloidal dispersion as long as this dispersion has been characterized in terms of osmotic pressure Π and hydraulic permeability k. For soft and permeable colloids, the major issue is that the permeability k cannot be assessed in a trivial way, as it can for hard-sphere colloids. To get around this difficulty, we follow two distinct approaches to actually measure k: a direct approach, involving osmotic stress experiments, and a reverse-calculation approach, which consists of estimating k through well-controlled filtration experiments. The resulting filtration model is then validated against experimental measurements obtained from combined milk filtration/SAXS experiments. We also give precise examples of how the model can be used, as well as a brief discussion of the possible universality of the approach presented here.

    Compressed Suffix Arrays for Massive Data

    We present a fast, space-efficient algorithm for constructing compressed suffix arrays (CSA). The algorithm requires O(n log n) time in the worst case, and only O(n) bits of extra space in addition to the CSA. As the basic step, we describe an algorithm for merging two CSAs. We show that the construction algorithm can be parallelized in a symmetric multiprocessor system, and discuss the possibility of a distributed implementation. We also describe a parallel implementation of the algorithm, capable of indexing several gigabytes per hour.
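For readers unfamiliar with the data structure named in this abstract, the following is a minimal sketch of what a suffix array is, together with the Ψ (psi) function used in the classic compressed suffix array formulation. It is not the construction or merging algorithm of the paper: the naive sort below takes far more time and space, and the abstract does not say which CSA representation the authors use, so the Ψ-based view is an assumption. The function names `suffix_array` and `psi_array` are hypothetical.

```python
def suffix_array(text):
    """Naive construction (illustration only): sort suffix start
    positions by the suffix that begins there."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def psi_array(sa):
    """Psi[i] is the rank of the suffix starting one position after SA[i].
    The suffix at the last text position wraps around to position 0."""
    n = len(sa)
    rank = [0] * n  # rank[pos] = index of the suffix starting at pos in sa
    for r, pos in enumerate(sa):
        rank[pos] = r
    return [rank[(pos + 1) % n] for pos in sa]


# '$' is an assumed end marker that sorts before every letter.
text = "banana$"
sa = suffix_array(text)
psi = psi_array(sa)
print(sa)   # [6, 5, 3, 1, 0, 4, 2]
print(psi)  # [4, 0, 5, 6, 3, 1, 2]
```

Within each run of suffixes that start with the same character, the Ψ values are increasing (here 0, 5, 6 for the suffixes starting with "a"), which is the property that Ψ-based compressed suffix arrays exploit to store the array compactly.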

    Comparative genomics of small RNA regulatory pathway components in vector mosquitoes

    BACKGROUND: Small RNA regulatory pathways (SRRPs) control key aspects of development and anti-viral defense in metazoans. Members of the Argonaute family of catalytic enzymes degrade target RNAs in each of these pathways. SRRPs include the microRNA (miRNA), small interfering RNA (siRNA) and PIWI-type gene silencing pathways. Mosquitoes generate viral siRNAs when infected with RNA arboviruses. However, in some mosquitoes, arboviruses survive antiviral RNA interference (RNAi) and are transmitted via mosquito bite to a subsequent host. Increased knowledge of these pathways and their functional components should increase understanding of the limitations of anti-viral defense in vector mosquitoes. To this end, we compared the genomic structure of SRRP components across three mosquito species and three major small RNA pathways. RESULTS: The Ae. aegypti, An. gambiae and Cx. pipiens genomes encode putative orthologs for all major components of the miRNA, siRNA, and piRNA pathways. Ae. aegypti and Cx. pipiens have undergone expansion of Argonaute and PIWI subfamily genes. Phylogenetic analyses were performed for these protein families. In addition, the sequence pattern recognition algorithms MEME, MDScan and Weeder were used to identify upstream regulatory motifs for all SRRP components. Statistical analyses confirmed enrichment of species-specific and pathway-specific cis-elements over the rest of the genome. CONCLUSION: Analysis of Argonaute and PIWI subfamily genes suggests that the small regulatory RNA pathways of the major arbovirus vectors, Ae. aegypti and Cx. pipiens, are evolving faster than those of the malaria vector An. gambiae and D. melanogaster. Further, protein and genomic features suggest functional differences between subclasses of PIWI proteins and provide a basis for future analyses. Common UCR elements among SRRP components indicate that 1) key components from the miRNA, siRNA, and piRNA pathways contain NF-kappaB-related and Broad complex transcription factor binding sites, 2) purifying selection has occurred to maintain common pathway-specific elements across mosquito species and 3) species-specific differences in upstream elements suggest that there may be differences in regulatory control among mosquito species. Implications for arbovirus vector competence in mosquitoes are discussed.

    Improving the Alignment Quality of Consistency Based Aligners with an Evaluation Function Using Synonymous Protein Words

    Most sequence alignment tools can successfully align protein sequences with high levels of sequence identity. The accuracy of the corresponding structure alignment, however, decreases rapidly for distantly related sequences (<20% identity). In this range of identity, alignments optimized to maximize sequence similarity are often inaccurate from a structural point of view. Over the last two decades, most multiple protein aligners have been optimized for their capacity to reproduce structure-based alignments while using only sequence information. Currently available methods differ essentially in how they measure similarity between aligned residues: substitution matrices, Fourier transform, sophisticated profile-profile functions, or, more recently, consistency-based approaches.
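The "<20% identity" threshold in this abstract refers to percent sequence identity, which can be illustrated with a short sketch. Several definitions are in use (they differ in the denominator), and the abstract does not say which one the authors apply; the version below assumes identity is counted over columns where neither sequence has a gap. The function name `percent_identity` and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over gap-free columns of a pairwise alignment.

    Assumption: columns containing a gap in either sequence are ignored,
    so the denominator is the number of residue-residue columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    compared = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Made-up aligned sequences: 7 gap-free columns, 5 of them identical.
a = "MKT-AYIAK"
b = "MRTLAY-SK"
identity = percent_identity(a, b)
print(f"{identity:.1f}% identity")
```

A pair scoring below 20 under a definition like this would fall in the distantly related range that the abstract describes as hard to align accurately.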