151 research outputs found

    A novel series of compositionally biased substitution matrices for comparing Plasmodium proteins

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The most common substitution matrices currently used (BLOSUM and PAM) are based on protein sequences with average amino acid distributions, thus they do not represent a fully accurate substitution model for proteins characterized by a biased amino acid composition. This problem has been addressed recently by adjusting existing matrices, however, to date, no empirical approach has been taken to build matrices which offer a substitution model for comparing proteins sharing an amino acid compositional bias. Here, we present a novel procedure to construct series of symmetrical substitution matrices to align proteins from similarly biased <it>Plasmodium </it>proteomes.</p> <p>Results</p> <p>We generated substitution matrices by selecting from the BLOCKS database those multiple alignments with a compositional bias similar to that of <it>P. falciparum </it>and <it>P. yoelii </it>proteins. A novel 'fuzzy' clustering method was adopted to group sequences within these alignments, showing that this method retains more complete information on the amino acid substitutions when compared to hierarchical clustering. We assessed the performance against the BLOSUM62 series and showed that the usage of our matrices results in an improvement in the performance of BLAST database searches, greatly reducing the number of false positive hits. We then demonstrated applications of the use of novel matrices to improve the annotation of homologs between the two <it>Plasmodium </it>species and to classify members of the <it>P. falciparum </it>RIFIN/STEVOR family.</p> <p>Conclusion</p> <p>We confirmed that in the case of compositionally biased proteins, standard BLOSUM matrices are not suited for optimal alignments, and specific substitution matrices are required. In addition, we showed that the usage of these matrices leads to a reduction of false positive hits, facilitating the automatic annotation process.</p

    Optimizing substitution matrix choice and gap parameters for sequence alignment

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>While substitution matrices can readily be computed from reference alignments, it is challenging to compute optimal or approximately optimal gap penalties. It is also not well understood which substitution matrices are the most effective when alignment accuracy is the goal rather than homolog recognition. Here a new parameter optimization procedure, POP, is described and applied to the problems of optimizing gap penalties and selecting substitution matrices for pair-wise global protein alignments.</p> <p>Results</p> <p>POP is compared to a recent method due to Kim and Kececioglu and found to achieve from 0.2% to 1.3% higher accuracies on pair-wise benchmarks extracted from BALIBASE. The VTML matrix series is shown to be the most accurate on several global pair-wise alignment benchmarks, with VTML200 giving best or close to the best performance in all tests. BLOSUM matrices are found to be slightly inferior, even with the marginal improvements in the bug-fixed RBLOSUM series. The PAM series is significantly worse, giving accuracies typically 2% less than VTML. Integer rounding is found to cause slight degradations in accuracy. No evidence is found that selecting a matrix based on sequence divergence improves accuracy, suggesting that the use of this heuristic in CLUSTALW may be ineffective. Using VTML200 is found to improve the accuracy of CLUSTALW by 8% on BALIBASE and 5% on PREFAB.</p> <p>Conclusion</p> <p>The hypothesis that more accurate alignments of distantly related sequences may be achieved using low-identity matrices is shown to be false for commonly used matrix types. Source code and test data is freely available from the author's web site at <url>http://www.drive5.com/pop</url>.</p

    Supervised multivariate analysis of sequence groups to identify specificity determining residues

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Proteins that evolve from a common ancestor can change functionality over time, and it is important to be able identify residues that cause this change. In this paper we show how a supervised multivariate statistical method, Between Group Analysis (BGA), can be used to identify these residues from families of proteins with different substrate specifities using multiple sequence alignments.</p> <p>Results</p> <p>We demonstrate the usefulness of this method on three different test cases. Two of these test cases, the Lactate/Malate dehydrogenase family and Nucleotidyl Cyclases, consist of two functional groups. The other family, Serine Proteases consists of three groups. BGA was used to analyse and visualise these three families using two different encoding schemes for the amino acids.</p> <p>Conclusion</p> <p>This overall combination of methods in this paper is powerful and flexible while being computationally very fast and simple. BGA is especially useful because it can be used to analyse any number of functional classes. In the examples we used in this paper, we have only used 2 or 3 classes for demonstration purposes but any number can be used and visualised.</p

    DNA methylation in diploid inbred lines of potatoes and its possible role in the regulation of heterosis

    Get PDF
    Self-incompatible diploid potatoes were altered to self-compatible ones by a function of S-locus inhibitor gene and continued selfing generated highly homozygous inbreds. In this study, this process was investigated for the status of DNA methylation by a simple method using genomic DNA digested by methylation-sensitive restriction enzymes prior to RAPD analysis. We detected 31 methylation-sensitive RAPD bands, of which 11 were newly appeared in the selfed progenies, and 6 of them stably inherited to subsequent generations. Aberrant segregations and paternal- or atavism-like transmission were also found. Segregating methylation-sensitive bands in initial populations became fixed in the advanced selfed progenies by 75.0–93.8%, of which 41.7% were fixed to all present and 58.3% to all absent. Because DNA methylation is generally recognized to suppress gene expression as regulatory factors, homozygosity/heterozygosity of methylated DNA may be involved in inbreeding depression/heterosis

    Next-Generation Sequencing Reveals Recent Horizontal Transfer of a DNA Transposon between Divergent Mosquitoes

    Get PDF
    Horizontal transfer of genetic material between complex organisms often involves transposable elements (TEs). For example, a DNA transposon mariner has been shown to undergo horizontal transfer between different orders of insects and between different phyla of animals. Here we report the discovery and characterization of an ITmD37D transposon, MJ1, in Anopheles sinensis. We show that some MJ1 elements in Aedes aegypti and An. sinensis contain intact open reading frames and share nearly 99% nucleotide identity over the entire transposon, which is unexpectedly high given that these two genera had diverged 145–200 million years ago. Chromosomal hybridization and TE-display showed that MJ1 copy number is low in An. sinensis. Among 24 mosquito species surveyed, MJ1 is only found in Ae. aegypti and the hyrcanus group of anopheline mosquitoes to which An. sinensis belongs. Phylogenetic analysis is consistent with horizontal transfer and provides the basis for inference of its timing and direction. Although report of horizontal transfer of DNA transposons between higher eukaryotes is accumulating, our analysis is one of a small number of cases in which horizontal transfer of nearly identical TEs among highly divergent species has been thoroughly investigated and strongly supported. Horizontal transfer involving mosquitoes is of particular interest because there are ongoing investigations of the possibility of spreading pathogen-resistant genes into mosquito populations to control malaria and other infectious diseases. The initial indication of horizontal transfer of MJ1 came from comparisons between a 0.4x coverage An. sinensis 454 sequence database and available TEs in mosquito genomes. Therefore we have shown that it is feasible to use low coverage sequencing to systematically uncover horizontal transfer events. Expanding such efforts across a wide range of species will generate novel insights into the relative frequency of horizontal transfer of different TEs and provide the evolutionary context of these lateral transfer events

    Transgene × Environment Interactions in Genetically Modified Wheat

    Get PDF
    BACKGROUND: The introduction of transgenes into plants may cause unintended phenotypic effects which could have an impact on the plant itself and the environment. Little is published in the scientific literature about the interrelation of environmental factors and possible unintended effects in genetically modified (GM) plants. METHODS AND FINDINGS: We studied transgenic bread wheat Triticum aestivum lines expressing the wheat Pm3b gene against the fungus powdery mildew Blumeria graminis f.sp. tritici. Four independent offspring pairs, each consisting of a GM line and its corresponding non-GM control line, were grown under different soil nutrient conditions and with and without fungicide treatment in the glasshouse. Furthermore, we performed a field experiment with a similar design to validate our glasshouse results. The transgene increased the resistance to powdery mildew in all environments. However, GM plants reacted sensitive to fungicide spraying in the glasshouse. Without fungicide treatment, in the glasshouse GM lines had increased vegetative biomass and seed number and a twofold yield compared with control lines. In the field these results were reversed. Fertilization generally increased GM/control differences in the glasshouse but not in the field. Two of four GM lines showed up to 56% yield reduction and a 40-fold increase of infection with ergot disease Claviceps purpurea compared with their control lines in the field experiment; one GM line was very similar to its control. CONCLUSIONS: Our results demonstrate that, depending on the insertion event, a particular transgene can have large effects on the entire phenotype of a plant and that these effects can sometimes be reversed when plants are moved from the glasshouse to the field. However, it remains unclear which mechanisms underlie these effects and how they may affect concepts in molecular plant breeding and plant evolutionary ecology

    Likely Role of APOBEC3G-Mediated G-to-A Mutations in HIV-1 Evolution and Drug Resistance

    Get PDF
    The role of APOBEC3 (A3) protein family members in inhibiting retrovirus infection and mobile element retrotransposition is well established. However, the evolutionary effects these restriction factors may have had on active retroviruses such as HIV-1 are less well understood. An HIV-1 variant that has been highly G-to-A mutated is unlikely to be transmitted due to accumulation of deleterious mutations. However, G-to-A mutated hA3G target sequences within which the mutations are the least deleterious are more likely to survive selection pressure. Thus, among hA3G targets in HIV-1, the ratio of nonsynonymous to synonymous changes will increase with virus generations, leaving a footprint of past activity. To study such footprints in HIV-1 evolution, we developed an in silico model based on calculated hA3G target probabilities derived from G-to-A mutation sequence contexts in the literature. We simulated G-to-A changes iteratively in independent sequential HIV-1 infections until a stop codon was introduced into any gene. In addition to our simulation results, we observed higher ratios of nonsynonymous to synonymous mutation at hA3G targets in extant HIV-1 genomes than in their putative ancestral genomes, compared to random controls, implying that moderate levels of A3G-mediated G-to-A mutation have been a factor in HIV-1 evolution. Results from in vitro passaging experiments of HIV-1 modified to be highly susceptible to hA3G mutagenesis verified our simulation accuracy. We also used our simulation to examine the possible role of A3G-induced mutations in the origin of drug resistance. We found that hA3G activity could have been responsible for only a small increase in mutations at known drug resistance sites and propose that concerns for increased resistance to other antiviral drugs should not prevent Vif from being considered a suitable target for development of new drugs

    CpG island hypermethylation-associated silencing of non-coding RNAs transcribed from ultraconserved regions in human cancer

    Get PDF
    Although only 1.5% of the human genome appears to code for proteins, much effort in cancer research has been devoted to this minimal fraction of our DNA. However, the last few years have witnessed the realization that a large class of non-coding RNAs (ncRNAs), named microRNAs, contribute to cancer development and progression by acting as oncogenes or tumor suppressor genes. Recent studies have also shown that epigenetic silencing of microRNAs with tumor suppressor features by CpG island hypermethylation is a common hallmark of human tumors. Thus, we wondered whether there were other ncRNAs undergoing aberrant DNA methylation-associated silencing in transformed cells. We focused on the transcribed-ultraconserved regions (T-UCRs), a subset of DNA sequences that are absolutely conserved between orthologous regions of the human, rat and mouse genomes and that are located in both intra- and intergenic regions. We used a pharmacological and genomic approach to reveal the possible existence of an aberrant epigenetic silencing pattern of T-UCRs by treating cancer cells with a DNA-demethylating agent followed by hybridization to an expression microarray containing these sequences. We observed that DNA hypomethylation induces release of T-UCR silencing in cancer cells. Among the T-UCRs that were reactivated upon drug treatment, Uc.160+, Uc283+A and Uc.346+ were found to undergo specific CpG island hypermethylation-associated silencing in cancer cells compared with normal tissues. The analysis of a large set of primary human tumors (n=283) demonstrated that hypermethylation of the described T-UCR CpG islands was a common event among the various tumor types. Our finding that, in addition to microRNAs, another class of ncRNAs (T-UCRs) undergoes DNA methylation-associated inactivation in transformed cells supports a model in which epigenetic and genetic alterations in coding and non-coding sequences cooperate in human tumorigenesis

    Cytotoxic and Pathogenic Properties of Klebsiella oxytoca Isolated from Laboratory Animals

    Get PDF
    Klebsiella oxytoca is an opportunistic pathogen implicated in various clinical diseases in animals and humans. Studies suggest that in humans K. oxytoca exerts its pathogenicity in part through a cytotoxin. However, cytotoxin production in animal isolates of K. oxytoca and its pathogenic properties have not been characterized. Furthermore, neither the identity of the toxin nor a complete repertoire of genes involved in K. oxytoca pathogenesis have been fully elucidated. Here, we showed that several animal isolates of K. oxytoca, including the clinical isolates, produced secreted products in bacterial culture supernatant that display cytotoxicity on HEp-2 and HeLa cells, indicating the ability to produce cytotoxin. Cytotoxin production appears to be regulated by the environment, and soy based product was found to have a strong toxin induction property. The toxin was identified, by liquid chromatography-mass spectrometry and NMR spectroscopy, as low molecular weight heat labile benzodiazepine, tilivalline, previously shown to cause cytotoxicity in several cell lines, including mouse L1210 leukemic cells. Genome sequencing and analyses of a cytotoxin positive K. oxytoca strain isolated from an abscess of a mouse, identified genes previously shown to promote pathogenesis in other enteric bacterial pathogens including ecotin, several genes encoding for type IV and type VI secretion systems, and proteins that show sequence similarity to known bacterial toxins including cholera toxin. To our knowledge, these results demonstrate for the first time, that animal isolates of K. oxytoca, produces a cytotoxin, and that cytotoxin production is under strict environmental regulation. We also confirmed tilivalline as the cytotoxin present in animal K. oxytoca strains. These findings, along with the discovery of a repertoire of genes with virulence potential, provide important insights into the pathogenesis of K. oxytoca. As a novel diagnostic tool, tilivalline may serve as a biomarker for K oxytoca-induced cytotoxicity in humans and animals through detection in various samples from food to diseased samples using LC-MS/MS. Induction of K. oxytoca cytotoxin by consumption of soy may be in part involved in the pathogenesis of gastrointestinal disease

    Latherin: A Surfactant Protein of Horse Sweat and Saliva

    Get PDF
    Horses are unusual in producing protein-rich sweat for thermoregulation, a major component of which is latherin, a highly surface-active, non-glycosylated protein. The amino acid sequence of latherin, determined from cDNA analysis, is highly conserved across four geographically dispersed equid species (horse, zebra, onager, ass), and is similar to a family of proteins only found previously in the oral cavity and associated tissues of mammals. Latherin produces a significant reduction in water surface tension at low concentrations (≤1 mg ml−1), and therefore probably acts as a wetting agent to facilitate evaporative cooling through a waterproofed pelt. Neutron reflection experiments indicate that this detergent-like activity is associated with the formation of a dense protein layer, about 10 Å thick, at the air-water interface. However, biophysical characterization (circular dichroism, differential scanning calorimetry) in solution shows that latherin behaves like a typical globular protein, although with unusual intrinsic fluorescence characteristics, suggesting that significant conformational change or unfolding of the protein is required for assembly of the air-water interfacial layer. RT-PCR screening revealed latherin transcripts in horse skin and salivary gland but in no other tissues. Recombinant latherin produced in bacteria was also found to be the target of IgE antibody from horse-allergic subjects. Equids therefore may have adapted an oral/salivary mucosal protein for two purposes peculiar to their lifestyle, namely their need for rapid and efficient heat dissipation and their specialisation for masticating and processing large quantities of dry food material
    corecore