
    From correlation functions to scattering amplitudes

    We study the correlators of half-BPS protected operators in N=4 super-Yang-Mills theory, in the limit where the positions of the adjacent operators become light-like separated. We compute the loop corrections by means of Lagrangian insertions. The divergences resulting from the light-cone limit are regularized by changing the dimension of the integration measure over the insertion points. Switching from coordinates to dual momenta, we show that the logarithm of the correlator is identical with twice the logarithm of the matching MHV gluon scattering amplitude. We present a number of examples of this new relation, at one and two loops.

    RegExpBlasting (REB), a Regular Expression Blasting algorithm based on multiply aligned sequences

    Background: One of the most frequent uses of bioinformatics tools is the functional characterization of a newly produced nucleotide sequence (a query sequence) by running Blast or FASTA against a set of sequences (the subject sequences). However, in some specific contexts, it is useful to compare the query sequence against a cluster such as a MultiAlignment (MA). We present here the RegExpBlasting (REB) algorithm, which compares an unclassified sequence with a dataset of patterns defined by applying Regular Expression rules to an input dataset of MAs. The REB workflow consists of (i) the definition of a dataset of multialignments; (ii) the association of each MA with a pattern, defined by applying regular expression rules; and (iii) the automatic characterization of a submitted biosequence according to the function of the sequences described by the pattern that best matches the query sequence. Results: An application of this algorithm is used in the "characterize your sequence" tool available in the PPNEMA resource. PPNEMA is a resource of Ribosomal Cistron sequences from various species, grouped according to nematode genera. It allows the retrieval of plant nematode multialigned sequences or the classification of new nematode rDNA sequences by applying REB. The same algorithm also supports automatic updating of the PPNEMA database. The present paper gives examples of the use of REB within PPNEMA. Conclusion: The use of REB in PPNEMA updating and in the PPNEMA "characterize your sequence" option demonstrates the power of the method. REB can also rapidly address other bioinformatics problems in which a new sequence must be added to a pre-existing cluster. The statistical tests carried out here show the flexibility of the method.
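
    The three-step workflow described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the column-to-pattern rule (a character class per variable column, an optional position where a gap occurs), the longest-match scoring, and the names `alignment_to_regex` and `best_match` are all assumptions made for illustration.

```python
import re


def column_pattern(column):
    """Turn one alignment column into a regex fragment (assumed rule)."""
    residues = sorted(set(column) - {"-"})
    if not residues:
        return ""  # column made only of gaps contributes nothing
    frag = residues[0] if len(residues) == 1 else "[" + "".join(residues) + "]"
    # If any sequence has a gap here, treat the position as optional.
    return frag + "?" if "-" in column else frag


def alignment_to_regex(aligned):
    """Step (ii): associate a multialignment with a regex pattern.

    `aligned` is a list of equal-length strings over A/C/G/T and '-'.
    """
    return "".join(column_pattern(col) for col in zip(*aligned))


def best_match(query, patterns):
    """Step (iii): name of the pattern with the longest match, or None."""
    best_name, best_len = None, 0
    for name, pattern in patterns.items():
        m = re.search(pattern, query)
        if m and len(m.group(0)) > best_len:
            best_name, best_len = name, len(m.group(0))
    return best_name


# Step (i): a toy dataset of multialignments, keyed by a hypothetical label.
patterns = {
    "genusA": alignment_to_regex(["ACGT-A", "ACCTGA", "ACGT-A"]),
    "genusB": alignment_to_regex(["GGGTTT", "GGGTTT"]),
}
print(patterns["genusA"])
print(best_match("TTACCTGATT", patterns))
```

    Scoring by match length is only one possible choice; a real classifier would likely need a more careful notion of "best matching" pattern than this sketch uses.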

    Rule-based knowledge aggregation for large-scale protein sequence analysis of influenza A viruses

    Background: The explosive growth of biological data provides opportunities for new statistical and comparative analyses of large information sets, such as alignments comprising tens of thousands of sequences. In such studies, sequence annotations frequently play an essential role, and reliable results depend on metadata quality. However, the semantic heterogeneity and annotation inconsistencies in biological databases greatly increase the complexity of aggregating and cleaning metadata. Manual curation of datasets, traditionally favoured by life scientists, is impractical for studies involving thousands of records. In this study, we investigate quality issues that affect major public databases, and quantify the effectiveness of an automated metadata extraction approach that combines structural and semantic rules. We applied this approach to more than 90,000 influenza A records, to annotate sequences with protein name, virus subtype, isolate, host, geographic origin, and year of isolation. Results: Over 40,000 annotated influenza A protein sequences were collected by combining information from more than 90,000 documents from NCBI public databases. Metadata values were automatically extracted, aggregated and reconciled from several document fields by applying user-defined structural rules. For each property, values were recovered from ≥88.8% of records, with accuracy exceeding 96% in most cases. Because of semantic heterogeneity, each property required up to six different structural rules to be combined. Significant quality differences between databases were found: GenBank documents yield values more reliably than documents extracted from GenPept. Using a simple set of semantic rules and a reasoner, we reconstructed relationships between sequences from the same isolate, thus identifying 7,640 isolates. Validation of isolate metadata against a simple ontology highlighted more than 400 inconsistencies, leading to over 3,000 property value corrections. Conclusion: To overcome the quality issues inherent in public databases, automated knowledge aggregation with embedded intelligence is needed for large-scale analyses. Our results show that user-controlled intuitive approaches, based on combination of simple rules, can reliably automate various curation tasks, reducing the need for manual corrections to approximately 5% of the records. Emerging semantic technologies possess desirable features to support today's knowledge aggregation tasks, with a potential to bring immediate benefits to this field.
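
    The idea of combining several structural rules per property can be sketched as an ordered list of (field, pattern) pairs that are tried until one yields a value. This is only an illustration under stated assumptions: the field names (`organism`, `strain`, `note`), the regular expressions, and the function `extract_property` are hypothetical and are not taken from the paper or from any NCBI record schema.

```python
import re

# Hypothetical structural rules for the "virus subtype" property.
# Each rule names a record field and a pattern whose first group is the value.
SUBTYPE_RULES = [
    ("organism", re.compile(r"\((H\d+N\d+)\)")),
    ("strain", re.compile(r"\((H\d+N\d+)\)")),
    ("note", re.compile(r"subtype[:\s]+(H\d+N\d+)", re.IGNORECASE)),
]


def extract_property(record, rules):
    """Return the value from the first rule that matches, else None."""
    for field, pattern in rules:
        m = pattern.search(record.get(field, ""))
        if m:
            return m.group(1)
    return None


records = [
    {"organism": "Influenza A virus", "strain": "A/Puerto Rico/8/1934(H1N1)"},
    {"organism": "Influenza A virus", "note": "Subtype: H3N2"},
    {"organism": "Influenza A virus"},
]
for rec in records:
    print(extract_property(rec, SUBTYPE_RULES))
```

    Ordering the rules from most to least trusted field is one simple way to reconcile conflicting values; the abstract does not say how the authors' rules were prioritized.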

    A Parameterization of Heterogeneous Hydrolysis of N2O5 for 3-D Atmospheric Modelling

    During night-time, the heterogeneous hydrolysis of N2O5 on the surface of deliquescent aerosol particles represents a major source of HNO3 and leads to an important reduction of NOx in the atmosphere. In Chen et al., Atmos. Chem. Phys. 18:673–689, 2018 [5], we investigated an improved parameterization of the heterogeneous N2O5 hydrolysis. This approach is based on laboratory experiments and takes into account the temperature, relative humidity, aerosol particle composition and surface area concentration. The parameterization was implemented in the online coupled model system COSMO-MUSCAT (Consortium for Small-scale Modelling and Multi-Scale Chemistry Aerosol Transport, https://cosmo-muscat.tropos.de). In that study, the modified model was applied to the simulation of the HOPE-Melpitz campaign (10–25 September 2013), with a particular focus on the nitrate prediction over western and central Europe. The modelled particulate nitrate concentrations were compared with filter measurements over Germany. In this first study, the particulate nitrate results were significantly improved by the developed N2O5 parameterization, particularly when particulate nitrate was dominated by local chemical formation (September 12, 17–18 and 25). The aim of the current study is an evaluation over a longer time period covering different meteorological conditions and emission situations. For this reason, we have simulated the period from March to November 2010. The results were compared with other approaches and evaluated against filter measurements. The improvement was confirmed for spring and autumn, but in summer nitrate is still strongly over-predicted, even with the new parameterization.

    An Introduction to RNA Databases

    We present an introduction to RNA databases. The history and technology behind RNA databases are briefly discussed. We examine differing methods of data collection and curation, and discuss their impact on both the scope and accuracy of the resulting databases. Finally, we demonstrate these principles through detailed examination of four leading RNA databases: Noncode, miRBase, Rfam, and SILVA. Comment: 27 pages, 10 figures, 1 table. Submitted as a chapter for "An introduction to RNA bioinformatics", to be published by "Methods in Molecular Biology".

    Erythrocyte Inosine Triphosphatase Activity Is Decreased in HIV-Seropositive Individuals

    Background: Inosine triphosphatase (ITPase) is encoded by the polymorphic gene ITPA and maintains low intracellular levels of the inosine nucleotides ITP and dITP. The most frequently reported polymorphisms are ITPA c.94C>A (rs1127354) and ITPA c.124+21A>C (rs7270101). Some nucleoside analogues used in the treatment of HIV-seropositive (HIV+) patients are potential substrates for ITPase. Therefore, the frequency of ITPA SNPs and ITPase activity were studied in a population of HIV+ patients. Methods: The study population consisted of 222 patients, predominantly Caucasian males, >95% using HAART. Erythrocyte ITPase activity was determined by measuring the formation of IMP from ITP. ITPA genotype was determined by sequencing genomic DNA. Distribution of ITPase activity, genotype-phenotype correlation and allele frequencies were compared to 198 control subjects. The effect of nucleoside analogues on ITPase activity was studied using lymphoblastic T-cell cultures and human recombinant ITPase. Enzyme kinetic experiments were performed on erythrocyte ITPase from HIV+ patients and controls. Results: No difference was observed in the allele frequencies between the HIV+ cohort (± HAART) and the control population. HIV+ carriers of the wild type and ITPA c.94C>A had significantly lower ITPase activities than control subjects with the same genotype (p<0.005). This was not observed in ITPA c.124+21A>C carriers. Nucleoside analogues did not affect ITPase activity in cell culture and human recombinant ITPase. Conclusion: ITPA population genetics were identical in HIV+ and control populations. However, the majority of HIV+ patients had decreased erythrocyte ITPase activity compared to controls, probably due to decreased amounts of ITPase protein. It seems unlikely that ITPase activity is decreased due to nucleoside analogues (HAART). Long-term effects of HIV infection altering ITPase protein expression or stability may explain the phenomenon observed.

    Understanding resonant charge transport through weakly coupled single-molecule junctions

    Off-resonant charge transport through molecular junctions has been extensively studied since the advent of single-molecule electronics and it is now well understood within the framework of the non-interacting Landauer approach. Conversely, gaining a qualitative and quantitative understanding of the resonant transport regime has proven more elusive. Here, we study resonant charge transport through graphene-based zinc-porphyrin junctions. We experimentally demonstrate an inadequacy of the non-interacting Landauer theory as well as the conventional single-mode Franck-Condon model. Instead, we model the overall charge transport as a sequence of non-adiabatic electron transfers, the rates of which depend on both outer and inner-sphere vibrational interactions. We show that the transport properties of our molecular junctions are determined by a combination of electron-electron and electron-vibrational coupling, and are sensitive to the interactions with the wider local environment. Furthermore, we assess the importance of nuclear tunnelling and examine the suitability of semi-classical Marcus theory as a description of charge transport in molecular devices. Comment: version accepted in Nature Communications; SI available at https://researchportal.hw.ac.uk/en/publications/understanding-resonant-charge-transport-through-weakly-coupled-s

    The Binary Protein Interactome of Treponema pallidum – The Syphilis Spirochete

    Protein interaction networks shed light on the global organization of proteomes but can also place individual proteins into a functional context. If we know the function of bacterial proteins, we will be able to understand how these species have adapted to diverse environments, including many extreme habitats. Here we present the protein interaction network for the syphilis spirochete Treponema pallidum, which encodes 1,039 proteins, 726 (or 70%) of which interact via 3,649 interactions as revealed by systematic yeast two-hybrid screens. A high-confidence subset of 991 interactions links 576 proteins. To derive further biological insights from our data, we constructed an integrated network of proteins involved in DNA metabolism. Combining our data with additional evidence, we provide improved annotations for at least 18 proteins (including TP0004, TP0050, and TP0183, which are suggested to be involved in DNA metabolism). We estimate that this “minimal” bacterium contains on the order of 3,000 protein interactions. Profiles of functional interconnections indicate that bacterial proteins interact more promiscuously than eukaryotic proteins, reflecting the non-compartmentalized structure of the bacterial cell. Using our high-confidence interactions, we also predict 417,329 homologous interactions (“interologs”) for 372 completely sequenced genomes and provide evidence that at least one third of them can be experimentally confirmed.

    Structure-Function Studies of DNA Binding Domain of Response Regulator KdpE Reveals Equal Affinity Interactions at DNA Half-Sites

    Expression of KdpFABC, a K+ pump that restores osmotic balance, is controlled by binding of the response regulator KdpE to a specific DNA sequence (kdpFABCBS) via the winged helix-turn-helix type DNA binding domain (KdpEDBD). Exploration of E. coli KdpEDBD and kdpFABCBS interaction resulted in the identification of two conserved, AT-rich 6 bp direct repeats that form half-sites. Despite binding to these half-sites, KdpEDBD was incapable of promoting gene expression in vivo. Structure-function studies guided by our 2.5 Å X-ray structure of KdpEDBD revealed the importance of residues R193 and R200 in the α-8 DNA recognition helix and T215 in the wing region for DNA binding. Mutation of these residues renders KdpE incapable of inducing expression of the kdpFABC operon. Detailed biophysical analysis of interactions using analytical ultracentrifugation revealed a 2∶1 stoichiometry of protein to DNA with dissociation constants of 200±100 and 350±100 nM at half-sites. Inactivation of one half-site does not influence binding at the other, indicating that KdpEDBD binds independently to the half-sites with approximately equal affinity and no discernible cooperativity. To our knowledge, these data are the first to describe in quantitative terms the binding at half-sites under equilibrium conditions for a member of the ubiquitous OmpR/PhoB family of proteins.

    c-di-GMP Turn-Over in Clostridium difficile Is Controlled by a Plethora of Diguanylate Cyclases and Phosphodiesterases

    Clostridium difficile infections have become a major healthcare concern in the last decade, during which the emergence of new strains has underscored this bacterium's capacity to cause persistent epidemics. c-di-GMP is a bacterial second messenger regulating diverse bacterial phenotypes, notably motility and biofilm formation, in proteobacteria such as Vibrio cholerae, Pseudomonas aeruginosa, and Salmonella. c-di-GMP is synthesized by diguanylate cyclases (DGCs) that contain a conserved GGDEF domain. It is degraded by phosphodiesterases (PDEs) that contain either an EAL or an HD-GYP conserved domain. Very little is known about the role of c-di-GMP in the regulation of phenotypes of Gram-positive or fastidious bacteria. Herein, we exposed the main components of c-di-GMP signalling in 20 genomes of C. difficile, revealed their prevalence, and predicted their enzymatic activity. Ectopic expression of 31 of these conserved genes was carried out in V. cholerae to evaluate their effect on motility and biofilm formation, two well-characterized phenotype alterations associated with intracellular c-di-GMP variation in this bacterium. Most of the predicted DGCs and PDEs were found to be active in the V. cholerae model. Expression of truncated versions of CD0522, a protein with two GGDEF domains and one EAL domain, suggests that it can act alternatively as a DGC or a PDE. The activity of one purified DGC (CD1420) and one purified PDE (CD0757) was confirmed by in vitro enzymatic assays. GTP was shown to be important for the PDE activity of CD0757. Our results indicate that, in contrast to most Gram-positive bacteria including its closest relatives, C. difficile encodes a large assortment of functional DGCs and PDEs, revealing that c-di-GMP signalling is an important and well-conserved signal transduction system in this human pathogen.