320 research outputs found

    RepSeq-A database of amino acid repeats present in lower eukaryotic pathogens

    Get PDF
    BACKGROUND Amino acid repeat-containing proteins have a broad range of functions and their identification is of relevance to many experimental biologists. In human-infective protozoan parasites (such as the Kinetoplastid and Plasmodium species), they are implicated in immune evasion and have been shown to influence virulence and pathogenicity. RepSeq http://repseq.gugbe.com is a new database of amino acid repeat-containing proteins found in lower eukaryotic pathogens. The RepSeq database is accessed via a web-based application which also provides links to related online tools and databases for further analyses. RESULTS The RepSeq algorithm typically identifies more than 98% of repeat-containing proteins and is capable of identifying both perfect and mismatch repeats. The proportion of proteins that contain repeat elements varies greatly between different families and even species (3 - 35% of the total protein content). The most common motif type is the Sequence Repeat Region (SRR) - a repeated motif containing multiple different amino acid types. Proteins containing Single Amino Acid Repeats (SAARs) and Di-Peptide Repeats (DPRs) typically account for 0.5 - 1.0% of the total protein number. Notable exceptions are P. falciparum and D. discoideum, in which 33.67% and 34.28% respectively of the predicted proteomes consist of repeat-containing proteins. These numbers are due to large insertions of low complexity single and multi-codon repeat regions. CONCLUSION The RepSeq database provides a repository for repeat-containing proteins found in parasitic protozoa. The database allows for both individual and cross-species proteome analyses and also allows users to upload sequences of interest for analysis by the RepSeq algorithm. Identification of repeat-containing proteins provides researchers with a defined subset of proteins which can be analysed by expression profiling and functional characterisation, thereby facilitating study of pathogenicity and virulence factors in the parasitic protozoa. While primarily designed for kinetoplastid work, the RepSeq algorithm and database retain full functionality when used to analyse other species

    Formulation of a 1D finite element of heat exchanger for accurate modelling of the grouting behaviour: Application to cyclic thermal loading

    Get PDF
    This paper presents a comprehensive formulation of a finite element for the modelling of borehole heat exchangers. This work focuses on the accurate modelling of the grouting and the field of temperature near a single borehole. Therefore the grouting of the BHE is explicitly modelled. The purpose of this work is to provide tools necessary to the further modelling of thermo-mechanical couplings. The finite element discretises the classical governing equation of advection-diffusion of heat within a 1D pipe connected to ground nodes. Petrov-Galerkin weighting functions are used to avoid numerical disturbances. The formulation is able to capture highly transient and steady-state phenomena. The proposed finite element is validated with respect to analytical solutions. An example consisting of a 100 m depth U-pipe is finally simulated. A first continuous heating simulation highlights the nonsymmetric distribution of temperature inside and near the borehole. An estimation of the error on the results as a function of the resolution parameters is also carried out. Finally simulations of cyclic thermal loading exhibit the need to take into account all daily variations if the grouting behaviour must be modelled. This is true especially in case of freeze-thaw damaging risk.Geotherwa

    Efficiency of closed loop geothermal heat pumps: A sensitivity analysis

    Get PDF
    Geothermal heat pumps are becoming more and more popular as the price of fossil fuels is increasing and a strong reduction of anthropogenic CO2 emissions is needed. The energy performances of these plants are closely related to the thermal and hydrogeological properties of the soil, but a proper design and installation also plays a crucial role. A set of flow and heat transport simulations has been run to evaluate the impact of different parameters on the operation of a GHSP. It is demonstrated that the BHE length is the most influential factor, that the heat carrier fluid also plays a fundamental role, and that further improvements can be obtained by using pipe spacers and highly conductive grouts. On the other hand, if the physical properties of the soil are not surveyed properly, they represent a strong factor of uncertainty when modelling the operation of these plants. The thermal conductivity of the soil has a prevailing importance and should be determined with in-situ tests (TRT), rather than assigning values from literature. When groundwater flow is present, the advection should also be considered, due to its positive effect on the performances of BHEs; by contrast, as little is currently known about thermal dispersion, relying on this transport mechanism can lead to an excessively optimistic desig

    Category Theoretic Analysis of Hierarchical Protein Materials and Social Networks

    Get PDF
    Materials in biology span all the scales from Angstroms to meters and typically consist of complex hierarchical assemblies of simple building blocks. Here we describe an application of category theory to describe structural and resulting functional properties of biological protein materials by developing so-called ologs. An olog is like a “concept web” or “semantic network” except that it follows a rigorous mathematical formulation based on category theory. This key difference ensures that an olog is unambiguous, highly adaptable to evolution and change, and suitable for sharing concepts with other olog. We consider simple cases of beta-helical and amyloid-like protein filaments subjected to axial extension and develop an olog representation of their structural and resulting mechanical properties. We also construct a representation of a social network in which people send text-messages to their nearest neighbors and act as a team to perform a task. We show that the olog for the protein and the olog for the social network feature identical category-theoretic representations, and we proceed to precisely explicate the analogy or isomorphism between them. The examples presented here demonstrate that the intrinsic nature of a complex system, which in particular includes a precise relationship between structure and function at different hierarchical levels, can be effectively represented by an olog. This, in turn, allows for comparative studies between disparate materials or fields of application, and results in novel approaches to derive functionality in the design of de novo hierarchical systems. We discuss opportunities and challenges associated with the description of complex biological materials by using ologs as a powerful tool for analysis and design in the context of materiomics, and we present the potential impact of this approach for engineering, life sciences, and medicine.Presidential Early Career Award for Scientists and Engineers (N000141010562)United States. Army Research Office. Multidisciplinary University Research Initiative (W911NF0910541)United States. Office of Naval Research (grant N000141010841)Massachusetts Institute of Technology. Dept. of MathematicsStudienstiftung des deutschen VolkesClark BarwickJacob Luri

    Composition-based statistics and translated nucleotide searches: Improving the TBLASTN module of BLAST

    Get PDF
    BACKGROUND: TBLASTN is a mode of operation for BLAST that aligns protein sequences to a nucleotide database translated in all six frames. We present the first description of the modern implementation of TBLASTN, focusing on new techniques that were used to implement composition-based statistics for translated nucleotide searches. Composition-based statistics use the composition of the sequences being aligned to generate more accurate E-values, which allows for a more accurate distinction between true and false matches. Until recently, composition-based statistics were available only for protein-protein searches. They are now available as a command line option for recent versions of TBLASTN and as an option for TBLASTN on the NCBI BLAST web server. RESULTS: We evaluate the statistical and retrieval accuracy of the E-values reported by a baseline version of TBLASTN and by two variants that use different types of composition-based statistics. To test the statistical accuracy of TBLASTN, we ran 1000 searches using scrambled proteins from the mouse genome and a database of human chromosomes. To test retrieval accuracy, we modernize and adapt to translated searches a test set previously used to evaluate the retrieval accuracy of protein-protein searches. We show that composition-based statistics greatly improve the statistical accuracy of TBLASTN, at a small cost to the retrieval accuracy. CONCLUSION: TBLASTN is widely used, as it is common to wish to compare proteins to chromosomes or to libraries of mRNAs. Composition-based statistics improve the statistical accuracy, and therefore the reliability, of TBLASTN results. The algorithms used by TBLASTN are not widely known, and some of the most important are reported here. The data used to test TBLASTN are available for download and may be useful in other studies of translated search algorithms

    ProteoLens: a visual analytic tool for multi-scale database-driven biological network data mining

    Get PDF
    Background New systems biology studies require researchers to understand how interplay among myriads of biomolecular entities is orchestrated in order to achieve high-level cellular and physiological functions. Many software tools have been developed in the past decade to help researchers visually navigate large networks of biomolecular interactions with built-in template-based query capabilities. To further advance researchers' ability to interrogate global physiological states of cells through multi-scale visual network explorations, new visualization software tools still need to be developed to empower the analysis. A robust visual data analysis platform driven by database management systems to perform bi-directional data processing-to-visualizations with declarative querying capabilities is needed. Results We developed ProteoLens as a JAVA-based visual analytic software tool for creating, annotating and exploring multi-scale biological networks. It supports direct database connectivity to either Oracle or PostgreSQL database tables/views, on which SQL statements using both Data Definition Languages (DDL) and Data Manipulation languages (DML) may be specified. The robust query languages embedded directly within the visualization software help users to bring their network data into a visualization context for annotation and exploration. ProteoLens supports graph/network represented data in standard Graph Modeling Language (GML) formats, and this enables interoperation with a wide range of other visual layout tools. The architectural design of ProteoLens enables the de-coupling of complex network data visualization tasks into two distinct phases: 1) creating network data association rules, which are mapping rules between network node IDs or edge IDs and data attributes such as functional annotations, expression levels, scores, synonyms, descriptions etc; 2) applying network data association rules to build the network and perform the visual annotation of graph nodes and edges according to associated data values. We demonstrated the advantages of these new capabilities through three biological network visualization case studies: human disease association network, drug-target interaction network and protein-peptide mapping network. Conclusion The architectural design of ProteoLens makes it suitable for bioinformatics expert data analysts who are experienced with relational database management to perform large-scale integrated network visual explorations. ProteoLens is a promising visual analytic platform that will facilitate knowledge discoveries in future network and systems biology studies

    Focal adhesion kinase contributes to proliferative potential of ErbB2 mammary tumour cells but is dispensable for ErbB2 mammary tumour induction in vivo

    Get PDF
    INTRODUCTION: Activation of focal adhesion kinase (FAK) is hypothesized to play an important role in the pathogenesis of human breast cancer. METHODS: To directly evaluate the role of FAK in mammary tumour progression, we have used a conditional FAK mouse model and mouse mammary tumour virus (MMTV)-driven Cre recombinase strain to inactivate FAK in the mammary epithelium of a transgenic mouse model of ErbB2 breast cancer. RESULTS: Although mammary epithelial disruption of FAK in this model resulted in both a delay in onset and a decrease in the number of neoplastic lesions, mammary tumours occurred in 100% of virgin female mice. All of the tumours and derived metastases that developed were proficient for FAK due to the absence of Cre recombinase expression. The hyperplastic epithelia where Cre-mediated recombination of FAK could be detected exhibited a profound proliferative defect. Consistent with these observations, disruption of FAK in established tumour cells resulted in reduced tumour growth that was associated with impaired proliferation. To avoid the selection for FAK-proficient ErbB2 tumour epithelia through escape of Cre-mediated recombination, we next intercrossed the FAK conditional mice with a separate MMTV-driven ErbB2 strain that co-expressed ErbB2 and Cre recombinase on the same transcriptional unit. CONCLUSIONS: While a delay in tumour induction was noted, FAK-deficient tumours arose in 100% of female animals indicating that FAK is dispensable for ErbB2 tumour initiation. In addition, the FAK-null ErbB2 tumours retained their metastatic potential. We further demonstrated that the FAK-related Pyk2 kinase is still expressed in these tumours and is associated with its downstream regulator p130Cas. These observations indicate that Pyk2 can functionally substitute for FAK in ErbB2 mammary tumour progression

    Hubs with Network Motifs Organize Modularity Dynamically in the Protein-Protein Interaction Network of Yeast

    Get PDF
    BACKGROUND: It has been recognized that modular organization pervades biological complexity. Based on network analysis, 'party hubs' and 'date hubs' were proposed to understand the basic principle of module organization of biomolecular networks. However, recent study on hubs has suggested that there is no clear evidence for coexistence of 'party hubs' and 'date hubs'. Thus, an open question has been raised as to whether or not 'party hubs' and 'date hubs' truly exist in yeast interactome. METHODOLOGY: In contrast to previous studies focusing on the partners of a hub or the individual proteins around the hub, our work aims to study the network motifs of a hub or interactions among individual proteins including the hub and its neighbors. Depending on the relationship between a hub's network motifs and protein complexes, we define two new types of hubs, 'motif party hubs' and 'motif date hubs', which have the same characteristics as the original 'party hubs' and 'date hubs' respectively. The network motifs of these two types of hubs display significantly different features in spatial distribution (or cellular localizations), co-expression in microarray data, controlling topological structure of network, and organizing modularity. CONCLUSION: By virtue of network motifs, we basically solved the open question about 'party hubs' and 'date hubs' which was raised by previous studies. Specifically, at the level of network motifs instead of individual proteins, we found two types of hubs, motif party hubs (mPHs) and motif date hubs (mDHs), whose network motifs display distinct characteristics on biological functions. In addition, in this paper we studied network motifs from a different viewpoint. That is, we show that a network motif should not be merely considered as an interaction pattern but be considered as an essential function unit in organizing modules of networks

    A three-dimensional numerical model of borehole heat exchanger heat transfer and fluid flow

    Get PDF
    Common approaches to the simulation of borehole heat exchangers assume heat transfer within the circulating fluid and grout to be in a quasi-steady state and ignore axial conduction heat transfer. This paper presents a numerical model that is three-dimensional, includes explicit representations of the circulating fluid and other borehole components, and so allows calculation of dynamic behaviours over short and long timescales. The model is formulated using a finite volume approach using multi-block meshes to represent the ground, pipes, fluid and grout in a geometrically correct manner. Validation and verification exercises are presented that use both short timescale data to identify transport delay effects, and long timescale data to examine the modelling of seasonal heat transfer and show the model is capable of predicting outlet temperatures and heat transfer rates accurately. At long timescales borehole heat transfer seems well characterized by the mean fluid and borehole wall temperature if the fluid circulating velocity is reasonably high but at lower flow rates this is not the case. Study of the short timescale dynamics has shown that nonlinearities in the temperature and heat flux profiles are noticeable over the whole velocity range of practical interest. The importance of representing the thermal mass of the grout and the dynamic variations in temperature gradient as well as the fluid transport within the borehole has been highlighted. Implications for simplified modelling approaches are also discussed

    The ATM and ATR inhibitors CGK733 and caffeine suppress cyclin D1 levels and inhibit cell proliferation

    Get PDF
    The ataxia telangiectasia mutated (ATM) and the ATM- related (ATR) kinases play a central role in facilitating the resistance of cancer cells to genotoxic treatment regimens. The components of the ATM and ATR regulated signaling pathways thus provide attractive pharmacological targets, since their inhibition enhances cellular sensitivity to chemo- and radiotherapy. Caffeine as well as more specific inhibitors of ATM (KU55933) or ATM and ATR (CGK733) have recently been shown to induce cell death in drug-induced senescent tumor cells. Addition of these agents to cancer cells previously rendered senescent by exposure to genotoxins suppressed the ATM mediated p21 expression required for the survival of these cells. The precise molecular pharmacology of these agents however, is not well characterized. Herein, we report that caffeine, CGK733, and to a lesser extent KU55933, inhibit the proliferation of otherwise untreated human cancer and non-transformed mouse fibroblast cell lines. Exposure of human cancer cell lines to caffeine and CGK733 was associated with a rapid decline in cyclin D1 protein levels and a reduction in the levels of both phosphorylated and total retinoblastoma protein (RB). Our studies suggest that observations based on the effects of these compounds on cell proliferation and survival must be interpreted with caution. The differential effects of caffeine/CGK733 and KU55933 on cyclin D1 protein levels suggest that these agents will exhibit dissimilar molecular pharmacological profiles
    corecore