233 research outputs found
Analysis of Genetic Interaction Maps Reveals Functional Pleiotropy
Epistatic or genetic interactions, representing the effects of mutations on the phenotypes caused by other mutations, can be very helpful for uncovering functional relationships between genes. Recently, the Epistasis Miniarray Profile (E-MAP) method has emerged as a powerful approach for identifying such interactions systematically. As part of this approach, hierarchical clustering is used to partition genes into groups on the basis of the similarity between their global interaction profiles. Here we present an original biclustering algorithm for identifying groups of functionally related genes from E-MAP data in a manner that allows individual genes to be assigned to more than one functional group. This enables investigation of the pleiotropic nature of gene function, a goal that cannot be achieved with hierarchical clustering. The performance of our algorithm is illustrated by applying it to two E-MAP datasets and an E-MAP-like in silico dataset for the yeast S. cerevisiae. In addition to identifying the majority of the functional modules reported in these studies, our algorithm uncovers many recently documented and novel multi-functional relationships between genes and gene groups
Relating destabilizing regions to known functional sites in proteins
Most methods for predicting functional sites in protein 3D structures, rely on information on related proteins and cannot be applied to proteins with no known relatives. Another limitation of these methods is the lack of a well annotated set of functional sites to use as benchmark for validating their predictions. Experimental findings and theoretical considerations suggest that residues involved in function often contribute unfavorably to the native state stability. We examine the possibility of systematically exploiting this intrinsic property to identify functional sites using an original procedure that detects destabilizing regions in protein structures. In addition, to relate destabilizing regions to known functional sites, a novel benchmark consisting of a diverse set of hand-curated protein functional sites is derived.Journal ArticleResearch Support, Non-U.S. Gov'tinfo:eu-repo/semantics/publishe
Recognizing protein-protein interfaces with empirical potentials and reduced amino acid alphabets.
International audienceBACKGROUND: In structural genomics, an important goal is the detection and classification of protein-protein interactions, given the structures of the interacting partners. We have developed empirical energy functions to identify native structures of protein-protein complexes among sets of decoy structures. To understand the role of amino acid diversity, we parameterized a series of functions, using a hierarchy of amino acid alphabets of increasing complexity, with 2, 3, 4, 6, and 20 amino acid groups. Compared to previous work, we used the simplest possible functional form, with residue-residue interactions and a stepwise distance-dependence. We used increased computational resources, however, constructing 290,000 decoys for 219 protein-protein complexes, with a realistic docking protocol where the protein partners are flexible and interact through a molecular mechanics energy function. The energy parameters were optimized to correctly assign as many native complexes as possible. To resolve the multiple minimum problem in parameter space, over 64000 starting parameter guesses were tried for each energy function. The optimized functions were tested by cross validation on subsets of our native and decoy structures, by blind tests on series of native and decoy structures available on the Web, and on models for 13 complexes submitted to the CAPRI structure prediction experiment. RESULTS: Performance is similar to several other statistical potentials of the same complexity. For example, the CAPRI target structure is correctly ranked ahead of 90% of its decoys in 6 cases out of 13. The hierarchy of amino acid alphabets leads to a coherent hierarchy of energy functions, with qualitatively similar parameters for similar amino acid types at all levels. Most remarkably, the performance with six amino acid classes is equivalent to that of the most detailed, 20-class energy function. CONCLUSION: This suggests that six carefully chosen amino acid classes are sufficient to encode specificity in protein-protein interactions, and provide a starting point to develop more complicated energy functions
Metabolic PathFinding: inferring relevant pathways in biochemical networks
Our knowledge of metabolism can be represented as a network comprising several thousands of nodes (compounds and reactions). Several groups applied graph theory to analyse the topological properties of this network and to infer metabolic pathways by path finding. This is, however, not straightforward, with a major problem caused by traversing irrelevant shortcuts through highly connected nodes, which correspond to pool metabolites and co-factors (e.g. H(2)O, NADP and H(+)). In this study, we present a web server implementing two simple approaches, which circumvent this problem, thereby improving the relevance of the inferred pathways. In the simplest approach, the shortest path is computed, while filtering out the selection of highly connected compounds. In the second approach, the shortest path is computed on the weighted metabolic graph where each compound is assigned a weight equal to its connectivity in the network. This approach significantly increases the accuracy of the inferred pathways, enabling the correct inference of relatively long pathways (e.g. with as many as eight intermediate reactions). Available options include the calculation of the k-shortest paths between two specified seed nodes (either compounds or reactions). Multiple requests can be submitted in a queue. Results are returned by email, in textual as well as graphical formats (available in )
Transcriptional regulation of protein complexes in yeast
BACKGROUND: Multiprotein complexes play an essential role in many cellular processes. But our knowledge of the mechanism of their formation, regulation and lifetimes is very limited. We investigated transcriptional regulation of protein complexes in yeast using two approaches. First, known regulons, manually curated or identified by genome-wide screens, were mapped onto the components of multiprotein complexes. The complexes comprised manually curated ones and those characterized by high-throughput analyses. Second, putative regulatory sequence motifs were identified in the upstream regions of the genes involved in individual complexes and regulons were predicted on the basis of these motifs. RESULTS: Only a very small fraction of the analyzed complexes (5-6%) have subsets of their components mapping onto known regulons. Likewise, regulatory motifs are detected in only about 8-15% of the complexes, and in those, about half of the components are on average part of predicted regulons. In the manually curated complexes, the so-called 'permanent' assemblies have a larger fraction of their components belonging to putative regulons than 'transient' complexes. For the noisier set of complexes identified by high-throughput screens, valuable insights are obtained into the function and regulation of individual genes. CONCLUSIONS: A small fraction of the known multiprotein complexes in yeast seems to have at least a subset of their components co-regulated on the transcriptional level. Preliminary analysis of the regulatory motifs for these components suggests that the corresponding genes are likely to be co-regulated either together or in smaller subgroups, indicating that transcriptionally regulated modules might exist within complexes
Insight into membraneless organelles and their associated proteins: Drivers, Clients and Regulators
In recent years, attention has been devoted to proteins forming immiscible liquid phases within the liquid intracellular medium, commonly referred to as membraneless organelles (MLO). These organelles enable the spatiotemporal associations of cellular components that exchange dynamically with the cellular milieu. The dysregulation of these liquid–liquid phase separation processes (LLPS) may cause various diseases including neurodegenerative pathologies and cancer, among others. Until very recently, databases containing information on proteins forming MLOs, as well as tools and resources facilitating their analysis, were missing. This has recently changed with the publication of 4 databases that focus on different types of experiments, sets of proteins, inclusion criteria, and levels of annotation or curation. In this study we integrate and analyze the information across these databases, complement their records, and produce a consolidated set of proteins that enables the investigation of the LLPS phenomenon. To gain insight into the features that characterize different types of MLOs and the roles of their associated proteins, they were grouped into categories: High Confidence MLO associated (including Drivers and reviewed proteins), Potential Clients and Regulators, according to their annotated functions. We show that none of the databases taken alone covers the data sufficiently to enable meaningful analysis, validating our integration effort as essential for gaining better understanding of phase separation and laying the foundations for the discovery of new proteins potentially involved in this important cellular process. Lastly, we developed a server, enabling customized selections of different sets of proteins based on MLO location, database, disorder content, among other attributes.Fil: Orti, Fernando Ezequiel. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones BioquÃmicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones BioquÃmicas de Buenos Aires; ArgentinaFil: Navarro, Alvaro Martin. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones BioquÃmicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones BioquÃmicas de Buenos Aires; ArgentinaFil: Rabinovich, Andrés. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones BioquÃmicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones BioquÃmicas de Buenos Aires; ArgentinaFil: Wodak, Shoshana J.. Vrije Unviversiteit Brussel; BélgicaFil: Marino, Cristina Ester. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones BioquÃmicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones BioquÃmicas de Buenos Aires; Argentin
Structural interpretation of the amino acid sequence of a second domain from the Artemia covalent polymer globin
Artemia has a complex extracellular hemoglobin of Mr 260,000 comprising two globin chains (Mr 130,000) each of which is a polymer of eight covalently linked domains of Mr 16,000. The primary structure of this polymeric globin was studied to understand how globin folded domains are ordered within a globin chain and, in turn, how the latter associate into a functional hemoglobin molecule. Here we report the amino acid sequence of a second domain, E7 (Mr 16,081, excluding the heme), and interpretations of sequence data by computer-assisted alignment and modeling. This clearly shows that, as with domain E1 (Moens, L. Van Hauwaert, M.-L. De Smet, K. Geelen, D. Verpooten, G. Van Beeumen, J. Wodak, S. Alard, P. & Trotman, C. (1988) J. Biol. Chem. 263, 4679-4685), domain E7 is compatible with a globin folded structure of the β-type chain. Several specific differences of domains E7 and E1 from the classic globins are identified. They possibly can be interpreted in terms of specific requirements for a double octameric functional molecule.SCOPUS: ar.jinfo:eu-repo/semantics/publishe
Intercellular network structure and regulatory motifs in the human hematopoietic system.
The hematopoietic system is a distributed tissue that consists of functionally distinct cell types continuously produced through hematopoietic stem cell (HSC) differentiation. Combining genomic and phenotypic data with high-content experiments, we have built a directional cell-cell communication network between 12 cell types isolated from human umbilical cord blood. Network structure analysis revealed that ligand production is cell type dependent, whereas ligand binding is promiscuous. Consequently, additional control strategies such as cell frequency modulation and compartmentalization were needed to achieve specificity in HSC fate regulation. Incorporating the in vitro effects (quiescence, self-renewal, proliferation, or differentiation) of 27 HSC binding ligands into the topology of the cell-cell communication network allowed coding of cell type-dependent feedback regulation of HSC fate. Pathway enrichment analysis identified intracellular regulatory motifs enriched in these cell type- and ligand-coupled responses. This study uncovers cellular mechanisms of hematopoietic cell feedback in HSC fate regulation, provides insight into the design principles of the human hematopoietic system, and serves as a foundation for the analysis of intercellular regulation in multicellular systems
- …