742 research outputs found
MIPS: analysis and annotation of genome information in 2007
The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. To cope with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de)
Gene3D: comprehensive structural and functional annotation of genomes
Gene3D provides comprehensive structural and functional annotation of most available protein sequences, including the UniProt, RefSeq and Integr8 resources. The main structural annotation is generated through scanning these sequences against the CATH structural domain database profile-HMM library. CATH is a database of manually derived PDB-based structural domains, placed within a hierarchy reflecting topology, homology and conservation and is able to infer more ancient and divergent homology relationships than sequence-based approaches. This data is supplemented with Pfam-A, other non-domain structural predictions (i.e. coiled coils) and experimental data from UniProt. In order to enhance the investigations possible with this data, we have also incorporated a variety of protein annotation resources, including protein–protein interaction data, GO functional assignments, KEGG pathways, FUNCAT functional descriptions and links to microarray expression data. All of this data can be accessed through a newly re-designed website that has a focus on flexibility and clarity, with searches that can be restricted to a single genome or across the entire sequence database. Currently Gene3D contains over 3.5 million domain assignments for nearly 5 million proteins including 527 completed genomes. This is available at: http://gene3d.biochem.ucl.ac.uk
Endothelial dysfunction contributes to renal function-associated cardiovascular mortality in a population with mild renal insufficiency: The Hoorn study
Mildly impaired renal function is associated with cardiovascular morbidity and mortality. There are indications that endothelial dysfunction and/or chronic inflammation, which play an important role in atherothrombosis, are present in early stages of renal insufficiency. This study investigated whether and to which extent endothelial dysfunction and inflammation were related to renal function and contributed to renal function-associated cardiovascular mortality in a population-based cohort (n = 613), aged 50 to 75 yr, that was followed with a median duration of 12.5 yr. During follow-up, 192 individuals died (67 of cardiovascular causes). At baseline, renal function was estimated with serum creatinine, the Cockcroft-Gault formula, and the Modification of Diet in Renal Disease equation of GFR (eGFR). Endothelial function was estimated by plasma von Willebrand factor, soluble vascular cell adhesion molecule-1, and the urinary albumin-creatinine ratio. Inflammatory activity was estimated by plasma C-reactive protein and soluble intercellular adhesion molecule-1. Renal function was mildly impaired (mean eGFR 68 ± 12 ml/min per 1.73
FGDB: revisiting the genome annotation of the plant pathogen Fusarium graminearum
The MIPS Fusarium graminearum Genome Database (FGDB) was established as a comprehensive genome database on one of the most devastating fungal plant pathogens of wheat, barley and maize. The current version of FGDB v3.1 provides information on the full manually revised gene set based on the Broad Institute assembly FG3 genome sequence. The results of gene prediction tools were integrated with the help of comparative data on related species to result in a set of 13.718 annotated protein coding genes. This rigorous approach involved adding or modifying gene models and represents a coding sequence gold standard for the genus Fusarium. The gene loci improvements results in 2461 genes which either are new or have different structures compared to the Broad Institute assembly 3 gene set. Moreover the database serves as a convenient entry point to explore expression data results and to obtain information on the Affymetrix GeneChip probe sets. The resource is accessible on http://mips.gsf.de/genre/proj/FGDB/
Finding undetected protein associations in cell signaling by belief propagation
External information propagates in the cell mainly through signaling cascades
and transcriptional activation, allowing it to react to a wide spectrum of
environmental changes. High throughput experiments identify numerous molecular
components of such cascades that may, however, interact through unknown
partners. Some of them may be detected using data coming from the integration
of a protein-protein interaction network and mRNA expression profiles. This
inference problem can be mapped onto the problem of finding appropriate optimal
connected subgraphs of a network defined by these datasets. The optimization
procedure turns out to be computationally intractable in general. Here we
present a new distributed algorithm for this task, inspired from statistical
physics, and apply this scheme to alpha factor and drug perturbations data in
yeast. We identify the role of the COS8 protein, a member of a gene family of
previously unknown function, and validate the results by genetic experiments.
The algorithm we present is specially suited for very large datasets, can run
in parallel, and can be adapted to other problems in systems biology. On
renowned benchmarks it outperforms other algorithms in the field.Comment: 6 pages, 3 figures, 1 table, Supporting Informatio
The evolutionary dynamics of the Saccharomyces cerevisiae protein interaction network after duplication
Gene duplication is an important mechanism in the evolution of protein interaction networks. Duplications are followed by the gain and loss of interactions, rewiring the network at some unknown rate. Because rewiring is likely to change the distribution of network motifs within the duplicated interaction set, it should be possible to study network rewiring by tracking the evolution of these motifs. We have developed a mathematical framework that, together with duplication data from comparative genomic and proteomic studies, allows us to infer the connectivity of the preduplication network and the changes in connectivity over time. We focused on the whole-genome duplication (WGD) event in Saccharomyces cerevisiae. The model allowed us to predict the frequency of intergene interaction before WGD and the post duplication probabilities of interaction gain and loss. We find that the predicted frequency of self-interactions in the preduplication network is significantly higher than that observed in today's network. This could suggest a structural difference between the modern and ancestral networks, preferential addition or retention of interactions between ohnologs, or selective pressure to preserve duplicates of self-interacting proteins
CORUM: the comprehensive resource of mammalian protein complexes
Protein complexes are key molecular entities that integrate multiple gene products to perform cellular functions. The CORUM (http://mips.gsf.de/genre/proj/corum/index.html) database is a collection of experimentally verified mammalian protein complexes. Information is manually derived by critical reading of the scientific literature from expert annotators. Information about protein complexes includes protein complex names, subunits, literature references as well as the function of the complexes. For functional annotation, we use the FunCat catalogue that enables to organize the protein complex space into biologically meaningful subsets. The database contains more than 1750 protein complexes that are built from 2400 different genes, thus representing 12% of the protein-coding genes in human. A web-based system is available to query, view and download the data. CORUM provides a comprehensive dataset of protein complexes for discoveries in systems biology, analyses of protein networks and protein complex-associated diseases. Comparable to the MIPS reference dataset of protein complexes from yeast, CORUM intends to serve as a reference for mammalian protein complexes
Taal en veiligheid: een groeiend nieuw werkveld [Language and security: a growing new field of work]
Language issues cause safety problems at work. This study presents an inventory of scientific studies in the economic sectors and assesses which level of risk management they address. Complications with language have not been investigated comprehensively across sectors as a causal factor in accidents. This leaves language related risks partially unknown, hence uncontrolled. There is lack of insight in both the nature and magnitude of this danger in healthcare, agricultural, transport and construction sectors. Healthcare is especially troublesome since patients might be victims of language related accidents due to their presence and interaction. The same may occur to members of the public in traffic accidents. In transport and agricultural sectors safety measures were taken without any analysis of language related risks. This study shows that scientific research on ‘language and safety’ is in its infancy and requires priority on the research agenda
CORUM: the comprehensive resource of mammalian protein complexes—2009
CORUM is a database that provides a manually curated repository of experimentally characterized protein complexes from mammalian organisms, mainly human (64%), mouse (16%) and rat (12%). Protein complexes are key molecular entities that integrate multiple gene products to perform cellular functions. The new CORUM 2.0 release encompasses 2837 protein complexes offering the largest and most comprehensive publicly available dataset of mammalian protein complexes. The CORUM dataset is built from 3198 different genes, representing ∼16% of the protein coding genes in humans. Each protein complex is described by a protein complex name, subunit composition, function as well as the literature reference that characterizes the respective protein complex. Recent developments include mapping of functional annotation to Gene Ontology terms as well as cross-references to Entrez Gene identifiers. In addition, a ‘Phylogenetic Conservation’ analysis tool was implemented that analyses the potential occurrence of orthologous protein complex subunits in mammals and other selected groups of organisms. This allows one to predict the occurrence of protein complexes in different phylogenetic groups. CORUM is freely accessible at (http://mips.helmholtz-muenchen.de/genre/proj/corum/index.html)
Endothelial dysfunction and inflammation in asymptomatic proteinuria
Background. Proteinuria is associated with vascular risk and a systemic increase in vascular permeability. Endothelial dysfunction occurs early in atherosclerosis and modulates vascular permeability. Vascular risk and chronic inflammation are associated. This study investigates whether the increased vascular permeability in proteinuria reflects systemic endothelial dysfunction and chronic inflammation. Methods. Twenty-one patients with asymptomatic proteinuria (1.29 g/24 h; range 0.18 to 3.17) and 21 matched controls were studied. Microvascular endothelial function was assessed using acetylcholine iontophoresis. Maximum microvascular hyperemia (MMH) was assessed by flux response to local skin heating. Macrovascular endothelial function was assessed by flow- associated dilation (FAD) in the brachial artery using ultrasound. von Willebrand factor (vWF) was measured as a marker of endothelial activation. Low-grade inflammation was assessed by measurement of circulating C-reactive protein (CRP) values using a high sensitivity assay. Results. FAD was impaired in proteinuric subjects (AP) compared to controls [1.8 (0.2 to 5.3) AP vs. 3.8 (1.5 to 6.2) C %; P = 0.014]. There was no significant difference between groups in MMH or in the response to acetylcholine iontophoresis. The AP group had a higher CRP [4.0 (0.5 to 39.0) AP vs. 0.2 (0.1 to 21.3) C mg/L; P lt 0.001] and tendency to higher vWF [101.5 (67.0 to 197.0) AP vs. 77.5 (45.0 to 185.0) C IU/dL; P = 0.046] compared to controls. In the AP, but not control, group there was an inverse correlation between CRP and microvascular function as determined by acetylcholine iontophoresis (r = -0.509; P = 0.018). Conclusions. In AP subjects there is evidence of macrovascular endothelial dysfunction remote from the kidney and of low-grade inflammation that is associated with microvascular endothelial dysfunction
- …