48 research outputs found

pfsearchV3: a code acceleration and heuristic to search PROSITE profiles

Author: Bougueleret Lydie
Bridge Alan
Cerutti Lorenzo
Pagni Marco
Schuepbach Thierry
Xenarios Ioannis
Publication date: 02/08/2017

Summary: The PROSITE resource provides a rich and well-annotated source of signatures in the form of generalized profiles that allow protein domain detection and functional annotation. One of the major limiting factors in the application of PROSITE in genome and metagenome annotation pipelines is the time required to search protein sequence databases for putative matches. We describe an improved and optimized implementation of the PROSITE search tool pfsearch that, combined with a newly developed heuristic, addresses this limitation. On a modern x86_64 hyper-threaded quad-core desktop computer, the new pfsearchV3 is two orders of magnitude faster than the original algorithm. Availability and implementation: Source code and binaries of pfsearchV3, implemented in C and supported on Linux, are freely available for download at http://web.expasy.org/pftools/#pfsearchV3. PROSITE generalized profiles including the heuristic cut-off scores are available at the same address. Contact: [email protected]

RERO DOC Digital Library

UniPathway: a resource for the exploration and annotation of metabolic pathways

Author: Axelsen Kristian B.
Bairoch Amos
Bougueleret Lydie
Bridge Alan
Coissac Eric
Coudert Elisabeth
Keller Guillaume
Morgat Anne
Viari Alain
Xenarios Ioannis
Publication date: 02/08/2017

UniPathway (http://www.unipathway.org) is a fully manually curated resource for the representation and annotation of metabolic pathways. UniPathway provides explicit representations of enzyme-catalyzed and spontaneous chemical reactions, as well as a hierarchical representation of metabolic pathways. This hierarchy uses linear subpathways as the basic building block for the assembly of larger and more complex pathways, including species-specific pathway variants. All of the pathway data in UniPathway has been extensively cross-linked to existing pathway resources such as KEGG and MetaCyc, as well as sequence resources such as the UniProt KnowledgeBase (UniProtKB), for which UniPathway provides a controlled vocabulary for pathway annotation. We introduce here the basic concepts underlying the UniPathway resource, with the aim of allowing users to fully exploit the information provided by UniPathway.

RERO DOC Digital Library

New and continuing developments at PROSITE

Author: Bougueleret Lydie
Bridge Alan
Cerutti Lorenzo
Cuche Béatrice A.
de Castro Edouard
Hulo Nicolas
Sigrist Christian J. A.
Xenarios Ioannis
Publication date: 02/08/2017

PROSITE (http://prosite.expasy.org/) consists of documentation entries describing protein domains, families and functional sites, as well as associated patterns and profiles to identify them. It is complemented by ProRule, a collection of rules that increases the discriminatory power of these profiles and patterns by providing additional information about functionally and/or structurally critical amino acids. PROSITE signatures, together with ProRule, are used for the annotation of domains and features of UniProtKB/Swiss-Prot entries. Here, we describe recent developments that allow users to perform whole-proteome annotation as well as a number of filtering options that can be combined to perform powerful targeted searches for biological discovery. The latest version of PROSITE (release 20.85, of 30 August 2012) contains 1308 patterns, 1039 profiles and 1041 ProRule

RERO DOC Digital Library

HAMAP in 2013, new developments in the protein family classification and annotation system

Author: Auchincloss Andrea H.
Baratin Delphine
Bougueleret Lydie
Bridge Alan
Coudert Elisabeth
Cuche Béatrice A.
de Castro Edouard
Keller Guillaume
Pedruzzi Ivo
Poux Sylvain
Redaschi Nicole
Rivoire Catherine
Xenarios Ioannis
Publication date: 02/08/2017

HAMAP (High-quality Automated and Manual Annotation of Proteins, available at http://hamap.expasy.org/) is a system for the classification and annotation of protein sequences. It consists of a collection of manually curated family profiles for protein classification, and associated annotation rules that specify annotations that apply to family members. HAMAP was originally developed to support the manual curation of UniProtKB/Swiss-Prot records describing microbial proteins. Here we describe new developments in HAMAP, including the extension of HAMAP to eukaryotic proteins, the use of HAMAP in the automated annotation of UniProtKB/TrEMBL, providing high-quality annotation for millions of protein sequences, and the future integration of HAMAP into a unified system for UniProtKB annotation, UniRule. HAMAP is continuously updated by expert curators with new family profiles and annotation rules as new protein families are characterized. The collection of HAMAP family classification profiles and annotation rules can be browsed and viewed on the HAMAP website, which also provides an interface to scan user sequences against HAMAP profiles.

RERO DOC Digital Library

The SwissLipids knowledgebase for lipid biology

Author: Aimo Lucila
Bougueleret Lydie
Bridge Alan
David Fabrice P.A.
Gleizes Anne
Götz Lou
Hyka-Nouspikel Nevila
Kuznetsov Dmitry
Liechti Robin
Niknejad Anne
Riezman Howard
van der Goot F. Gisou
Xenarios Ioannis
Publication date: 02/08/2017

Motivation: Lipids are a large and diverse group of biological molecules with roles in membrane formation, energy storage and signaling. Cellular lipidomes may contain tens of thousands of structures, a staggering degree of complexity whose significance is not yet fully understood. High-throughput mass spectrometry-based platforms provide a means to study this complexity, but the interpretation of lipidomic data and its integration with prior knowledge of lipid biology suffer from a lack of appropriate tools to manage the data and extract knowledge from it. Results: To facilitate the description and exploration of lipidomic data and its integration with prior biological knowledge, we have developed SwissLipids, a knowledge resource for lipids and their biology. SwissLipids provides curated knowledge of lipid structures and metabolism which is used to generate an in silico library of feasible lipid structures. These are arranged in a hierarchical classification that links mass spectrometry analytical outputs to all possible lipid structures, metabolic reactions and enzymes. SwissLipids provides a reference namespace for lipidomic data publication, data exploration and hypothesis generation. The current version of SwissLipids includes over 244 000 known and theoretically possible lipid structures, over 800 proteins, and curated links to published knowledge from over 620 peer-reviewed publications. We are continually updating the SwissLipids hierarchy with new lipid categories and new expert curated knowledge. Availability: SwissLipids is freely available at http://www.swisslipids.org/. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.

RERO DOC Digital Library

ViralZone: a knowledge resource to understand virus diversity

Author: Adams
Amos Bairoch
Ascenzi
Bahir
Baltimore
Bao
Boehmer
Carrillo-Tripp
Carstens
Chantal Hulo
Edouard de Castro
Fauquet
Fauquet
Finsterbusch
Forterre
Gene Ontology Consortium
Glansdorff
Ioannis Xenarios
Koonin
Kristensen
Liechti
Lombardi
Lydie Bougueleret
Macnaughton
Murphy
Patrick Masson
Philippe Le Mercier
Saxena
Schlaberg
Shepherd
Suttle
Suttle
Taylor
The Gene Ontology Consortium
Van Den Wollenberg
Publication venue: Oxford University Press
Publication date: 01/01/2011

The molecular diversity of viruses complicates the interpretation of viral genomic and proteomic data. To make sense of viral gene functions, investigators must be familiar with the virus host range, replication cycle and virion structure. Our aim is to provide a comprehensive resource that bridges textbook knowledge and genomic and proteomic sequences. The ViralZone web resource (www.expasy.org/viralzone/) provides fact sheets on all known virus families/genera with easy access to sequence data. A selection of reference strains (RefStrain) provides annotated standards to circumvent the exponential increase of virus sequences. Moreover, ViralZone offers a complete set of detailed and accurate virion pictures.

Crossref

Serveur académique lausannois

PubMed Central

Archive ouverte UNIGE