Search CORE

Directory of Open Access Journals

Online Research Database In Technology

Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments

Author: A Szalkowski
A Wozniak
C Camacho
DJ States
J Daily
J Fischer
Jeff Daily
L Wang
M Farrar
M Zhao
MI Abouelhoda
O Gotoh
S Henikoff
SF Altschul
T Rognes
T Rognes
The UniProt Consortium
Y Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation

Author: A Szalkowski
A Wirawan
A Wozniak
Intel Corporation
ITS Li
JR Miller
M Farrar
O Gotoh
S Henikoff
SF Altschul
SF Altschul
SM Rumble
T Rognes
TF Smith
Torbjørn Rognes
UniProt Consortium
W Rudnicki
Y Liu
Y Liu
Ł Ligowski
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The Smith-Waterman algorithm for local sequence alignment is more sensitive than heuristic methods for database searching, but also more time-consuming. The fastest approach to parallelisation with SIMD technology has previously been described by Farrar in 2007. The aim of this study was to explore whether further speed could be gained by other approaches to parallelisation. Results A faster approach and implementation is described and benchmarked. In the new tool SWIPE, residues from sixteen different database sequences are compared in parallel to one query residue. Using a 375 residue query sequence a speed of 106 billion cell updates per second (GCUPS) was achieved on a dual Intel Xeon X5650 six-core processor system, which is over six times more rapid than software based on Farrar's 'striped' approach. SWIPE was about 2.5 times faster when the programs used only a single thread. For shorter queries, the increase in speed was larger. SWIPE was about twice as fast as BLAST when using the BLOSUM50 score matrix, while BLAST was about twice as fast as SWIPE for the BLOSUM62 matrix. The software is designed for 64 bit Linux on processors with SSSE3. Source code is available from <url>http://dna.uio.no/swipe/</url> under the GNU Affero General Public License. Conclusions Efficient parallelisation using SIMD on standard hardware makes it possible to run Smith-Waterman database searches more than six times faster than before. The approach described here could significantly widen the potential application of Smith-Waterman searches. Other applications that require optimal local alignment scores could also benefit from improved performance.</p

Directory of Open Access Journals

NORA - Norwegian Open Research Archives

Predicted sub-populations in a marine shrimp proteome as revealed by combined EST and cDNA data from multiple Penaeus species

arXiv.org e-Print Archive

How simple can a model of an empty viral capsid be? Charge distributions in viral capsids

Author: A Lošdorfer Božič
A Šiber
A Šiber
A Šiber
A Šiber
A Šiber
Antonio Šiber
Anže Lošdorfer Božič
B Michen
CE Felder
CJ Marzec
CL Ting
DG Isom
DG Isom
J Bernal
J Langlet
JD Jackson
K Iwasaki
M Carrillo-Tripp
MR Gunner
P Ni
P Pfeiffer
P Prinsen
R Zandi
R Zandi
Rudolf Podgornik
RV Mannige
S Karlin
T Hu
The UniProt Consortium
TS Baker
VA Belyi
VA Parsegian
VB Chen
W Humphrey
WH Roos
WK Kegel
WK Kegel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/07/2012
Field of study

We investigate and quantify salient features of the charge distributions on viral capsids. Our analysis combines the experimentally determined capsid geometry with simple models for ionization of amino acids, thus yielding the detailed description of spatial distribution for positive and negative charge across the capsid wall. The obtained data is processed in order to extract the mean radii of distributions, surface charge densities and dipole moment densities. The results are evaluated and examined in light of previously proposed models of capsid charge distributions, which are shown to have to some extent limited value when applied to real viruses.Comment: 10 pages, 10 figures; accepted for publication in Journal of Biological Physic

MACiE: exploring the diversity of biochemical reactions

Author: Almonacid
Almonacid
Andreini
Andreini
Bartlett
Berman
Claudia Andreini
Cuff
Daniel E. Almonacid
Fischer
Fischer
Fleischmann
Gemma L. Holliday
Holliday
Holliday
Holliday
Holliday
Holliday
Hubbard
Julia D. Fischer
Kanehisa
McDonald
McDonald
Nagano
Najmanovich
O'Boyle
Pegg
Porter
Scheer
Sierk
Sophie T. Williams
Syed Asad Rahman
The Gene Ontology Consortium
The UniProt Consortium
Willett
William R. Pearson
Wittig
Publication venue: Oxford University Press
Publication date
Field of study

MACiE (which stands for Mechanism, Annotation and Classification in Enzymes) is a database of enzyme reaction mechanisms, and can be accessed from http://www.ebi.ac.uk/thornton-srv/databases/MACiE/. This article presents the release of Version 3 of MACiE, which not only extends the dataset to 335 entries, covering 182 of the EC sub-subclasses with a crystal structure available (∼90%), but also incorporates greater chemical and structural detail. This version of MACiE represents a shift in emphasis for new entries, from non-homologous representatives covering EC reaction space to enzymes with mechanisms of interest to our users and collaborators with a view to exploring the chemical diversity of life. We present new tools for exploring the data in MACiE and comparing entries as well as new analyses of the data and new searches, many of which can now be accessed via dedicated Perl scripts

Cape Town University OpenUCT

DAS Writeback: A Collaborative Annotation System

Author: A Grzibovska
A Jenkinson
Alexander Garcia
B Mons
C Bauer
C Pautasso
Edwin Blake
G Salazar
Google
Gustavo A Salazar
H Kilov
Henning Hermjakob
IJW Huss
J Gregorio
N Miyake
Nicola Mulder
P Jones
R Dowell
Rafael C Jimenez
RC Jimenez
S Vinoski
T Doerks
U Bhatia
UniProt Consortium
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Directory of Open Access Journals

PREX: PeroxiRedoxin classification indEX, a database of subfamily assignments across the diverse peroxiredoxin family

Author: Altschul
Bailey
Bailey
Benson
Berman
Cammer
Chris Williamson
Copley
Fetrow
Fomenko
Hall
Hofmann
Huff
Jacquelyn S. Fetrow
Kang
Karplus
Kimberly J. Nelson
Knoops
Koua
Larkin
Laura Soito
Leinonen
Leslie B. Poole
Nelson
Poole
Schaffer
Stacy T. Knutson
Thompson
UniProt Consortium
Veal
Wood
Zhang
Publication venue: Oxford University Press
Publication date
Field of study

PREX (http://www.csb.wfu.edu/prex/) is a database of currently 3516 peroxiredoxin (Prx or PRDX) protein sequences unambiguously classified into one of six distinct subfamilies. Peroxiredoxins are a diverse and ubiquitous family of highly expressed, cysteine-dependent peroxidases that are important for antioxidant defense and for the regulation of cell signaling pathways in eukaryotes. Subfamily members were identified using the Deacon Active Site Profiler (DASP) bioinformatics tool to focus in on functionally relevant sequence fragments surrounding key residues required for protein activity. Searches of this database can be conducted by protein annotation, accession number, PDB ID, organism name or protein sequence. Output includes the subfamily to which each classified Prx belongs, accession and GI numbers, genus and species and the functional site signature used for classification. The query sequence is also presented aligned with a select group of Prxs for manual evaluation and interpretation by the user. A synopsis of the characteristics of members of each subfamily is also provided along with pertinent references

Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome

Author: A Kasprzyk
AA Salamov
AV Zimin
Bovine Genome Sequencing and Analysis Consortium
C Michael Dickens
CG Elsik
CG Elsik
Christine G Elsik
Christopher P Childers
CJ Mungall
Donald C Vile
G Parra
GS Slater
Jaideep P Sundaram
Justin T Reese
K Eilbeck
KD Pruitt
Kevin L Childs
LD Stein
MS Boguski
P Flicek
RJ Wilson
SE Lewis
SF Altschul
TD Wu
The UniProt Consortium
V Solovyev
Y Liu
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

OAKTrust Digital Repository (Texas A&M Univ)

Species-level functional profiling of metagenomes and metatranscriptomes.

Author: A Sczyrba
A Shafquat
AE Duran-Pinedo
AK Sharma
B Buchfink
B Langmead
BE Suzek
BK Swan
C Burke
C Luo
Curtis Huttenhower
D Medini
DH Huson
DT Truong
DT Truong
E Pasolli
EA Franzosa
EA Franzosa
Eric A. Franzosa
George Weingart
GG Silva
Gholamali Rahnavard
H Hauswedell
J Kim
J Lloyd-Price
J Lloyd-Price
J Ravel
J. Gregory Caporaso
JA Fuhrman
K Huang
Karen Schwarzberg Lipson
Lauren J. McIver
LR Thompson
LR Thompson
Luke R. Thompson
M Hamady
M Kanehisa
M Scholz
Melanie Schirmer
MY Galperin
N Segata
N Segata
Nicola Segata
OU Mason
P Petrenko
PJ Turnbaugh
R Caspi
RC Edgar
RD Finn
Rob Knight
S Abubucker
S Nayfach
S Sunagawa
S Sunagawa
T Bose
UniProt Consortium.
W Huang
Y Ye
Y Zhao
Publication venue: eScholarship, University of California
Publication date: 01/11/2018
Field of study

Functional profiles of microbial communities are typically generated using comprehensive metagenomic or metatranscriptomic sequence read searches, which are time-consuming, prone to spurious mapping, and often limited to community-level quantification. We developed HUMAnN2, a tiered search strategy that enables fast, accurate, and species-resolved functional profiling of host-associated and environmental communities. HUMAnN2 identifies a community's known species, aligns reads to their pangenomes, performs translated search on unclassified reads, and finally quantifies gene families and pathways. Relative to pure translated search, HUMAnN2 is faster and produces more accurate gene family profiles. We applied HUMAnN2 to study clinal variation in marine metabolism, ecological contribution patterns among human microbiome pathways, variation in species' genomic versus transcriptional contributions, and strain profiling. Further, we introduce 'contributional diversity' to explain patterns of ecological assembly across different microbial community types