Search CORE

1,510 research outputs found

Optimization of tau identification in ATLAS experiment using multivariate tools

Author: Wolter M
Zemla A
Publication venue: 'Sissa Medialab'
Publication date: 23/04/2007
Field of study

In elementary particle physics the efficient analysis of huge amount of collected data require the use of sophisticated selection and analysis algorithms. We have implemented a Support Vector Machine (SVM) integrated with the CERN TMVA/ROOT package. SVM approach to signal and background separation is based on building a separating hyperplane defined by the support vectors. The margin between them and the hyperplane is maximized. The extensions to a non-linear separation is performed by mapping the input vectors into a high dimensional space, in which data can be linearly separated. The use of kernel functions allows to perform computations in a high dimension feature space without explicitly knowing a mapping function. Our SVM implementation is based on Platt's Sequential Minimal Optimization (SMO) algorithm and includes various kernel functions like a linear function, polynomial and Gaussian. The identification of hadronic decays of tau leptons in the ATLAS experiment using a tau1P3P package is performed using, beside the baseline cut analysis, also multivariate analysis tools: neural network, PDE_RS and our implementation of the SVM algorithm. The use and the comparison of the three algorithms is presented

CERN Document Server

Recommended from our members

Protein Classification Based on Analysis of Local Sequence-Structure Correspondence

Author: Zemla A T
Publication venue: Lawrence Livermore National Laboratory
Publication date: 13/02/2006
Field of study

The goal of this project was to develop an algorithm to detect and calculate common structural motifs in compared structures, and define a set of numerical criteria to be used for fully automated motif based protein structure classification. The Protein Data Bank (PDB) contains more than 33,000 experimentally solved protein structures, and the Structural Classification of Proteins (SCOP) database, a manual classification of these structures, cannot keep pace with the rapid growth of the PDB. In our approach called STRALCP (STRucture Alignment based Clustering of Proteins), we generate detailed information about global and local similarities between given set of structures, identify similar fragments that are conserved within analyzed proteins, and use these conserved regions (detected structural motifs) to classify proteins

UNT Digital Library

Recommended from our members

Structural re-alignment in an immunologic surface region of ricin A chain

Author: Zemla A T
Zhou C E
Publication venue: Lawrence Livermore National Laboratory
Publication date: 24/07/2007
Field of study

We compared structure alignments generated by several protein structure comparison programs to determine whether existing methods would satisfactorily align residues at a highly conserved position within an immunogenic loop in ribosome inactivating proteins (RIPs). Using default settings, structure alignments generated by several programs (CE, DaliLite, FATCAT, LGA, MAMMOTH, MATRAS, SHEBA, SSM) failed to align the respective conserved residues, although LGA reported correct residue-residue (R-R) correspondences when the beta-carbon (Cb) position was used as the point of reference in the alignment calculations. Further tests using variable points of reference indicated that points distal from the beta carbon along a vector connecting the alpha and beta carbons yielded rigid structural alignments in which residues known to be highly conserved in RIPs were reported as corresponding residues in structural comparisons between ricin A chain, abrin-A, and other RIPs. Results suggest that approaches to structure alignment employing alternate point representations corresponding to side chain position may yield structure alignments that are more consistent with observed conservation of functional surface residues than do standard alignment programs, which apply uniform criteria for alignment (i.e., alpha carbon (Ca) as point of reference) along the entirety of the peptide chain. We present the results of tests that suggest the utility of allowing user-specified points of reference in generating alternate structural alignments, and we present a web server for automatically generating such alignments

UNT Digital Library

AS2TS system for protein structure modeling and analysis

Author: Barsky D.
Kuczmarski T.
Rama D.
Sawicka D.
Slezak T.
Torres C.
Zemla A.
Zhou C. Ecale
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

We present a set of programs and a website designed to facilitate protein structure comparison and protein structure modeling efforts. Our protein structure analysis and comparison services use the LGA (local-global alignment) program to search for regions of local similarity and to evaluate the level of structural similarity between compared protein structures. To facilitate the homology-based protein structure modeling process, our AL2TS service translates given sequence–structure alignment data into the standard Protein Data Bank (PDB) atom records (coordinates). For a given sequence of amino acids, the AS2TS (amino acid sequence to tertiary structure) system calculates (e.g. using PSI-BLAST PDB analysis) a list of the closest proteins from the PDB, and then a set of draft 3D models is automatically created. Web services are available at

CiteSeerX

Crossref

PubMed Central

Antibody Elbow Angles are Influenced by their Light Chain Class

Author: Adam Zemla
Almagro
Berman
Bernhard Rupp
Huber
Ian A. Wilson
Kabat
Kantardjieff
Lesk
Love
Marquart
Robyn L. Stanfield
Rossmann
Satow
Saul
Schiffer
Sheriff
Sotriffer
Suh
Wilson
Wukovitz
Zemla
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Recommended from our members

Antibody elbow angles are influenced by their light chain class

Author: Rupp B
Stanfield R
Wilson I
Zemla A
Publication venue: Lawrence Livermore National Laboratory
Publication date: 12/01/2006
Field of study

We have examined the elbow angles for 365 different Fab fragments, and observe that Fabs with lambda light chains have adopted a wider range of elbow angles than their kappa-chain counterparts, and that the lambda light chain Fabs are frequently found with very large (>195{sup o}) elbow angles. This apparent hyperflexibility of lambda-chain Fabs may be due to an insertion in their switch region, which is one residue longer than in kappa chains, with glycine occurring most frequently at the insertion position. A new, web-based computer program that was used to calculate the Fab elbow angles is also described

UNT Digital Library

Older-Patient-Specific Cancer Trials: A Pooled Analysis of 2,277 Patients (A151715).

Author: +6 additional authors
Budman D.
Citron M.
Cohen H. J.
Dao D.
Freedman R. A.
Hurria A.
Jatoi A.
Le-Rademacher J. G.
Muss H.
Zemla T.
Publication venue: Donald and Barbara Zucker School of Medicine Academic Works
Publication date: 01/01/2019
Field of study

BACKGROUND: Less than 3% of older patients with cancer are enrolled in clinical trials. To reverse this underrepresentation, we compared older patients enrolled with older-patient-specific trials, defined as those designed for older patients with cancer, with those enrolled in age-unspecified trials. MATERIALS AND METHODS: We focused on individual patient data from those ≥65 years (younger patients excluded) and included all Alliance phase III adjuvant breast cancer trials from 1985-2012. RESULTS: Among 2,277 patients, 1,014 had been enrolled to older-patient-specific and 1,263 to age-unspecified trials. The median age (range) in the older-patient-specific trials was 72 (65-89) years compared with 68 (65-84) years in the cohort of older patients in age-unspecified trials; CONCLUSION: Older-patient-specific trials appear to address this underrepresentation of older patients with ostensibly comparable outcomes

Hofstra Northwell Academic Works (Hofstra Northwell School of Medicine)

MannDB – A microbial database of automated protein sequence analyses and evidence integration for protein characterization

Author: Dyer Matthew D
Kuczmarski Thomas A
Lam Marisa W
Slezak Thomas R
Smith Jason R
Vitalis Elizabeth A
Zemla Adam T
Zhou Carol L Ecale
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. DESCRIPTION: MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. CONCLUSION: MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports. Access to MannDB is freely available at

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Towards Reliable Automatic Protein Structure Alignment

Author: A. Caprara
A. Zemla
A.G. Murzin
A.S. Konagurthu
C.A. Rohl
C.B. Do
G. Lancia
H.M. Berman
I.N. Shindyalov
J. Shi
J. Xu
J.F. Gibrat
K. Mizuguchi
L. Kinch
L. Xie
M. Comin
M. Levitt
M. Moakher
M. Sadowski
N.M. Daniels
N.N. Alexandrov
S. Henikoff
S. Subbiah
S.B. Needleman
S.B. Pandit
S.R. Eddy
W. Pirovano
Y. Yang
Y. Ye
Y. Zhang
Y. Zhang
Y. Zhang
Publication venue
Publication date: 01/01/2013
Field of study

A variety of methods have been proposed for structure similarity calculation, which are called structure alignment or superposition. One major shortcoming in current structure alignment algorithms is in their inherent design, which is based on local structure similarity. In this work, we propose a method to incorporate global information in obtaining optimal alignments and superpositions. Our method, when applied to optimizing the TM-score and the GDT score, produces significantly better results than current state-of-the-art protein structure alignment tools. Specifically, if the highest TM-score found by TMalign is lower than (0.6) and the highest TM-score found by one of the tested methods is higher than (0.5), there is a probability of (42%) that TMalign failed to find TM-scores higher than (0.5), while the same probability is reduced to (2%) if our method is used. This could significantly improve the accuracy of fold detection if the cutoff TM-score of (0.5) is used. In addition, existing structure alignment algorithms focus on structure similarity alone and simply ignore other important similarities, such as sequence similarity. Our approach has the capacity to incorporate multiple similarities into the scoring function. Results show that sequence similarity aids in finding high quality protein structure alignments that are more consistent with eye-examined alignments in HOMSTRAD. Even when structure similarity itself fails to find alignments with any consistency with eye-examined alignments, our method remains capable of finding alignments highly similar to, or even identical to, eye-examined alignments.Comment: Peer-reviewed and presented as part of the 13th Workshop on Algorithms in Bioinformatics (WABI2013

arXiv.org e-Print Archive

Crossref