Search CORE

8 research outputs found

Quantile distributions of amino acid usage in protein classes.

Author: Blaisdell B.E.
Bucher P.
Karlin S.
Publication venue
Publication date: 01/12/1992
Field of study

A comparative study of the compositional properties of various protein sets from both cellular and viral organisms is presented. Invariants and contrasts of amino acid usages have been discerned for different protein function classes and for different species using robust statistical methods based on quantile distributions and stochastic ordering relationships. In addition, a quantitative criterion to assess amino acid compositional extremes relative to a reference protein set is proposed and applied. Invariants of amino acid usage relate mainly to the central range of quantile distributions, whereas contrasts occur mainly in the tails of the distributions, especially contrasts between eukaryote and prokaryote species. Influences from genomic constraint are evident, for example, in the arginine:lysine ratios and the usage frequencies of residues encoded by G + C-rich versus A + T-rich codon types. The structurally similar amino acids, glutamate versus aspartate and phenylalanine versus tyrosine, show stochastic dominance relationships for most species protein sets favoring glutamate and phenylalanine respectively. The quantile distribution of hydrophobic amino acid usages in prokaryote data dominates the corresponding quantile distribution in human data. In contrast, glutamate, cysteine, proline and serine usages in human proteins dominate the corresponding quantile distributions in Escherichia coli. E. coli dominates human in the use of basic residues, but no dominance ordering applies to acidic residues. The discussion centers on commonalities and anomalies of the amino acid compositional spectrum in relation to species, function, cellular localization, biochemical and steric attributes, complexity of the amino acid biosynthetic pathway, amino acid relative abundances and founder effects

Serveur académique lausannois

Alignment free frequency based distance measures for promoter sequence comparison

Author: A. Meera
A. Meera
A. Meera
B.E. Blaisdell
B.E. Blaisdell
C.A. Leimeister
J. Felsenstein
J. Luo
P. Qui
R. Chowdhary
R.C. Edgar
S. Mantaci
S. Vinga
U. Ohler
V.I. Levenshtein
Publication venue
Publication date: 01/01/2015
Field of study

University of Mysore - Digital Repository of Research, Innovation and Scholarship (ePrints@UoM)

Crossref

Alignment Free Frequency Based Distance Measures for Promoter Sequence Comparison

Author: A. Meera
A. Meera
A. Meera
B.E. Blaisdell
B.E. Blaisdell
C.A. Leimeister
J. Felsenstein
J. Luo
P. Qui
R. Chowdhary
R.C. Edgar
S. Mantaci
S. Vinga
U. Ohler
V.I. Levenshtein
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Crossref

Isolation and Maintenance of Continuous Cultures of Epithelial Cells from Chemically-injured Adult Rabbit Lung

Author: Adamson I.Y.R.
Adamson I.Y.R.
Barile F.A.
Blaisdell F.W.
Devereux T.R.
Diglio C.
Evans M.J.
Forman H.J.
Gaudreault P.
Kikkawa Y.
Klaassen C.D.
Leary J.F.
Mason R.J.
Miller B.E.
Montgomery A.B.
Ryan S.F.
Sullivan T.M.
Publication venue: 'SAGE Publications'
Publication date
Field of study

Crossref

Estimating evolutionary distances from spaced-word matches

Author: B. Haubold
B. Haubold
B. Ma
B.E. Blaisdell
C.-A. Leimeister
C.-A. Leimeister
E. Bonnet
F. Sievers
G. Didier
G. Reinert
G.E. Sims
I. Ulitsky
J. Felsenstein
J. Lin
M. Kantorovitz
N. Saitou
R.A. Lippert
S. Vinga
S.. Robin
Publication venue: SPRINGER-VERLAG BERLIN
Publication date: 01/01/2014
Field of study

International audienceAlignment-free methods are increasingly used to estimate distances between DNA and protein sequences and to reconstruct phylogenetic trees. Most distance functions used by these methods, however, are heuristic measures of dissimilarity, not based on any explicit model of evolution. Herein, we propose a simple estimator of the evolutionary distance between two DNA sequences calculated from the number of (spaced) word matches between them. We show that this distance function estimates the evolutionary distance between DNA sequences more accurately than other distance measures used by alignment-free methods. In addition, we calculate the variance of the number of (spaced) word matches depending on sequence length and mismatch probability

HAL Evry

Crossref

HAL Descartes