Search CORE

2,041 research outputs found

RNA secondary structure formation: a solvable model of heteropolymer folding

Author: D. Poland
E. I. Shakhnovich
E. Zuo
G. Parisi
J. G. Wetmur
J. N. Onuchic
J. S. McCaskill
K. A. Dill
M. S. Waterman
M. Zuker
N. Go¯
P. G. de Gennes
P. G. Higgs
P. Higgs
R. Bundschuh
S. J. Chen
T. C. Lubensky
T. Garel
T. Hwa
T. Hwa
V. Muñoz
W. Sanger
Publication venue: 'American Physical Society (APS)'
Publication date: 05/03/1999
Field of study

The statistical mechanics of heteropolymer structure formation is studied in the context of RNA secondary structures. A designed RNA sequence biased energetically towards a particular native structure (a hairpin) is used to study the transition between the native and molten phase of the RNA as a function of temperature. The transition is driven by a competition between the energy gained from the polymer's overlap with the native structure and the entropic gain of forming random contacts. A simplified Go-like model is proposed and solved exactly. The predicted critical behavior is verified via exact numerical enumeration of a large ensemble of similarly designed sequences.Comment: 4 pages including 2 figure

arXiv.org e-Print Archive

Crossref

Thermodynamics of protein folding: a random matrix formulation

Author: Betancourt M R
Creighton T E
Frauenfelder H
Kleinberg J Istrail S Pevzner P Waterman M
Lee S
Mehta M L
Pragya Shukla
Richards F M
Shortle D
Shukla P
van den Berg B
Publication venue: 'IOP Publishing'
Publication date: 16/10/2010
Field of study

The process of protein folding from an unfolded state to a biologically active, folded conformation is governed by many parameters e.g the sequence of amino acids, intermolecular interactions, the solvent, temperature and chaperon molecules. Our study, based on random matrix modeling of the interactions, shows however that the evolution of the statistical measures e.g Gibbs free energy, heat capacity, entropy is single parametric. The information can explain the selection of specific folding pathways from an infinite number of possible ways as well as other folding characteristics observed in computer simulation studies.Comment: 21 Pages, no figure

arXiv.org e-Print Archive

Crossref

Counting, generating and sampling tree alignments

Author: A Dress
A Torres
C Herrbach
CB Do
G Blin
H Andrade
HS Wilf
M Höchsmann
M Höchsmann
M Vingron
MS Waterman
P Flajolet
S Schirmer
T Jiang
Y Ponty
Publication venue
Publication date: 07/03/2016
Field of study

Pairwise ordered tree alignment are combinatorial objects that appear in RNA secondary structure comparison. However, the usual representation of tree alignments as supertrees is ambiguous, i.e. two distinct supertrees may induce identical sets of matches between identical pairs of trees. This ambiguity is uninformative, and detrimental to any probabilistic analysis.In this work, we consider tree alignments up to equivalence. Our first result is a precise asymptotic enumeration of tree alignments, obtained from a context-free grammar by mean of basic analytic combinatorics. Our second result focuses on alignments between two given ordered trees

S

and

T

. By refining our grammar to align specific trees, we obtain a decomposition scheme for the space of alignments, and use it to design an efficient dynamic programming algorithm for sampling alignments under the Gibbs-Boltzmann probability distribution. This generalizes existing tree alignment algorithms, and opens the door for a probabilistic analysis of the space of suboptimal RNA secondary structures alignments.Comment: ALCOB - 3rd International Conference on Algorithms for Computational Biology - 2016, Jun 2016, Trujillo, Spain. 201

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL-Polytechnique

Cancer: leaping the E‐cadherin hurdle

Author: George T Chen
Marian L Waterman
McCrea P
Orsulic S
Publication venue: 'EMBO'
Publication date: 01/01/2015
Field of study

Aberrant activation of the Wnt signaling pathway is a common cause of colon cancer and other tumor types, accomplishing many of the hallmarks of cancer including sustained proliferative signaling, replicative immortality, reprogrammed metabolism, angiogenesis, and invasion. Yet, the dominant mutation that leads to chronic Wnt signaling in colon cancer is quite different from the spectrum of mutations that activate Wnt signaling in other tumor types. In this issue of The EMBO Journal, Huels et al (2015) focus on the influential role E-cadherin plays in shaping these differences

Crossref

PubMed Central

eScholarship - University of California

Addition-Deletion Networks

Author: Abramowitz M
Ben-Naim E
Chung F
Cooper C
Durbin R
E Ben-Naim
Farid N
Huynen M A
Kimura M
Knuth D E
Lambiotte R
Mahmoud H M
Moreno Y
P L Krapivsky
Smythe R T
von Smoluchowski M
Waterman M S
Publication venue: 'IOP Publishing'
Publication date: 23/03/2007
Field of study

We study structural properties of growing networks where both addition and deletion of nodes are possible. Our model network evolves via two independent processes. With rate r, a node is added to the system and this node links to a randomly selected existing node. With rate 1, a randomly selected node is deleted, and its parent node inherits the links of its immediate descendants. We show that the in-component size distribution decays algebraically, c_k ~ k^{-beta}, as k-->infty. The exponent beta=2+1/(r-1) varies continuously with the addition rate r. Structural properties of the network including the height distribution, the diameter of the network, the average distance between two nodes, and the fraction of dangling nodes are also obtained analytically. Interestingly, the deletion process leads to a giant hub, a single node with a macroscopic degree whereas all other nodes have a microscopic degree.Comment: 8 pages, 5 figure

arXiv.org e-Print Archive

Crossref

Rank Statistics in Biological Evolution

Author: Ben-Naim E Krapivsky P L
Durbin R
Durrett R Schweisenberg J
E Ben-Naim
Ewens W J
Felsenstein J
Gould S J
Harris T E
Horton R E
Huynen M A
John Sepkoski J
Knuth D E
P L Krapivsky
Page R D M
Simon H A
Tudge C
Waterman M S
Yule G U
Publication venue: 'IOP Publishing'
Publication date: 18/08/2005
Field of study

We present a statistical analysis of biological evolution processes. Specifically, we study the stochastic replication-mutation-death model where the population of a species may grow or shrink by birth or death, respectively, and additionally, mutations lead to the creation of new species. We rank the various species by the chronological order by which they originate. The average population N_k of the kth species decays algebraically with rank, N_k ~ M^{mu} k^{-mu}, where M is the average total population. The characteristic exponent mu=(alpha-gamma)/(alpha+beta-gamma)$ depends on alpha, beta, and gamma, the replication, mutation, and death rates. Furthermore, the average population P_k of all descendants of the kth species has a universal algebraic behavior, P_k ~ M/k.Comment: 4 pages, 3 figure

arXiv.org e-Print Archive

Crossref

Predictive value of Altmetric score on citation rates and bibliometric impact

Author: Egan Richard
Hopkins Luke
James Osian P.
Lewis Wyn G.
Powell Arfon G.M. T.
Robinson David B. T.
Waterman Jennifer
Publication venue: 'Oxford University Press (OUP)'
Publication date: 31/01/2021
Field of study

Background Bibliometric and Altmetric analyses provide different perspectives regarding research impact. This study aimed to determine whether Altmetric score was associated with citation rate independent of established bibliometrics. Methods Citations related to a previous cohort of 100 most cited articles in surgery were collected and a 3-year interval citation gain calculated. Citation count, citation rate index, Altmetric score, 5-year impact factor, and Oxford Centre for Evidence-Based Medicine levels were used to estimate citation rate prospect. Results The median interval citation gain was 161 (i.q.r. 83–281); 74 and 62 articles had an increase in citation rate index (median increase 2.8 (i.q.r. –0.1 to 7.7)) and Altmetric score (median increase 3 (0–4)) respectively. Receiver operating characteristic (ROC) curve analysis revealed that citation rate index (area under the curve (AUC) 0.86, 95 per cent c.i. 0.79 to 0.93; P < 0.001) and Altmetric score (AUC 0.65, 0.55 to 0.76; P = 0.008) were associated with higher interval citation gain. An Altmetric score critical threshold of 2 or more was associated with a better interval citation gain when dichotomized at the interval citation gain median (odds ratio (OR) 4.94, 95 per cent c.i. 1.99 to 12.26; P = 0.001) or upper quartile (OR 4.13, 1.60 to 10.66; P = 0.003). Multivariable analysis revealed only citation rate index to be independently associated with interval citation gain when dichotomized at the median (OR 18.22, 6.70 to 49.55; P < 0.001) or upper quartile (OR 19.30, 4.23 to 88.15; P < 0.001). Conclusion Citation rate index and Altmetric score appear to be important predictors of interval citation gain, and better at predicting future citations than the historical and established impact factor and Oxford Centre for Evidence-Based Medicine quality descriptors

Online Research @ Cardiff

Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score

Author: A Sali
AG Murzin
AM Lisewski
AR Ortiz
AS Yang
CA Orengo
CA Orengo
D Baker
D Kihara
F Teichert
G Vogt
G Vriend
HM Berman
IN Shindyalov
J Moult
J Skolnick
J Skolnick
J Zhu
Jeffrey Skolnick
L Holm
L Holm
M Levitt
M Novotny
ML Sierk
ML Sierk
MS Waterman
N Siew
NN Alexandrov
NN Alexandrov
P Koehl
R Kolodny
R Leplae
RB Russell
RH Lathrop
Shashi Bhushan Pandit
T Akutsu
T Shibuya
TJ Oldfield
V Alesker
WR Taylor
Y Ye
Y Zhang
Y Zhang
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2008
Field of study

©2008 Pandit and Skolnick; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article is available from: http://www.biomedcentral.com/1471-2105/9/531doi:10.1186/1471-2105-9-531Background: Protein tertiary structure comparisons are employed in various fields of contemporary structural biology. Most structure comparison methods involve generation of an initial seed alignment, which is extended and/or refined to provide the best structural superposition between a pair of protein structures as assessed by a structure comparison metric. One such metric, the TM-score, was recently introduced to provide a combined structure quality measure of the coordinate root mean square deviation between a pair of structures and coverage. Using the TM-score, the TM-align structure alignment algorithm was developed that was often found to have better accuracy and coverage than the most commonly used structural alignment programs; however, there were a number of situations when this was not true. Results: To further improve structure alignment quality, the Fr-TM-align algorithm has been developed where aligned fragment pairs are used to generate the initial seed alignments that are then refined using dynamic programming to maximize the TM-score. For the assessment of the structural alignment quality from Fr-TM-align in comparison to other programs such as CE and TMalign, we examined various alignment quality assessment scores such as PSI and TM-score. The assessment showed that the structural alignment quality from Fr-TM-align is better in comparison to both CE and TM-align. On average, the structural alignments generated using Fr-TM-align have a higher TM-score (~9%) and coverage (~7%) in comparison to those generated by TM-align. Fr- TM-align uses an exhaustive procedure to generate initial seed alignments. Hence, the algorithm is computationally more expensive than TM-align. Conclusion: Fr-TM-align, a new algorithm that employs fragment alignment and assembly provides better structural alignments in comparison to TM-align. The source code and executables of Fr- TM-align are freely downloadable at: http://cssb.biology.gatech.edu/skolnick/files/FrTMalign/

Scholarly Materials And Research @ Georgia Tech

Crossref

Directory of Open Access Journals

PubMed Central

A simple, practical and complete O-time Algorithm for RNA folding using the Four-Russians Speedup

Author: Dan Gusfield
IL Hofacker
J Kleinberg
M Zuker
M Zuker
MS Waterman
P Clote
R Backofen
R Durbin
R Nussinov
R Nussinov
SE Seemann
SL Graham
T Akutsu
TM Chan
Y Wexler
Yelena Frid
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The problem of computationally predicting the secondary structure (or folding) of RNA molecules was first introduced more than thirty years ago and yet continues to be an area of active research and development. The basic <it>RNA-folding problem </it>of finding a maximum cardinality, non-crossing, matching of complimentary nucleotides in an RNA sequence of length <it>n</it>, has an <it>O</it>(<it>n</it>3)-time dynamic programming solution that is widely applied. It is known that an <it>o</it>(<it>n</it>3) worst-case time solution is possible, but the published and suggested methods are complex and have not been established to be practical. Significant practical improvements to the original dynamic programming method have been introduced, but they retain the <it>O</it>(<it>n</it>3) worst-case time bound when <it>n </it>is the only problem-parameter used in the bound. Surprisingly, the most widely-used, general technique to achieve a worst-case (and often practical) speed up of dynamic programming, the <it>Four-Russians </it>technique, has not been previously applied to the RNA-folding problem. This is perhaps due to technical issues in adapting the technique to RNA-folding. Results In this paper, we give a simple, complete, and practical Four-Russians algorithm for the basic RNA-folding problem, achieving a worst-case time-bound of <it>O</it>(<it>n</it>3/log(<it>n</it>)). Conclusions We show that this time-bound can also be obtained for richer nucleotide matching scoring-schemes, and that the method achieves consistent speed-ups in practice. The contribution is both theoretical and practical, since the basic RNA-folding problem is often solved multiple times in the inner-loop of more complex algorithms, and for long RNA molecules in the study of RNA virus genomes.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Using XML and XSLT for flexible elicitation of mental-health risk knowledge

Author: A. Ahmed
A. E. Adams
Abernethy M A
Ayyub B M
Boegl K
Bouch J
Buckingham C D
Buckingham C D
Buzan T
C. D. Buckingham
Date C J
Del Fiol G
Dong J
Dotsika F
Harold R E
Higgins N
Holdsworth N
Jacobson D
Jovanović J
Kay M
Kouri P
Maden A
Maruish M E
Mays N
Meyer M A
Monahan J
Neuendorf K A
Novak J D
Siau K
Silverman D
Taminé O
Tomić B
Waterman D A
Watts D
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2007
Field of study

Current tools for assessing risks associated with mental-health problems require assessors to make high-level judgements based on clinical experience. This paper describes how new technologies can enhance qualitative research methods to identify lower-level cues underlying these judgements, which can be collected by people without a specialist mental-health background. Methods and evolving results: Content analysis of interviews with 46 multidisciplinary mental-health experts exposed the cues and their interrelationships, which were represented by a mind map using software that stores maps as XML. All 46 mind maps were integrated into a single XML knowledge structure and analysed by a Lisp program to generate quantitative information about the numbers of experts associated with each part of it. The knowledge was refined by the experts, using software developed in Flash to record their collective views within the XML itself. These views specified how the XML should be transformed by XSLT, a technology for rendering XML, which resulted in a validated hierarchical knowledge structure associating patient cues with risks. Conclusions: Changing knowledge elicitation requirements were accommodated by flexible transformations of XML data using XSLT, which also facilitated generation of multiple data-gathering tools suiting different assessment circumstances and levels of mental-health knowledge

Crossref

Warwick Research Archives Portal Repository