Statistical mechanics of transcription-factor binding site discovery
  using Hidden Markov Models

A. Drawid; A. Tanay; Anirvan M. Sengupta; D.J. Schwab; David J. Schwab; E. Schneidman; G. Stormo; H. Jeffreys; J.B. Kinney; L.E. Baum; M. Djordjevic; M. Weigt; N. Halabi; O.G. Berg; P. Mahalanobis; Pankaj Mehta; R. Olsen; S. Sinha; T. Mora

research

Statistical mechanics of transcription-factor binding site discovery using Hidden Markov Models

Authors: A. Drawid
A. Tanay
Anirvan M. Sengupta
D.J. Schwab
David J. Schwab
E. Schneidman
G. Stormo
H. Jeffreys
J.B. Kinney
L.E. Baum
M. Djordjevic
M. Weigt
N. Halabi
O.G. Berg
P. Mahalanobis
Pankaj Mehta
R. Olsen
S. Sinha
T. Mora
Publication date: 27 October 2010
Publisher: 'Springer Science and Business Media LLC'
Doi

Abstract

Hidden Markov Models (HMMs) are a commonly used tool for inference of transcription factor (TF) binding sites from DNA sequence data. We exploit the mathematical equivalence between HMMs for TF binding and the "inverse" statistical mechanics of hard rods in a one-dimensional disordered potential to investigate learning in HMMs. We derive analytic expressions for the Fisher information, a commonly employed measure of confidence in learned parameters, in the biologically relevant limit where the density of binding sites is low. We then use techniques from statistical mechanics to derive a scaling principle relating the specificity (binding energy) of a TF to the minimum amount of training data necessary to learn it.Comment: 25 pages, 2 figures, 1 table V2 - typos fixed and new references adde

Similar works

Full text

Available Versions

Crossref

info:doi/10.1007%2Fs10955-010-...

Last time updated on 03/01/2020