
Statistical significance based on the length and position of the local score in a model of i.i.d. sequences.

By Agnes Lagnoux, Sabine Mercier and Pierre Vallois

Abstract

The local score of a DNA sequence, also called the Smith and Waterman score (Wat95), is a statistical tool for detecting biologically interesting sequence regions. First, we study the accuracy of classical results on the local score distribution using a Kolmogorov-Smirnov goodness-of-fit test. Second, we highlight how the length of the segment that realizes the local score influences the classical setting, which is based on the local score only. Finally, we measure by numerical simulation the difference between the classical local score and the new one we have introduced.
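
For readers unfamiliar with the quantity, here is a minimal sketch of how a local score, together with the position and length of the segment that realizes it, can be computed from a sequence of per-letter scores. It is not taken from the paper: it assumes the standard definition of the local score as the maximum sum over all contiguous segments (0 for the empty segment), and the function name `local_score` and the example scores are hypothetical.

```python
def local_score(scores):
    """Return (local score, start index, length) of the first best-scoring segment.

    Assumes the usual definition: the maximum over all contiguous segments
    of the sum of their scores, with 0 for the empty segment.
    """
    best = 0        # best segment sum seen so far (the local score)
    best_start = 0  # 0-based start index of the segment realizing it
    best_len = 0    # length of that segment
    running = 0     # sum of the current candidate segment
    start = 0       # start index of the current candidate segment

    for k, x in enumerate(scores):
        running += x
        if running <= 0:
            # A non-positive prefix can never help: restart after position k.
            running = 0
            start = k + 1
        elif running > best:
            best = running
            best_start = start
            best_len = k - start + 1

    return best, best_start, best_len


# Hypothetical per-letter scores, for illustration only.
example = [-1, 2, 3, -4, 1, 2, -1]
print(local_score(example))
```

The single pass keeps the running sum non-negative, so the cost is linear in the sequence length. Ties are resolved in favour of the earliest segment, which is a choice of this sketch rather than something the abstract specifies.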

Topics: [MATH]Mathematics [math], [STAT]Statistics [stat]
Publisher: Oxford University Press (OUP)
Year: 2017
DOI identifier: 10.1093/bioinformatics/btw699
OAI identifier: oai:HAL:hal-01301246v2