Abstract

<p>(1) Seeds perfectly matching between query (i.e. enhancer) and target (e.g. genomic window) sequence (small black segments) are extended up- and downstream (red segments) using a match/mismatch scoring scheme to generate a raw motif profile. Motifs that overlap the predefined window boundaries are also taken into account and virtually extend the window (grey areas). (2) As a next step, overlapping regions of the extracted raw motifs in the target sequence are determined (grey areas) and the smaller motif truncated whenever it overlaps a larger one (2 to 3). Motifs smaller than the initial seed size after truncation are discarded in this step. (3) Same filtering procedure is repeated in the query sequence for the processed profile (3 to 4). (4) Motifs below the noise threshold (bright blue segment) are discarded and the basic similarity (“PURE”) score calculated from the fully filtered motif profile (dark blue). (5) In addition, a pattern detection method searches for co-linear arrangements in the profile (grey area). Panel shows the same motif composition as (4) but in a co-linear configuration. This time, the motif below the noise threshold (bright pink) is kept as it is contained in a pattern. The score of the full pattern (all pink motifs) is subsequently added to the previously calculated basic score, resulting in the “COMB” score. For a given enhancer, the whole process is repeated window by window until the last window in the target sequence is reached.</p

    Similar works

    Full text

    thumbnail-image