563 research outputs found

    An investigative study into the sensitivity of different partial discharge φ-q-n pattern resolution sizes on statistical neural network pattern classification

    Get PDF
    This paper investigates the sensitivity of statistical fingerprints to different phase resolution (PR) and amplitude bins (AB) sizes of partial discharge (PD) φ-q-n (phase-amplitude-number) patterns. In particular, this paper compares the capability of the ensemble neural network (ENN) and the single neural network (SNN) in recognizing and distinguishing different resolution sizes of φ-q-n discharge patterns. The training fingerprints for both the SNN and ENN comprise statistical fingerprints from different φ-q-n measurements. The result shows that there exists statistical distinction for different PR and AB sizes on some of the statistical fingerprints. Additionally, the ENN and SNN outputs change depending on training and testing with different PR and AB sizes. Furthermore, the ENN appears to be more sensitive in recognizing and discriminating the resolution changes when compared with the SNN. Finally, the results are assessed for practical implementation in the power industry and benefits to practitioners in the field are highlighted

    String Indexing for Patterns with Wildcards

    Get PDF
    We consider the problem of indexing a string tt of length nn to report the occurrences of a query pattern pp containing mm characters and jj wildcards. Let occocc be the number of occurrences of pp in tt, and σ\sigma the size of the alphabet. We obtain the following results. - A linear space index with query time O(m+σjloglogn+occ)O(m+\sigma^j \log \log n + occ). This significantly improves the previously best known linear space index by Lam et al. [ISAAC 2007], which requires query time Θ(jn)\Theta(jn) in the worst case. - An index with query time O(m+j+occ)O(m+j+occ) using space O(σk2nlogklogn)O(\sigma^{k^2} n \log^k \log n), where kk is the maximum number of wildcards allowed in the pattern. This is the first non-trivial bound with this query time. - A time-space trade-off, generalizing the index by Cole et al. [STOC 2004]. We also show that these indexes can be generalized to allow variable length gaps in the pattern. Our results are obtained using a novel combination of well-known and new techniques, which could be of independent interest

    Searching of gapped repeats and subrepetitions in a word

    Full text link
    A gapped repeat is a factor of the form uvuuvu where uu and vv are nonempty words. The period of the gapped repeat is defined as u+v|u|+|v|. The gapped repeat is maximal if it cannot be extended to the left or to the right by at least one letter with preserving its period. The gapped repeat is called α\alpha-gapped if its period is not greater than αv\alpha |v|. A δ\delta-subrepetition is a factor which exponent is less than 2 but is not less than 1+δ1+\delta (the exponent of the factor is the quotient of the length and the minimal period of the factor). The δ\delta-subrepetition is maximal if it cannot be extended to the left or to the right by at least one letter with preserving its minimal period. We reveal a close relation between maximal gapped repeats and maximal subrepetitions. Moreover, we show that in a word of length nn the number of maximal α\alpha-gapped repeats is bounded by O(α2n)O(\alpha^2n) and the number of maximal δ\delta-subrepetitions is bounded by O(n/δ2)O(n/\delta^2). Using the obtained upper bounds, we propose algorithms for finding all maximal α\alpha-gapped repeats and all maximal δ\delta-subrepetitions in a word of length nn. The algorithm for finding all maximal α\alpha-gapped repeats has O(α2n)O(\alpha^2n) time complexity for the case of constant alphabet size and O(nlogn+α2n)O(n\log n + \alpha^2n) time complexity for the general case. For finding all maximal δ\delta-subrepetitions we propose two algorithms. The first algorithm has O(nloglognδ2)O(\frac{n\log\log n}{\delta^2}) time complexity for the case of constant alphabet size and O(nlogn+nloglognδ2)O(n\log n +\frac{n\log\log n}{\delta^2}) time complexity for the general case. The second algorithm has O(nlogn+nδ2log1δ)O(n\log n+\frac{n}{\delta^2}\log \frac{1}{\delta}) expected time complexity

    Carotid plaque area: A tool for targeting and evaluating vascular preventive therapy

    Get PDF
    Background and Purpose - Carotid plaque area measured by ultrasound (cross-sectional area of longitudinal views of all plaques seen) was studied as a way of identifying patients at increased risk of stroke, myocardial infarction, and vascular death. Methods - Patients from an atherosclerosis prevention clinic were followed up annually for up to 5 years (mean, 2.5±1.3 years) with baseline and follow-up measurements recorded. Plaque area progression (or regression) was defined as an increase (or decrease) of ≥0.05 cm2 from baseline. Results - Carotid plaque areas from 1686 patients were categorized into 4 quartile ranges: 0.00 to 0.11 cm2 (n=422), 0.12 to 0.45 cm2 (n=424), 0.46 to 1.18 cm2 (n=421), and 1.19 to 6.73 cm2 (n=419). The combined 5-year risk of stroke, myocardial infarction, and vascular death increased by quartile of plaque area: 5.6%, 10.7%, 13.9%, and 19.5%, respectively (P\u3c0.001) after adjustment for all baseline patient characteristics. A total of 1085 patients had ≥1 annual carotid plaque area measurements: 685 (63.1%) had carotid plaque progression, 306 (28.2%) had plaque regression, and 176 (16.2%) had no change in carotid plaque area over the period of follow-up. The 5-year adjusted risk of combined outcome was 9.4%, 7.6%, and 15.7% for patients with carotid plaque area regression, no change, and progression, respectively (P=0.003). Conclusions - Carotid plaque area and progression of plaque identified high-risk patients. Plaque measurement may be useful for targeting preventive therapy and evaluating new treatments and response to therapy and may improve cost-effectiveness of secondary preventive treatment

    Palindromic Decompositions with Gaps and Errors

    Full text link
    Identifying palindromes in sequences has been an interesting line of research in combinatorics on words and also in computational biology, after the discovery of the relation of palindromes in the DNA sequence with the HIV virus. Efficient algorithms for the factorization of sequences into palindromes and maximal palindromes have been devised in recent years. We extend these studies by allowing gaps in decompositions and errors in palindromes, and also imposing a lower bound to the length of acceptable palindromes. We first present an algorithm for obtaining a palindromic decomposition of a string of length n with the minimal total gap length in time O(n log n * g) and space O(n g), where g is the number of allowed gaps in the decomposition. We then consider a decomposition of the string in maximal \delta-palindromes (i.e. palindromes with \delta errors under the edit or Hamming distance) and g allowed gaps. We present an algorithm to obtain such a decomposition with the minimal total gap length in time O(n (g + \delta)) and space O(n g).Comment: accepted to CSR 201
    corecore