563 research outputs found
An investigative study into the sensitivity of different partial discharge φ-q-n pattern resolution sizes on statistical neural network pattern classification
This paper investigates the sensitivity of statistical fingerprints to different phase resolution (PR) and amplitude bins (AB) sizes of partial discharge (PD) φ-q-n (phase-amplitude-number) patterns. In particular, this paper compares the capability of the ensemble neural network (ENN) and the single neural network (SNN) in recognizing and distinguishing different resolution sizes of φ-q-n discharge patterns. The training fingerprints for both the SNN and ENN comprise statistical fingerprints from different φ-q-n measurements. The result shows that there exists statistical distinction for different PR and AB sizes on some of the statistical fingerprints. Additionally, the ENN and SNN outputs change depending on training and testing with different PR and AB sizes. Furthermore, the ENN appears to be more sensitive in recognizing and discriminating the resolution changes when compared with the SNN. Finally, the results are assessed for practical implementation in the power industry and benefits to practitioners in the field are highlighted
String Indexing for Patterns with Wildcards
We consider the problem of indexing a string of length to report the
occurrences of a query pattern containing characters and wildcards.
Let be the number of occurrences of in , and the size of
the alphabet. We obtain the following results.
- A linear space index with query time .
This significantly improves the previously best known linear space index by Lam
et al. [ISAAC 2007], which requires query time in the worst case.
- An index with query time using space , where is the maximum number of wildcards allowed in the pattern.
This is the first non-trivial bound with this query time.
- A time-space trade-off, generalizing the index by Cole et al. [STOC 2004].
We also show that these indexes can be generalized to allow variable length
gaps in the pattern. Our results are obtained using a novel combination of
well-known and new techniques, which could be of independent interest
Searching of gapped repeats and subrepetitions in a word
A gapped repeat is a factor of the form where and are nonempty
words. The period of the gapped repeat is defined as . The gapped
repeat is maximal if it cannot be extended to the left or to the right by at
least one letter with preserving its period. The gapped repeat is called
-gapped if its period is not greater than . A
-subrepetition is a factor which exponent is less than 2 but is not
less than (the exponent of the factor is the quotient of the length
and the minimal period of the factor). The -subrepetition is maximal if
it cannot be extended to the left or to the right by at least one letter with
preserving its minimal period. We reveal a close relation between maximal
gapped repeats and maximal subrepetitions. Moreover, we show that in a word of
length the number of maximal -gapped repeats is bounded by
and the number of maximal -subrepetitions is bounded by
. Using the obtained upper bounds, we propose algorithms for
finding all maximal -gapped repeats and all maximal
-subrepetitions in a word of length . The algorithm for finding all
maximal -gapped repeats has time complexity for the case
of constant alphabet size and time complexity for the
general case. For finding all maximal -subrepetitions we propose two
algorithms. The first algorithm has time
complexity for the case of constant alphabet size and time complexity for the general case. The
second algorithm has
expected time complexity
Carotid plaque area: A tool for targeting and evaluating vascular preventive therapy
Background and Purpose - Carotid plaque area measured by ultrasound (cross-sectional area of longitudinal views of all plaques seen) was studied as a way of identifying patients at increased risk of stroke, myocardial infarction, and vascular death. Methods - Patients from an atherosclerosis prevention clinic were followed up annually for up to 5 years (mean, 2.5±1.3 years) with baseline and follow-up measurements recorded. Plaque area progression (or regression) was defined as an increase (or decrease) of ≥0.05 cm2 from baseline. Results - Carotid plaque areas from 1686 patients were categorized into 4 quartile ranges: 0.00 to 0.11 cm2 (n=422), 0.12 to 0.45 cm2 (n=424), 0.46 to 1.18 cm2 (n=421), and 1.19 to 6.73 cm2 (n=419). The combined 5-year risk of stroke, myocardial infarction, and vascular death increased by quartile of plaque area: 5.6%, 10.7%, 13.9%, and 19.5%, respectively (P\u3c0.001) after adjustment for all baseline patient characteristics. A total of 1085 patients had ≥1 annual carotid plaque area measurements: 685 (63.1%) had carotid plaque progression, 306 (28.2%) had plaque regression, and 176 (16.2%) had no change in carotid plaque area over the period of follow-up. The 5-year adjusted risk of combined outcome was 9.4%, 7.6%, and 15.7% for patients with carotid plaque area regression, no change, and progression, respectively (P=0.003). Conclusions - Carotid plaque area and progression of plaque identified high-risk patients. Plaque measurement may be useful for targeting preventive therapy and evaluating new treatments and response to therapy and may improve cost-effectiveness of secondary preventive treatment
Palindromic Decompositions with Gaps and Errors
Identifying palindromes in sequences has been an interesting line of research
in combinatorics on words and also in computational biology, after the
discovery of the relation of palindromes in the DNA sequence with the HIV
virus. Efficient algorithms for the factorization of sequences into palindromes
and maximal palindromes have been devised in recent years. We extend these
studies by allowing gaps in decompositions and errors in palindromes, and also
imposing a lower bound to the length of acceptable palindromes.
We first present an algorithm for obtaining a palindromic decomposition of a
string of length n with the minimal total gap length in time O(n log n * g) and
space O(n g), where g is the number of allowed gaps in the decomposition. We
then consider a decomposition of the string in maximal \delta-palindromes (i.e.
palindromes with \delta errors under the edit or Hamming distance) and g
allowed gaps. We present an algorithm to obtain such a decomposition with the
minimal total gap length in time O(n (g + \delta)) and space O(n g).Comment: accepted to CSR 201
- …