62 research outputs found

    Multiple serial episode matching

    In a previous paper we generalized the Knuth-Morris-Pratt (KMP) pattern matching algorithm, defined a non-conventional kind of RAM, the MP-RAM (a RAM equipped with extra operations), and designed an O(n) on-line algorithm for solving the serial episode matching problem on MP-RAMs when there is only a single episode. Here we give two extensions of this algorithm to the case where we search for several patterns simultaneously, and compare them. More precisely, given q+1 strings (a text t of length n and q patterns m_1, ..., m_q) and a natural number w, the multiple serial episode matching problem consists in finding the number of size-w windows of text t which contain the patterns m_1, ..., m_q as subsequences, i.e. for each m_i, if m_i = p_1 ... p_k, the letters p_1, ..., p_k occur in the window in the same order as in m_i, but not necessarily consecutively (they may be interleaved with other letters). The main contribution is an algorithm solving this problem on-line in time O(nq).
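    The windowed-subsequence condition in this abstract can be sketched with a brute-force check. This is a minimal O(n·w) illustration per pattern, not the paper's O(n) MP-RAM algorithm; all names are illustrative:

    ```python
    def count_windows(text, pattern, w):
        """Count size-w windows of text containing pattern as a subsequence.

        Naive sketch: test each window independently. The paper's on-line
        algorithm avoids re-scanning overlapping windows.
        """
        def is_subseq(window, pat):
            it = iter(window)
            # 'c in it' advances the iterator, so matches must appear in order
            return all(c in it for c in pat)

        n = len(text)
        return sum(is_subseq(text[i:i + w], pattern) for i in range(n - w + 1))
    ```

    For several patterns, the same count is simply taken over windows containing every pattern, which is where the O(nq) bound of the paper comes from.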

    Compressed Subsequence Matching and Packed Tree Coloring

    We present a new algorithm for subsequence matching in grammar-compressed strings. Given a grammar of size n compressing a string of size N and a pattern string of size m over an alphabet of size σ, our algorithm uses O(n + nσ/w) space and either O(n + nσ/w + m log N log w · occ) or O(n + (nσ/w) log w + m log N · occ) time, where w is the word size and occ is the number of occurrences of the pattern. Our algorithm uses less space than previous algorithms and is also faster for occ = o(n / log N) occurrences. The algorithm uses a new data structure that allows us to efficiently find the next occurrence of a given character after a given position in a compressed string. This data structure is in turn based on a new data structure for the tree color problem, where the node colors are packed in bit strings.
    Comment: To appear at CPM '1
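    The "next occurrence of a character after a given position" query at the heart of this abstract has a simple uncompressed analogue: an O(nσ)-space table. This sketch is only the uncompressed baseline (the paper's contribution is answering the same query on the grammar-compressed string); names are illustrative:

    ```python
    def build_next_occurrence(s):
        """nxt[i][c] = smallest j >= i with s[j] == c (absent if none).

        Built right-to-left in O(n * sigma) time and space for an
        *uncompressed* string s.
        """
        n = len(s)
        nxt = [dict() for _ in range(n + 1)]
        for i in range(n - 1, -1, -1):
            nxt[i] = dict(nxt[i + 1])  # inherit occurrences to the right
            nxt[i][s[i]] = i
        return nxt

    def is_subsequence(pat, nxt):
        """O(|pat|) subsequence query against the precomputed table."""
        i = 0
        for c in pat:
            if c not in nxt[i]:
                return False
            i = nxt[i][c] + 1  # jump past the matched occurrence
        return True
    ```

    Each pattern character is resolved in one table lookup, which is the query pattern the compressed structure must support as well.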

    Faster subsequence recognition in compressed strings

    Computation on compressed strings is one of the key approaches to processing massive data sets. We consider local subsequence recognition problems on strings compressed by straight-line programs (SLPs), a scheme closely related to Lempel-Ziv compression. For an SLP-compressed text of length m̄ and an uncompressed pattern of length n, Cégielski et al. gave an algorithm for local subsequence recognition running in time O(m̄ n² log n). We improve the running time to O(m̄ n^1.5). Our algorithm can also be used to compute the longest common subsequence between a compressed text and an uncompressed pattern in time O(m̄ n^1.5); the same problem with a compressed pattern is known to be NP-hard.
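    As a point of reference for the LCS result, the classic quadratic dynamic program on two uncompressed strings is sketched below. The paper's contribution is achieving O(m̄ n^1.5) while keeping the text compressed; this baseline is purely illustrative:

    ```python
    def lcs_length(a, b):
        """Classic O(len(a) * len(b)) LCS dynamic program.

        dp[i][j] = length of the longest common subsequence of
        a[:i] and b[:j].
        """
        m, n = len(a), len(b)
        dp = [[0] * (n + 1) for _ in range(m + 1)]
        for i in range(m):
            for j in range(n):
                if a[i] == b[j]:
                    dp[i + 1][j + 1] = dp[i][j] + 1
                else:
                    dp[i + 1][j + 1] = max(dp[i][j + 1], dp[i + 1][j])
        return dp[m][n]
    ```

    Decompressing an SLP and running this DP would cost time proportional to the (possibly exponential) decompressed length, which is exactly what compressed-string algorithms avoid.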

    Critical Differences and Clues in Eta Car's 2009 Event

    We monitored Eta Carinae with HST WFPC2 and Gemini GMOS throughout the 2009 spectroscopic event, which was expected to differ from its predecessor in 2003 (Davidson et al. 2005). Here we report major observed differences between events, and their implications. Some of these results were quite unexpected. (1) The UV brightness minimum was much deeper in 2009. This suggests that physical conditions in the early stages of an event depend on different parameters than the "normal" inter-event wind. Extra mass ejection from the primary star is one possible cause. (2) The expected He II 4687 brightness maximum was followed several weeks later by another. We explain why this fact, and the timing of the 4687 maxima, strongly support a "shock breakup" hypothesis for X-ray and 4687 behavior as proposed 5-10 years ago. (3) We observed a polar view of the star via light reflected by dust in the Homunculus nebula. Surprisingly, at that location the variations of emission-line brightness and Doppler velocities closely resembled a direct view of the star, which should not have been true for any phenomena related to the orbit. This result casts very serious doubt on all the proposed velocity interpretations that depend on the secondary star's orbital motion. (4) Latitude-dependent variations of H I, He I and Fe II features reveal aspects of wind behavior during the event. In addition, we discuss implications of the observations for several crucial unsolved problems.
    Comment: 45 pages, 9 figures, submitted to Ap

    Subsequence Automata with Default Transitions

    Let S be a string of length n with characters from an alphabet of size σ. The subsequence automaton of S (often called the directed acyclic subsequence graph) is the minimal deterministic finite automaton accepting all subsequences of S. A straightforward construction shows that the size (number of states and transitions) of the subsequence automaton is O(nσ) and that this bound is asymptotically optimal. In this paper, we consider subsequence automata with default transitions, that is, special transitions to be taken only if none of the regular transitions match the current character, and which do not consume the current character. We show that with default transitions, much smaller subsequence automata are possible, and provide a full trade-off between the size of the automaton and the delay, i.e., the maximum number of consecutive default transitions followed before consuming a character. Specifically, given any integer parameter k, 1 < k ≤ σ, we present a subsequence automaton with default transitions of size O(nk log_k σ) and delay O(log_k σ). Hence, with k = 2 we obtain an automaton of size O(n log σ) and delay O(log σ). On the other extreme, with k = σ, we obtain an automaton of size O(nσ) and delay O(1), thus matching the bound for the standard subsequence automaton construction. Finally, we generalize the result to multiple strings. The key component of our result is a novel hierarchical automata construction of independent interest.
    Comment: Corrected typo
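    The "straightforward construction" of the standard O(nσ) subsequence automaton mentioned in this abstract can be sketched as follows. This shows only the baseline automaton, not the paper's default-transition variant; names are illustrative:

    ```python
    def subsequence_automaton(s):
        """Build the standard subsequence automaton (DASG) of s.

        States are 0..n; delta[i][c] = smallest j > i with s[j-1] == c.
        Total size is O(n * sigma). Returns an acceptance predicate.
        """
        n = len(s)
        delta = [dict() for _ in range(n + 1)]
        nxt = {}
        for i in range(n, 0, -1):
            nxt = dict(nxt)        # each state keeps its own transition map
            nxt[s[i - 1]] = i      # nearest occurrence of s[i-1] to the right of i-1
            delta[i - 1] = nxt

        def accepts(pat):
            state = 0
            for c in pat:
                if c not in delta[state]:
                    return False
                state = delta[state][c]
            return True

        return accepts
    ```

    The default-transition trade-off in the paper replaces most of these O(σ) per-state entries with short chains of fallback transitions, shrinking the automaton at the cost of bounded delay per consumed character.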

    Influence of the Landscape Template on Chemical and Physical Habitat for Brown Trout Within a Boreal Stream Network

    We used the distribution of stream-dwelling brown trout (Salmo trutta) in a 67 km² boreal catchment to explore the importance of environmental organizing factors at a range of spatial scales, including whole-catchment characteristics derived from map data, and stream-reach chemical and physical characteristics. Brown trout were not observed at any sites characterized by pH < 5.0 during the spring snowmelt episode, matching published toxicity thresholds. Brown trout distributions were patchy even in less acidic regions of the stream network, positively associated with glaciofluvial substrate and negatively associated with fine sand/silty sediments. A multivariate model including only whole-catchment characteristics explained 43% of the variation in brown trout densities, while models with local site physical habitat characteristics or local stream chemistry explained 33% and 25%, respectively. At the stream-reach scale, physical habitat apparently played a primary role in organizing brown trout distributions in this stream network, with acidity placing an additional restriction by excluding brown trout from acidic headwater streams. Much of the strength of the association between catchment characteristics and fish could be explained by the correlation of catchment-scale landscape characteristics with local stream chemistry and site physical characteristics. These results, consistent with the concept of multiple hierarchical environmental filters regulating the distribution of this fish species, underline the importance of considering a range of spatial scales and both physical and chemical environments when attempting to manage or restore streams for brown trout.

    Mathematical Modelling and Machine Learning Methods for Bioinformatics and Data Science Applications

    Mathematical modeling is routinely used in the physical and engineering sciences to help understand complex systems and optimize industrial processes. Mathematical modeling differs from Artificial Intelligence because it does not rely exclusively on collected data to describe an industrial phenomenon or process; it is based on fundamental laws of physics or engineering that lead to systems of equations able to represent all the variables that characterize the process. Conversely, Machine Learning methods require large amounts of data to find solutions; they remain detached from the process that generated the data and try to infer the behavior of the object, material, or process under examination from observed samples. Mathematics allows us to formulate complex models with effectiveness and creativity, describing nature and physics. Together with the potential of Artificial Intelligence and data collection techniques, a new way of dealing with practical problems is possible. Inserting equations derived from the physical world into data-driven models can greatly enrich the information content of the sampled data, allowing us to simulate very complex phenomena with drastically reduced computation times. Combined approaches will constitute a breakthrough in cutting-edge applications, providing precise and reliable tools for the prediction of phenomena in biological macro/microsystems, for biotechnological applications, and for medical diagnostics, particularly in the field of precision medicine.