183,273 research outputs found
Adaptive text mining: Inferring structure from sequences
Text mining is about inferring structure from sequences representing natural language text, and may be defined as the process of analyzing text to extract information that is useful for particular purposes. Although hand-crafted heuristics are a common practical approach for extracting information from text, a general, and generalizable, approach requires adaptive techniques. This paper studies the way in which the adaptive techniques used in text compression can be applied to text mining. It develops several examples: extraction of hierarchical phrase structures from text, identification of keyphrases in documents, locating proper names and quantities of interest in a piece of text, text categorization, word segmentation, acronym extraction, and structure recognition. We conclude that compression forms a sound unifying principle that allows many text mining problems to be tacked adaptively
Modelling Spatial Compositional Data: Reconstructions of past land cover and uncertainties
In this paper, we construct a hierarchical model for spatial compositional
data, which is used to reconstruct past land-cover compositions (in terms of
coniferous forest, broadleaved forest, and unforested/open land) for five time
periods during the past years over Europe. The model consists of a
Gaussian Markov Random Field (GMRF) with Dirichlet observations. A block
updated Markov chain Monte Carlo (MCMC), including an adaptive Metropolis
adjusted Langevin step, is used to estimate model parameters. The sparse
precision matrix in the GMRF provides computational advantages leading to a
fast MCMC algorithm. Reconstructions are obtained by combining pollen-based
estimates of vegetation cover at a limited number of locations with scenarios
of past deforestation and output from a dynamic vegetation model. To evaluate
uncertainties in the predictions a novel way of constructing joint confidence
regions for the entire composition at each prediction location is proposed. The
hierarchical model's ability to reconstruct past land cover is evaluated
through cross validation for all time periods, and by comparing reconstructions
for the recent past to a present day European forest map. The evaluation
results are promising and the model is able to capture known structures in past
land-cover compositions
Hierarchical Tree-Structures as Adaptive Meshes
Introduction: Two basic types of simulations exist for modeling systems of many particles: grid-based (point particles indirectly interacting with one another through the potential calculated from equivalent particle densities on a mesh) and particle-based (point particles directly interacting with a one another through potentials at their positions calculated from the other particles in the system). Grid-based solvers traditionally model continuum problems, such as fluid and gas systems, and mixed particle-continuum systems. Particle-based solvers find more use modeling discrete systems such as stars within galaxies or other rarefied gases. Many different physical systems, including electromagnetic interactions, gravitational interactions, and fluid vortex interactions, all are governed by Poisson\u27s Equation..
- …