74,220 research outputs found
Abstractive Multi-Document Summarization via Phrase Selection and Merging
We propose an abstraction-based multi-document summarization framework that
can construct new sentences by exploring more fine-grained syntactic units than
sentences, namely, noun/verb phrases. Different from existing abstraction-based
approaches, our method first constructs a pool of concepts and facts
represented by phrases from the input documents. Then new sentences are
generated by selecting and merging informative phrases to maximize the salience
of phrases and meanwhile satisfy the sentence construction constraints. We
employ integer linear optimization for conducting phrase selection and merging
simultaneously in order to achieve the global optimal solution for a summary.
Experimental results on the benchmark data set TAC 2011 show that our framework
outperforms the state-of-the-art models under automated pyramid evaluation
metric, and achieves reasonably well results on manual linguistic quality
evaluation.Comment: 11 pages, 1 figure, accepted as a full paper at ACL 201
An automated archival VLA transients survey
In this paper we present the results of a survey for radio transients using
data obtained from the Very Large Array archive. We have reduced, using a
pipeline procedure, 5037 observations of the most common pointings - i.e. the
calibrator fields. These fields typically contain a relatively bright point
source and are used to calibrate `target' observations: they are therefore
rarely imaged themselves. The observations used span a time range ~ 1984 - 2008
and consist of eight different pointings, three different frequencies (8.4, 4.8
and 1.4 GHz) and have a total observing time of 435 hours. We have searched for
transient and variable radio sources within these observations using components
from the prototype LOFAR transient detection system. In this paper we present
the methodology for reducing large volumes of Very Large Array data; and we
also present a brief overview of the prototype LOFAR transient detection
algorithms. No radio transients were detected in this survey, therefore we
place an upper limit on the snapshot rate of GHz frequency transients > 8.0 mJy
to rho less than or equal to 0.032 deg^-2 that have typical timescales 4.3 to
45.3 days. We compare and contrast our upper limit with the snapshot rates -
derived from either detections or non-detections of transient and variable
radio sources - reported in the literature. When compared with the current Log
N - Log S distribution formed from previous surveys, we show that our upper
limit is consistent with the observed population. Current and future radio
transient surveys will hopefully further constrain these statistics, and
potentially discover dominant transient source populations. In this paper we
also briefly explore the current transient commissioning observations with
LOFAR, and the impact they will make on the field.Comment: Accepted for publication in MNRA
Quality measures for ETL processes: from goals to implementation
Extraction transformation loading (ETL) processes play an increasingly important role for the support of modern business operations. These business processes are centred around artifacts with high variability and diverse lifecycles, which correspond to key business entities. The apparent complexity of these activities has been examined through the prism of business process management, mainly focusing on functional requirements and performance optimization. However, the quality dimension has not yet been thoroughly investigated, and there is a need for a more human-centric approach to bring them closer to business-users requirements. In this paper, we take a first step towards this direction by defining a sound model for ETL process quality characteristics and quantitative measures for each characteristic, based on existing literature. Our model shows dependencies among quality characteristics and can provide the basis for subsequent analysis using goal modeling techniques. We showcase the use of goal modeling for ETL process design through a use case, where we employ the use of a goal model that includes quantitative components (i.e., indicators) for evaluation and analysis of alternative design decisions.Peer ReviewedPostprint (author's final draft
- …