11,832 research outputs found
A Continuously Growing Dataset of Sentential Paraphrases
A major challenge in paraphrase research is the lack of parallel corpora. In
this paper, we present a new method to collect large-scale sentential
paraphrases from Twitter by linking tweets through shared URLs. The main
advantage of our method is its simplicity, as it gets rid of the classifier or
human in the loop needed to select data before annotation and subsequent
application of paraphrase identification algorithms in the previous work. We
present the largest human-labeled paraphrase corpus to date of 51,524 sentence
pairs and the first cross-domain benchmarking for automatic paraphrase
identification. In addition, we show that more than 30,000 new sentential
paraphrases can be easily and continuously captured every month at ~70%
precision, and demonstrate their utility for downstream NLP tasks through
phrasal paraphrase extraction. We make our code and data freely available.Comment: 11 pages, accepted to EMNLP 201
Theoretical and Empirical Studies of Productivity Growth in the Agricultural Economics –– Cases of China and the United States
AbstractThis article investigates agricultural productivity growth over several decades, emphasizing to a great extent the agricultural economic development condition for the nine agricultural divisions of the United States, and China's 27 provinces in terms of Malmquist productivity growth index. The paper sets up a technique to make use of two-stage linear programming method, based on sequential production technology, to estimate the most fitted and reliable distance functions in relevant agricultural sectors, and thus to compute the Malmquist productivity indexes. Especially, it proposes to decompose the productivity growth index into two major components, technical progress and efficiency improvement, and their sub-components, to study the sources of growth in productivity
Research Update: Bud Bank Ecology for Understanding Perennial Grass Persistence
Grassland ecosystems often demonstrate very remarkable resiliency to severe natural and anthropogenic disturbances. Such resiliency following disturbances comes from either seed banks (germinable seeds in the soil) or bud banks (meristems or buds, such as bulbs, bulbils, and buds on rhizomes, corms, and tubers, that generate vegetative tissues). Although seeds are important for dispersal, initial colonization, and maintenance of genetic diversity; few grass seeds persist in the soil more than five years, plus seed production often is unreliable under grazing. Recent studies have demonstrated that \u3e99% of aboveground stems in undisturbed tallgrass prairie were recruited from the bud bank while \u3c1% were recruited from the seed bank. Even under grazed or disturbed sites in tallgrass prairie, belowground buds make a significantly larger contribution (80%) to plant recruitment than do seeds (20%)
Speech enhancement by perceptual adaptive wavelet de-noising
This thesis work summarizes and compares the existing wavelet de-noising methods. Most popular methods of wavelet transform, adaptive thresholding, and musical noise suppression have been analyzed theoretically and evaluated through Matlab simulation. Based on the above work, a new speech enhancement system using adaptive wavelet de-noising is proposed. Each step of the standard wavelet thresholding is improved by optimized adaptive algorithms. The Quantile based adaptive noise estimate and the posteriori SNR based threshold adjuster are compensatory to each other. The combination of them integrates the advantages of these two approaches and balances the effects of noise removal and speech preservation. In order to improve the final perceptual quality, an innovative musical noise analysis and smoothing algorithm and a Teager Energy Operator based silent segment smoothing module are also introduced into the system. The experimental results have demonstrated the capability of the proposed system in both stationary and non-stationary noise environments
Policy Design for Controlling Set-Point Temperature of ACs in Shared Spaces of Buildings
Air conditioning systems are responsible for the major percentage of energy
consumption in buildings. Shared spaces constitute considerable office space
area, in which most office employees perform their meetings and daily tasks,
and therefore the ACs in these areas have significant impact on the energy
usage of the entire office building. The cost of this energy consumption,
however, is not paid by the shared space users, and the AC's temperature
set-point is not determined based on the users' preferences. This latter factor
is compounded by the fact that different people may have different choices of
temperature set-points and sensitivities to change of temperature. Therefore,
it is a challenging task to design an office policy to decide on a particular
set-point based on such a diverse preference set. As a result, users are not
aware of the energy consumption in shared spaces, which may potentially
increase the energy wastage and related cost of office buildings. In this
context, this paper proposes an energy policy for an office shared space by
exploiting an established temperature control mechanism. In particular, we
choose meeting rooms in an office building as the test case and design a policy
according to which each user of the room can give a preference on the
temperature set-point and is paid for felt discomfort if the set-point is not
fixed according to the given preference. On the other hand, users who enjoy the
thermal comfort compensate the other users of the room. Thus, the policy
enables the users to be cognizant and responsible for the payment on the energy
consumption of the office space they are sharing, and at the same time ensures
that the users are satisfied either via thermal comfort or through incentives.
The policy is also shown to be beneficial for building management. Through
experiment based case studies, we show the effectiveness of the proposed
policy.Comment: Journal paper accepted in Energy & Buildings (Elsevier
- …