Search CORE

86,434 research outputs found

Random sampling with a reservoir

Author: ERNVALL J.
FAN C. T.
FELLER W.
FELLER W.
Jeffrey S. Vitter
KNUTH D.E.
VITTER J.S.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Estimation of fish production from Hirakud reservoir

Author: George V.C.
Kesavan Nair A.K.
Khan A.A.
Varghese M.D.
Publication venue
Publication date: 01/01/1981
Field of study

A suitable procedure based broadly on stratified random sampling for estimation of fish production from Hirakud reservoir is described. The total fish production for the years 1978 and 1979 from Hirakud reservoir, along with seasonal variation of different species is discussed

Aquatic Commons

Stream Aggregation Through Order Sampling

Author: Ahmed Nesreen
Duffield Nick
Xia Liangzhen
Xu Yunhong
Yu Minlan
Publication venue
Publication date: 01/11/2017
Field of study

This is paper introduces a new single-pass reservoir weighted-sampling stream aggregation algorithm, Priority-Based Aggregation (PBA). While order sampling is a powerful and e cient method for weighted sampling from a stream of uniquely keyed items, there is no current algorithm that realizes the benefits of order sampling in the context of stream aggregation over non-unique keys. A naive approach to order sample regardless of key then aggregate the results is hopelessly inefficient. In distinction, our proposed algorithm uses a single persistent random variable across the lifetime of each key in the cache, and maintains unbiased estimates of the key aggregates that can be queried at any point in the stream. The basic approach can be supplemented with a Sample and Hold pre-sampling stage with a sampling rate adaptation controlled by PBA. This approach represents a considerable reduction in computational complexity compared with the state of the art in adapting Sample and Hold to operate with a fixed cache size. Concerning statistical properties, we prove that PBA provides unbiased estimates of the true aggregates. We analyze the computational complexity of PBA and its variants, and provide a detailed evaluation of its accuracy on synthetic and trace data. Weighted relative error is reduced by 40% to 65% at sampling rates of 5% to 17%, relative to Adaptive Sample and Hold; there is also substantial improvement for rank queriesComment: 10 page

arXiv.org e-Print Archive

Crossref

Equilibrium molecular thermodynamics from Kirkwood sampling.

We present two methods for barrierless equilibrium sampling of molecular systems based on the recently proposed Kirkwood method (J. Chem. Phys. 2009, 130, 134102). Kirkwood sampling employs low-order correlations among internal coordinates of a molecule for random (or non-Markovian) sampling of the high dimensional conformational space. This is a geometrical sampling method independent of the potential energy surface. The first method is a variant of biased Monte Carlo, where Kirkwood sampling is used for generating trial Monte Carlo moves. Using this method, equilibrium distributions corresponding to different temperatures and potential energy functions can be generated from a given set of low-order correlations. Since Kirkwood samples are generated independently, this method is ideally suited for massively parallel distributed computing. The second approach is a variant of reservoir replica exchange, where Kirkwood sampling is used to construct a reservoir of conformations, which exchanges conformations with the replicas performing equilibrium sampling corresponding to different thermodynamic states. Coupling with the Kirkwood reservoir enhances sampling by facilitating global jumps in the conformational space. The efficiency of both methods depends on the overlap of the Kirkwood distribution with the target equilibrium distribution. We present proof-of-concept results for a model nine-atom linear molecule and alanine dipeptide.This research was funded by the European Research Council and EPSRC grant EP/I001352/1. Y.O. was supported, in part, by the JSPS Grant-in-Aid for Scientific Research on Innovative Areas (“Dynamical Ordering and Integrated Functions”).This is the final published version. It first appeared at http://pubs.acs.org/doi/abs/10.1021/acs.jpcb.5b01800

Crossref

PubMed Central

Apollo (Cambridge)

Sampling design may obscure species–area relationships in landscape-scale field studies

Author: Bueno Anderson Saldanha
Kaefer Igor L.
Masseli Gabriel S.
Peres Carlos A.
Publication venue: 'Wiley'
Publication date: 01/01/2020
Field of study

We investigated 1) the role of area per se in explaining anuran species richness on reservoir forest islands, after controlling for several confounding factors. We also assessed 2) how sampling design affects the inferential power of island species–area relationships (ISARs) aiming to 3) provide guidelines to yield reliable estimates of area-induced species losses in patchy systems. We surveyed anurans with autonomous recording units at 151 plots located on 74 islands and four continuous forest sites at the Balbina Hydroelectric Reservoir landscape, central Brazilian Amazonia. We applied semi-log ISAR models to assess the effect of sampling design on the fit and slope of species–area curves. To do so, we subsampled our surveyed islands following both a 1) stratified and 2) non-stratified random selection of 5, 10, 15, 20 and 25 islands covering 1) the full range in island size (0.45–1699 ha) and 2) only islands smaller than 100 ha, respectively. We also compiled 25 datasets from the literature to assess the generality of our findings. Island size explained ca half of the variation in species richness. The fit and slope of species–area curves were affected mainly by the range in island size considered, and to a very small extent by the number of islands surveyed. In our literature review, all datasets covering a range of patch sizes larger than 300 ha yielded a positive ISAR, whereas the number of patches alone did not affect the detection of ISARs. We conclude that 1) area per se plays a major role in explaining anuran species richness on forest islands within an Amazonian anthropogenic archipelago; 2) the inferential power of island species–area relationships is severely degraded by sub-optimal sampling designs; 3) at least 10 habitat patches spanning three orders of magnitude in size should be surveyed to yield reliable species–area estimates in patchy systems

University of East Anglia digital repository