1,267 research outputs found

    Efficient estimation of AUC in a sliding window

    Full text link
    In many applications, monitoring area under the ROC curve (AUC) in a sliding window over a data stream is a natural way of detecting changes in the system. The drawback is that computing AUC in a sliding window is expensive, especially if the window size is large and the data flow is significant. In this paper we propose a scheme for maintaining an approximate AUC in a sliding window of length kk. More specifically, we propose an algorithm that, given ϵ\epsilon, estimates AUC within ϵ/2\epsilon / 2, and can maintain this estimate in O((logk)/ϵ)O((\log k) / \epsilon) time, per update, as the window slides. This provides a speed-up over the exact computation of AUC, which requires O(k)O(k) time, per update. The speed-up becomes more significant as the size of the window increases. Our estimate is based on grouping the data points together, and using these groups to calculate AUC. The grouping is designed carefully such that (ii) the groups are small enough, so that the error stays small, (iiii) the number of groups is small, so that enumerating them is not expensive, and (iiiiii) the definition is flexible enough so that we can maintain the groups efficiently. Our experimental evaluation demonstrates that the average approximation error in practice is much smaller than the approximation guarantee ϵ/2\epsilon / 2, and that we can achieve significant speed-ups with only a modest sacrifice in accuracy

    StarBEAST2 Brings Faster Species Tree Inference and Accurate Estimates of Substitution Rates

    Get PDF
    Fully Bayesian multispecies coalescent (MSC) methods like *BEAST estimate species trees from multiple sequence alignments. Today thousands of genes can be sequenced for a given study, but using that many genes with *BEAST is intractably slow. An alternative is to use heuristic methods which compromise accuracy or completeness in return for speed. A common heuristic is concatenation, which assumes that the evolutionary history of each gene tree is identical to the species tree. This is an inconsistent estimator of species tree topology, a worse estimator of divergence times, and induces spurious substitution rate variation when incomplete lineage sorting is present. Another class of heuristics directly motivated by the MSC avoids many of the pitfalls of concatenation but cannot be used to estimate divergence times. To enable fuller use of available data and more accurate inference of species tree topologies, divergence times, and substitution rates, we have developed a new version of *BEAST called StarBEAST2. To improve convergence rates we add analytical integration of population sizes, novel MCMC operators and other optimizations. Computational performance improved by 13.5× and 13.8× respectively when analyzing two empirical data sets, and an average of 33.1× across 30 simulated data sets. To enable accurate estimates of per-species substitution rates, we introduce species tree relaxed clocks, and show that StarBEAST2 is a more powerful and robust estimator of rate variation than concatenation. StarBEAST2 is available through the BEAUTi package manager in BEAST 2.4 and above.This work was supported by a Rutherford Discovery Fellowship awarded to A.J.D. by the Royal Society of New Zealand. H.A.O. was supported by an Australian Laureate Fellowship awarded to Craig Moritz by the Australian Research Council (FL110100104)

    Information sharing and credit : firm-level evidence from transition countries

    Get PDF
    We investigate whether information sharing among banks has affected credit market performance in the transition countries of Eastern Europe and the former Soviet Union, using a large sample of firm-level data. Our estimates show that information sharing is associated with improved availability and lower cost of credit to firms. This correlation is stronger for opaque firms than transparent ones and stronger in countries with weak legal environments than in those with strong legal environments. In cross-sectional estimates, we control for variation in country-level aggregate variables that may affect credit, by examining the differential impact of information sharing across firm types. In panel estimates, we also control for the presence of unobserved heterogeneity at the firm level, as well as for changes in macroeconomic variables and the legal environment

    Trust in the public sector: Is there any evidence for a long-term decline?

    Get PDF
    Concerns with declining public trust in government have become a permanent element of the contemporary political discourse. This concern also extends to levels of citizens’ trust in the public administration and public services. Trust is said to be declining, and this decline is generally seen as detrimental to public service delivery. In this article, we examine the main elements in this discussion, review the existing international survey data and summarise the main findings for OECD countries. Citizens’ trust in the public sector is found to fluctuate, and the data generally do not show consistently declining levels of trust. Furthermore, in some countries there simply is insufficient data to come to any conclusions at all about time trends in citizen trust in the public sector. Points for practitioners This article summarises some of the survey material on citizens’ trust in the public administration. It allows practitioners to compare trends in public trust in their country across time and space. The findings lead us to reject the hypothesis of a universal decline of trust in the public sector. The article warns against using opinion poll results without considering context. The long-term and comparative perspective on citizens’ trust in the public sector is all too often absent from the policy discourse that is frequently based on assumptions and ad-hoc approaches

    What is progressive business?

    Get PDF
    This introductory chapter signposts the rationale, framework and case study contents of the book. Firstly, we offer an overview of the need for new more progressive business models than the mainstream which exists at present, identifying the current challenges facing business in Europe and beyond in its international ramifications. To remedy these challenges, we present our alternative vision of progressive business functioning, whose basic criteria comprise ecological sustainability, respect for future generations, and pro-socialness. Then, synopses of our case examples follow

    Postglacial expansion of the arctic keystone copepod calanus glacialis

    Get PDF
    Calanus glacialis, a major contributor to zooplankton biomass in the Arctic shelf seas, is a key link between primary production and higher trophic levels that may be sensitive to climate warming. The aim of this study was to explore genetic variation in contemporary populations of this species to infer possible changes during the Quaternary period, and to assess its population structure in both space and time. Calanus glacialis was sampled in the fjords of Spitsbergen (Hornsund and Kongsfjorden) in 2003, 2004, 2006, 2009 and 2012. The sequence of a mitochondrial marker, belonging to the ND5 gene, selected for the study was 1249 base pairs long and distinguished 75 unique haplotypes among 140 individuals that formed three main clades. There was no detectable pattern in the distribution of haplotypes by geographic distance or over time. Interestingly, a Bayesian skyline plot suggested that a 1000-fold increase in population size occurred approximately 10,000 years before present, suggesting a species expansion after the Last Glacial Maximum.GAME from the National Science Centre, the Polish Ministry of Science and Higher Education Iuventus Plus [IP2014 050573]; FCT-PT [CCMAR/Multi/04326/2013]; [2011/03/B/NZ8/02876
    corecore