17 research outputs found

    The BioKET Biodiversity Data Warehouse: Data and Knowledge Integration and Extraction

    Get PDF
    International audienceBiodiversity datasets are generally stored in different for-mats. This makes it difficult for biologists to combine and integrate them to retrieve useful information for the purpose of, for example, efficiently classify specimens. In this paper, we present BioKET, a data warehouse which is a consolidation of heterogeneous data sources stored in different formats. For the time being, the scopus of BioKET is botanical. We had, among others things, to list all the existing botanical ontologies and re-late terms in BioKET with terms in these ontologies. We demonstrate the usefulness of such a resource by applying FIST, a combined biclus-tering and conceptual association rule extraction method on a dataset extracted from BioKET to analyze the risk status of plants endemic to Laos. Besides, BioKET may be interfaced with other resources, like GeoCAT, to provide a powerful analysis tool for biodiversity data

    Imperfect detection and its consequences for monitoring for conservation

    Full text link
    Biodiversity monitoring is important to identify conservation needs and test the efficacy of management actions. Variants of “abundance” (N) are among the most widely monitored quantities, e.g., (true) abundance, number of occupied sites (distribution, occupancy) or species richness.We propose a sampling-based view of monitoring that clearly acknowledges two sampling processes involved when monitoring N. First, measurements from the surveyed sample area are generalized to a larger area, hence the importance of a probability sample. Second, even within sampled areas only a sample of units (individuals, occupied sites, species) is counted owing to imperfect detectability p. If p < 1, counts are random variables and their expectation E(n) is related to N via the relationship E(n) ÿ*p. Whenever p < 1, counts vary even under identical conditions and underestimate N, and patterns in counts confound patterns in N with those in p. In addition, part of the population N may be unavailable for detection, e.g., temporarily outside the sampled quadrat, underground or for another reason not exposed to sampling; hence a more general way of describing a count is E(n) ÿ*a*p, where a is availability probability and p detection, given availability. We give two examples of monitoring schemes that highlight the importance of explicitly accounting for availability and detectability. In the Swiss reptile Red List update, the widespread and abundant slow worm (Anguis fragilis) was recorded in only 22.1% of all sampled quadrats. Only an analysis that accounted for both availability and detectability gave realistic estimates of the species’ distribution. Among 128 bird species monitored in the Swiss breeding bird survey, de tection in occupied 1 km quadrats averaged only 64% and varied tremendously by species (3–99 %); hence observed distributions greatly underestimated range sizes and should not be compared among species.We believe that monitoring design and analyses should properly account for these two sampling processes to enable valid inferences about biodiversity. We argue for a more rigorous approach to both monitoring design and analysis to obtain the best possible information about the state of nature. An explicit recognition of, and proper accounting for, the two sampling processes involved in most monitoring programs will go a long way towards this goa
    corecore