198 research outputs found

    Autotag: A tool for creating structured document collections from printed materials

    Today's optical character recognition (OCR) devices ordinarily are not capable of delimiting or marking up specific structural information about a document, such as its title, its authors, and the titles of its sections. Such information appears in the OCR device output, but a human would have to go through the output to locate it. This type of information is highly useful for information retrieval (IR), allowing users much more flexibility in querying a retrieval system. This thesis will describe the design, implementation, and evaluation of a software system called Autotag. This system will automatically mark up structural information in OCR-generated text. It will also establish a mapping between objects in page images and their corresponding ASCII representation. This mapping can then be used to design flexible image-based interfaces for information retrieval applications.

    Protracted speciation revitalizes the neutral theory of biodiversity.

    Understanding the maintenance and origin of biodiversity is a formidable task, yet many ubiquitous ecological patterns are predicted by a surprisingly simple and widely studied neutral model that ignores functional differences between species. However, this model assumes that new species arise instantaneously as singletons and consequently makes unrealistic predictions about species lifetimes, speciation rates and the number of rare species. Here, we resolve these anomalies - without compromising any of the original model's existing achievements and while retaining computational and analytical tractability - by modelling speciation as a gradual, protracted process rather than an instantaneous event. Our model also makes new predictions about the diversity of incipient species and rare species in the metacommunity. We show that it is both necessary and straightforward to incorporate protracted speciation in future studies of neutral models, and argue that non-neutral models should also model speciation as a gradual process rather than an instantaneous one.

    New North American mammals

    p. 347-350 ; 24 cm

    Annual Rainfall and Seasonality Predict Pan-tropical Patterns of Liana Density and Basal Area

    We test the hypotheses proposed by Gentry and Schnitzer that liana density and basal area in tropical forests vary negatively with mean annual precipitation (MAP) and positively with seasonality. Previous studies correlating liana abundance with these climatic variables have produced conflicting results, warranting a new analysis of drivers of liana abundance based on a different dataset. We compiled a pan-tropical dataset containing 28,953 lianas (≥2.5 cm diam.) from studies conducted at 13 Neotropical and 11 Paleotropical dry to wet lowland tropical forests. The ranges in MAP and dry season length (DSL) (number of months with mean rainfall <100 mm) represented by these datasets were 860–7250 mm/yr and 0–7 mo, respectively. Pan-tropically, liana density and basal area decreased significantly with increasing annual rainfall and increased with increasing DSL, supporting the hypotheses of Gentry and Schnitzer. Our results suggest that much of the variation in liana density and basal area in the tropics can be accounted for by the relatively simple metrics of MAP and DSL. Abstract in Spanish is available at http://www.blackwell-synergy.com/loi/btp. Peer Reviewed. http://deepblue.lib.umich.edu/bitstream/2027.42/78581/1/j.1744-7429.2009.00589.x.pd

    Polymorphism Data Can Reveal the Origin of Species Abundance Statistics

    What is the underlying mechanism behind the fat-tailed statistics observed for species abundance distributions? The two main hypotheses in the field are the adaptive (niche) theories, in which a species' abundance reflects its fitness, and the neutral theory, which assumes demographic stochasticity is the main factor determining community structure. Both explanations suggest quite similar species-abundance distributions, but very different histories: niche scenarios assume that a species' population in the past was similar to the observed one, while neutral scenarios are characterized by strongly fluctuating populations. Since the genetic variation within a population depends on its abundance in the past, we present here a way to discriminate between the theories using the genetic diversity of noncoding DNA. A statistical test, based on the Fu-Li method, has been developed and enables such a differentiation. We have analyzed the results gathered from individual-based simulations of both types of histories and obtained a clear distinction between the Fu-Li statistics of the neutral scenario and those of the niche scenario. Our results suggest that data for 10–50 species, with approximately 30 sequenced individuals for each species, may allow one to distinguish between these two theories.