SAVASA project @ TRECVID 2012: interactive surveillance event detection
In this paper we describe our participation in the interactive surveillance event detection task at TRECVid 2012. The system we developed comprised individual classifiers brought together behind a simple video search interface that enabled users to select relevant segments based on down-sampled animated GIFs. Two types of user -- 'experts' and 'end users' -- performed the evaluations. Due to time constraints we focussed on three events -- ObjectPut, PersonRuns and Pointing -- and two of the five available cameras (1 and 3). Results from the interactive runs, as well as a discussion of the performance of the underlying retrospective classifiers, are presented.
Prosody-Based Automatic Segmentation of Speech into Sentences and Topics
A crucial step in processing speech audio data for information extraction,
topic detection, or browsing/playback is to segment the input into sentence and
topic units. Speech segmentation is challenging, since the cues typically
present for segmenting text (headers, paragraphs, punctuation) are absent in
spoken language. We investigate the use of prosody (information gleaned from
the timing and melody of speech) for these tasks. Using decision tree and
hidden Markov modeling techniques, we combine prosodic cues with word-based
approaches, and evaluate performance on two speech corpora, Broadcast News and
Switchboard. Results show that the prosodic model alone performs on par with,
or better than, word-based statistical language models -- for both true and
automatically recognized words in news speech. The prosodic model achieves
comparable performance with significantly less training data, and requires no
hand-labeling of prosodic events. Across tasks and corpora, we obtain a
significant improvement over word-only models using a probabilistic combination
of prosodic and lexical information. Inspection reveals that the prosodic
models capture language-independent boundary indicators described in the
literature. Finally, cue usage is task and corpus dependent. For example, pause
and pitch features are highly informative for segmenting news speech, whereas
pause, duration and word-based cues dominate for natural conversation.
Comment: 30 pages, 9 figures. To appear in Speech Communication 32(1-2),
Special Issue on Accessing Information in Spoken Audio, September 200
Nonequilibrium entropic bounds for Darwinian replicators
Life evolved on our planet by means of a combination of Darwinian selection
and innovations leading to higher levels of complexity. The emergence and
selection of replicating entities is a central problem in prebiotic evolution.
Theoretical models have shown how populations of different types of replicating
entities exclude or coexist with other classes of replicators. Models are
typically kinetic, based on standard replicator equations. On the other hand,
the presence of thermodynamic constraints for these systems remains an open
question. This is largely due to the lack of a general theory of
statistical methods for systems far from equilibrium. Nonetheless, a first
approach to this problem has been put forward in a series of novel
developments in non-equilibrium physics, under the rubric of the extended
second law of thermodynamics. The work presented here is twofold: firstly, we
review this theoretical framework and provide a brief description of the three
fundamental replicator types in prebiotic evolution: parabolic, Malthusian and
hyperbolic. Secondly, we employ these techniques to explore
how replicators are constrained by thermodynamics.
Comment: 12 Pages, 5 Figures
Climate Change and Sea Level Rise Projections for Boston
While the broad outlines of how climate change would impact Boston have been known for some time, it is only recently that we have developed a more definitive understanding of what lies ahead. That understanding was advanced considerably with the publication of Climate Change and Sea Level Rise Projections for Boston by the Boston Research Advisory Group (BRAG). The BRAG report is the first major product of "Climate Ready Boston," a project led by the City of Boston in partnership with the Green Ribbon Commission and funded in part by the Barr Foundation. The BRAG team includes 20 leading experts from the region's major universities on subjects ranging from sea level rise to temperature extremes. University of Massachusetts Boston professors Ellen Douglas and Paul Kirshen headed the research. The BRAG report validates earlier studies, concluding Boston will get hotter, wetter, and saltier in the decades ahead (see figures below). But the group has produced a much more definitive set of projections than existed previously, especially for the problem of sea level rise. BRAG also concluded that some of the effects of climate change will come sooner than expected, accelerating the urgency of planning and action.