1,386 research outputs found
An introduction to crowdsourcing for language and multimedia technology research
Language and multimedia technology research often relies on
large manually constructed datasets for training or evaluation of algorithms and systems. Constructing these datasets is often expensive with significant challenges in terms of recruitment of personnel to carry out the work. Crowdsourcing methods using scalable pools of workers available on-demand offers a flexible means of rapid low-cost construction of many of these datasets to support existing research requirements and potentially promote new research initiatives that would otherwise not be possible
Video summarization by group scoring
In this paper a new model for user-centered video summarization is presented. Involvement of more than one expert in generating the final video summary should be regarded as the main use case for this algorithm. This approach consists of three major steps. First, the video frames are scored by a group of operators. Next, these assigned scores are averaged to produce a singular value for each frame and lastly, the highest scored video frames alongside the corresponding audio and textual contents are extracted to be inserted into the summary. The effectiveness of this approach has been evaluated by comparing the video summaries generated by this system against the results from a number of automatic summarization tools that use different modalities for abstraction
Collaborative Summarization of Topic-Related Videos
Large collections of videos are grouped into clusters by a topic keyword,
such as Eiffel Tower or Surfing, with many important visual concepts repeating
across them. Such a topically close set of videos have mutual influence on each
other, which could be used to summarize one of them by exploiting information
from others in the set. We build on this intuition to develop a novel approach
to extract a summary that simultaneously captures both important
particularities arising in the given video, as well as, generalities identified
from the set of videos. The topic-related videos provide visual context to
identify the important parts of the video being summarized. We achieve this by
developing a collaborative sparse optimization method which can be efficiently
solved by a half-quadratic minimization algorithm. Our work builds upon the
idea of collaborative techniques from information retrieval and natural
language processing, which typically use the attributes of other similar
objects to predict the attribute of a given object. Experiments on two
challenging and diverse datasets well demonstrate the efficacy of our approach
over state-of-the-art methods.Comment: CVPR 201
Summarization of Films and Documentaries Based on Subtitles and Scripts
We assess the performance of generic text summarization algorithms applied to
films and documentaries, using the well-known behavior of summarization of news
articles as reference. We use three datasets: (i) news articles, (ii) film
scripts and subtitles, and (iii) documentary subtitles. Standard ROUGE metrics
are used for comparing generated summaries against news abstracts, plot
summaries, and synopses. We show that the best performing algorithms are LSA,
for news articles and documentaries, and LexRank and Support Sets, for films.
Despite the different nature of films and documentaries, their relative
behavior is in accordance with that obtained for news articles.Comment: 7 pages, 9 tables, 4 figures, submitted to Pattern Recognition
Letters (Elsevier
Balancing Relevance and Diversity in Online Bipartite Matching via Submodularity
In bipartite matching problems, vertices on one side of a bipartite graph are
paired with those on the other. In its online variant, one side of the graph is
available offline, while the vertices on the other side arrive online. When a
vertex arrives, an irrevocable and immediate decision should be made by the
algorithm; either match it to an available vertex or drop it. Examples of such
problems include matching workers to firms, advertisers to keywords, organs to
patients, and so on. Much of the literature focuses on maximizing the total
relevance---modeled via total weight---of the matching. However, in many
real-world problems, it is also important to consider contributions of
diversity: hiring a diverse pool of candidates, displaying a relevant but
diverse set of ads, and so on. In this paper, we propose the Online Submodular
Bipartite Matching (\osbm) problem, where the goal is to maximize a submodular
function over the set of matched edges. This objective is general enough to
capture the notion of both diversity (\emph{e.g.,} a weighted coverage
function) and relevance (\emph{e.g.,} the traditional linear function)---as
well as many other natural objective functions occurring in practice
(\emph{e.g.,} limited total budget in advertising settings). We propose novel
algorithms that have provable guarantees and are essentially optimal when
restricted to various special cases. We also run experiments on real-world and
synthetic datasets to validate our algorithms.Comment: To appear in AAAI 201
- …