26,074 research outputs found
Recommended from our members
A words-of-interest model of sketch representation for image retrieval
In this paper we propose a method for sketch-based image retrieval. Sketch is a magical medium which is capable of conveying semantic messages for user. It’s in accordance with user’s cognitive psychology to retrieve images with sketch. In order to narrow down the semantic gap between the user and the images in database, we preprocess all the images into sketches by the coherent line drawing algorithm. During the process of sketches extraction, saliency maps are used to filter out the redundant background information, while preserve the important semantic information. We use a variant of Words-of-Interest model to retrieve relevant images for the user according to the query. Words-of-Interest (WoI) model is based on Bag-ofvisual Words (BoW) model, which has been proven successfully for information retrieval. Bag-of-Words ignores the spatial relationships among visual words, which are important for sketch representation. Our method takes advantage of the spatial information of the query to select words of interest. Experimental results demonstrate that our sketch-based retrieval method achieves a good tradeoff between retrieval accuracy and semantic representation of users’ query
Building automated vandalism detection tools for Wikidata
Wikidata, like Wikipedia, is a knowledge base that anyone can edit. This open
collaboration model is powerful in that it reduces barriers to participation
and allows a large number of people to contribute. However, it exposes the
knowledge base to the risk of vandalism and low-quality contributions. In this
work, we build on past work detecting vandalism in Wikipedia to detect
vandalism in Wikidata. This work is novel in that identifying damaging changes
in a structured knowledge-base requires substantially different feature
engineering work than in a text-based wiki like Wikipedia. We also discuss the
utility of these classifiers for reducing the overall workload of vandalism
patrollers in Wikidata. We describe a machine classification strategy that is
able to catch 89% of vandalism while reducing patrollers' workload by 98%, by
drawing lightly from contextual features of an edit and heavily from the
characteristics of the user making the edit
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images
We address the problem of fine-grained action localization from temporally
untrimmed web videos. We assume that only weak video-level annotations are
available for training. The goal is to use these weak labels to identify
temporal segments corresponding to the actions, and learn models that
generalize to unconstrained web videos. We find that web images queried by
action names serve as well-localized highlights for many actions, but are
noisily labeled. To solve this problem, we propose a simple yet effective
method that takes weak video labels and noisy image labels as input, and
generates localized action frames as output. This is achieved by cross-domain
transfer between video frames and web images, using pre-trained deep
convolutional neural networks. We then use the localized action frames to train
action recognition models with long short-term memory networks. We collect a
fine-grained sports action data set FGA-240 of more than 130,000 YouTube
videos. It has 240 fine-grained actions under 85 sports activities. Convincing
results are shown on the FGA-240 data set, as well as the THUMOS 2014
localization data set with untrimmed training videos.Comment: Camera ready version for ACM Multimedia 201
Inversion using a new low-dimensional representation of complex binary geological media based on a deep neural network
Efficient and high-fidelity prior sampling and inversion for complex
geological media is still a largely unsolved challenge. Here, we use a deep
neural network of the variational autoencoder type to construct a parametric
low-dimensional base model parameterization of complex binary geological media.
For inversion purposes, it has the attractive feature that random draws from an
uncorrelated standard normal distribution yield model realizations with spatial
characteristics that are in agreement with the training set. In comparison with
the most commonly used parametric representations in probabilistic inversion,
we find that our dimensionality reduction (DR) approach outperforms principle
component analysis (PCA), optimization-PCA (OPCA) and discrete cosine transform
(DCT) DR techniques for unconditional geostatistical simulation of a
channelized prior model. For the considered examples, important compression
ratios (200 - 500) are achieved. Given that the construction of our
parameterization requires a training set of several tens of thousands of prior
model realizations, our DR approach is more suited for probabilistic (or
deterministic) inversion than for unconditional (or point-conditioned)
geostatistical simulation. Probabilistic inversions of 2D steady-state and 3D
transient hydraulic tomography data are used to demonstrate the DR-based
inversion. For the 2D case study, the performance is superior compared to
current state-of-the-art multiple-point statistics inversion by sequential
geostatistical resampling (SGR). Inversion results for the 3D application are
also encouraging
- …