186 research outputs found
Cross-View Image Matching for Geo-localization in Urban Environments
In this paper, we address the problem of cross-view image geo-localization.
Specifically, we aim to estimate the GPS location of a query street view image
by finding the matching images in a reference database of geo-tagged bird's eye
view images, or vice versa. To this end, we present a new framework for
cross-view image geo-localization by taking advantage of the tremendous success
of deep convolutional neural networks (CNNs) in image classification and object
detection. First, we employ the Faster R-CNN to detect buildings in the query
and reference images. Next, for each building in the query image, we retrieve
the nearest neighbors from the reference buildings using a Siamese network
trained on both positive matching image pairs and negative pairs. To find the
correct NN for each query building, we develop an efficient multiple nearest
neighbors matching method based on dominant sets. We evaluate the proposed
framework on a new dataset that consists of pairs of street view and bird's eye
view images. Experimental results show that the proposed method achieves better
geo-localization accuracy than other approaches and is able to generalize to
images at unseen locations
Towards Highly Accurate and Stable Face Alignment for High-Resolution Videos
In recent years, heatmap regression based models have shown their
effectiveness in face alignment and pose estimation. However, Conventional
Heatmap Regression (CHR) is not accurate nor stable when dealing with
high-resolution facial videos, since it finds the maximum activated location in
heatmaps which are generated from rounding coordinates, and thus leads to
quantization errors when scaling back to the original high-resolution space. In
this paper, we propose a Fractional Heatmap Regression (FHR) for
high-resolution video-based face alignment. The proposed FHR can accurately
estimate the fractional part according to the 2D Gaussian function by sampling
three points in heatmaps. To further stabilize the landmarks among continuous
video frames while maintaining the precise at the same time, we propose a novel
stabilization loss that contains two terms to address time delay and non-smooth
issues, respectively. Experiments on 300W, 300-VW and Talking Face datasets
clearly demonstrate that the proposed method is more accurate and stable than
the state-of-the-art models.Comment: Accepted to AAAI 2019. 8 pages, 7 figure
Attention Is Not the Only Choice: Counterfactual Reasoning for Path-Based Explainable Recommendation
Compared with only pursuing recommendation accuracy, the explainability of a
recommendation model has drawn more attention in recent years. Many graph-based
recommendations resort to informative paths with the attention mechanism for
the explanation. Unfortunately, these attention weights are intentionally
designed for model accuracy but not explainability. Recently, some researchers
have started to question attention-based explainability because the attention
weights are unstable for different reproductions, and they may not always align
with human intuition. Inspired by the counterfactual reasoning from causality
learning theory, we propose a novel explainable framework targeting path-based
recommendations, wherein the explainable weights of paths are learned to
replace attention weights. Specifically, we design two counterfactual reasoning
algorithms from both path representation and path topological structure
perspectives. Moreover, unlike traditional case studies, we also propose a
package of explainability evaluation solutions with both qualitative and
quantitative methods. We conduct extensive experiments on three real-world
datasets, the results of which further demonstrate the effectiveness and
reliability of our method.Comment: accepted by TKD
A novel gaussian particle swarms optimized particle filter algorithm for the state of charge estimation of lithium-ion batteries.
A gaussian particle swarm optimized particle filter estimation method, along with the second-order resistance-capacitance model, is proposed for the state of charge estimation of lithium-ion battery in electric vehicles. Based on the particle filter method, it exploits the strong optimality-seeking ability of the particle swarm algorithm, suppressing algorithm degradation and particle impoverishment by improving the importance distribution. This method also introduces normally distributed decay inertia weights to enhance the global search capability of the particle swarm optimization algorithm, which improves the convergence of this estimation method. As can be known from the experimental results that the proposed method has stronger robustness and higher filter efficiency with the estimation error steadily maintained within 0.89% in the constant current discharge experiment. This method is insensitive to the initial amount and distribution of particles, achieving adaptive and stable tracking in the state of charge for lithium-ion batteries
- …