Search CORE

42 research outputs found

Cross-View Image Matching for Geo-localization in Urban Environments

Author: Chen Chen
Shah Mubarak
Tian Yicong
Publication venue
Publication date: 22/03/2017
Field of study

k

nearest neighbors from the reference buildings using a Siamese network trained on both positive matching image pairs and negative pairs. To find the correct NN for each query building, we develop an efficient multiple nearest neighbors matching method based on dominant sets. We evaluate the proposed framework on a new dataset that consists of pairs of street view and bird's eye view images. Experimental results show that the proposed method achieves better geo-localization accuracy than other approaches and is able to generalize to images at unseen locations

arXiv.org e-Print Archive

Crossref

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Human Action Detection, Tracking and Segmentation in Videos

Author: Tian Yicong
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 01/01/2018
Field of study

This dissertation addresses the problem of human action detection, human tracking and segmentation in videos. They are fundamental tasks in computer vision and are extremely challenging to solve in realistic videos. We first propose a novel approach for action detection by exploring the generalization of deformable part models from 2D images to 3D spatiotemporal volumes. By focusing on the most distinctive parts of each action, our models adapt to intra-class variation and show robustness to clutter. This approach deals with detecting action performed by a single person. When there are multiple humans in the scene, humans need to be segmented and tracked from frame to frame before action recognition can be performed. Next, we propose a novel approach for multiple object tracking (MOT) by formulating detection and data association in one framework. Our method allows us to overcome the confinements of data association based MOT approaches, where the performance is dependent on the object detection results provided at input level. We show that automatically detecting and tracking targets in a single framework can help resolve the ambiguities due to frequent occlusion and heavy articulation of targets. In this tracker, targets are represented by bounding boxes, which is a coarse representation. However, pixel-wise object segmentation provides fine level information, which is desirable for later tasks. Finally, we propose a tracker that simultaneously solves three main problems: detection, data association and segmentation. This is especially important because the output of each of those three problems are highly correlated and the solution of one can greatly help improve the others. The proposed approach achieves more accurate segmentation results and also helps better resolve typical difficulties in multiple target tracking, such as occlusion, ID-switch and track drifting

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Adaptive Image Transcoding

Author: Li Xia
Milanfar Peyman
Tian Yicong
Wang Jingbin
Yang Feng
Publication venue: Technical Disclosure Commons
Publication date: 30/03/2020
Field of study

Images comprise a substantial portion of network traffic. Reducing the filesize of an image while preserving quality can reduce storage costs and bandwidth usage, as well as reduce latency. This disclosure describes techniques for adaptive transcoding of an image into an appropriate target file format such that file size is reduced while preserving image quality. An input image is processed to remove compression artifacts. An image quality metric is obtained and an image codec is selected. The image is encoded using the selected codec

Technical Disclosure Common

MiR-214 increases the sensitivity of breast cancer cells to tamoxifen and fulvestrant through inhibition of autophagy

Author: A Derfoul
A Jemal
A Nicolini
Aiping Luo
AJ Kowaltowski
AM Cheng
B Levine
B Levine
C Fleury
C He
E Ciruelos
E Dalla Pozza
E Tokunaga
EC Lai
F Cardoso
F Wang
FM Santandreu
G Mattiasson
GX Zhao
H Schwarzenbach
JL Schwartz-Roberts
JS Samaddar
KJ Livak
KL Cook
L Duan
LB Frankel
LB Frankel
M Lippman
M Zhao
MA Qadir
N Chen
P Collins
PF Bross
PT Schumacker
R Clarke
R Garcia-Becerra
S Kongara
S Mandlekar
S Nepal
Shuqing Wang
T Sharma
V Ambros
W Bursch
Wenna Shi
X Rao
X Yu
Xianjun Qu
Xinfeng Yu
Y Chen
Y Tian
Y Vergne
Y Zhang
Ye Li
Yicong Liu
YS Wang
Z Derdak
Zhihua Liu
ZY Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Cross-View Image Matching For Geo-Localization In Urban Environments

Author: Chen Chen
Shah Mubarak
Tian Yicong
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 06/11/2017
Field of study

In this paper, we address the problem of cross-view image geo-localization. Specifically, we aim to estimate the GPS location of a query street view image by finding the matching images in a reference database of geotagged bird\u27s eye view images, or vice versa. To this end, we present a new framework for cross-view image geolocalization by taking advantage of the tremendous success of deep convolutional neural networks (CNNs) in image classification and object detection. First, we employ the Faster R-CNN [16] to detect buildings in the query and reference images. Next, for each building in the query image, we retrieve the k nearest neighbors from the reference buildings using a Siamese network trained on both positive matching image pairs and negative pairs. To find the correct NN for each query building, we develop an efficient multiple nearest neighbors matching method based on dominant sets. We evaluate the proposed framework on a new dataset that consists of pairs of street view and bird\u27s eye view images. Experimental results show that the proposed method achieves better geo-localization accuracy than other approaches and is able to generalize to images at unseen locations

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Spatiotemporal Deformable Part Models For Action Detection

Author: Shah Mubarak
Sukthankar Rahul
Tian Yicong
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 15/11/2013
Field of study

Deformable part models have achieved impressive performance for object detection, even on difficult image datasets. This paper explores the generalization of deformable part models from 2D images to 3D spatiotemporal volumes to better study their effectiveness for action detection in video. Actions are treated as spatiotemporal patterns and a deformable part model is generated for each action from a collection of examples. For each action model, the most discriminative 3D sub volumes are automatically selected as parts and the spatiotemporal relations between their locations are learned. By focusing on the most distinctive parts of each action, our models adapt to intra-class variation and show robustness to clutter. Extensive experiments on several video datasets demonstrate the strength of spatiotemporal DPMs for classifying and localizing actions. © 2013 IEEE

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

On Detection, Data Association And Segmentation For Multi-Target Tracking

Author: Dehghan Afshin
Shah Mubarak
Tian Yicong
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 20/06/2018
Field of study

In this work, we propose a tracker that differs from most existing multi-target trackers in two major ways. Firstly, our tracker does not rely on a pre-trained object detector to get the initial object hypotheses. Secondly, our tracker\u27s final output is the fine contours of the targets rather than traditional bounding boxes. Therefore, our tracker simultaneously solves three main problems: detection, data association and segmentation. This is especially important because the output of each of those three problems are highly correlated and the solution of one can greatly help improve the others. The proposed algorithm consists of two main components: structured learning and Lagrange dual decomposition. Our structured learning based tracker learns a model for each target and infers the best locations of all targets simultaneously in a video clip. The inference of our structured learning is achieved through a new Target Identity-aware Network Flow (TINF). The second component is Lagrange dual decomposition, which combines the structured learning tracker with a multi-label Conditional Random Field (CRF) based segmentation algorithm. This leads to more accurate segmentation results and also helps better resolve typical difficulties in multiple target tracking, such as occlusion handling, ID-switch and track drifting

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Spatiotemporal Deformable Part Models for Action Detection

Author: Mubarak Shah
Rahul Sukthankar
Yicong Tian
Publication venue
Publication date: 15/07/2013
Field of study

Deformable part models have achieved impressive performance for object detection, even on difficult image datasets. This paper explores the generalization of deformable part models from 2D images to 3D spatiotemporal volumes to better study their effectiveness for action detection in video. Actions are treated as spatiotemporal patterns and a deformable part model is generated for each action from a collection of examples. For each action model, the most discriminative 3D subvolumes are automatically selected as parts and the spatiotemporal relations between their locations are learned. By focusing on the most distinctive parts of each action, our models adapt to intra-class variation and show robustness to clutter. Extensive experiments on several video datasets demonstrate the strength of spatiotemporal DPMs for classifying and localizing actions. 1

CiteSeerX

Crossref

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

On Detection, Data Association and Segmentation for Multi-Target Tracking

Author: Afshin Dehghan
Mubarak Shah
Yicong Tian
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Target Identity-Aware Network Flow For Online Multiple Target Tracking

Author: Dehghan Afshin
Shah Mubarak
Tian Yicong
Torr Philip H.S.
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 14/10/2015
Field of study

In this paper we show that multiple object tracking (MOT) can be formulated in a framework, where the detection and data-association are performed simultaneously. Our method allows us to overcome the confinements of data association based MOT approaches; where the performance is dependent on the object detection results provided at input level. At the core of our method lies structured learning which learns a model for each target and infers the best location of all targets simultaneously in a video clip. The inference of our structured learning is done through a new Target Identity-aware Network Flow (TINF), where each node in the network encodes the probability of each target identity belonging to that node. The proposed Lagrangian relaxation optimization finds the high quality solution to the network. During optimization a soft spatial constraint is enforced between the nodes of the graph which helps reducing the ambiguity caused by nearby targets with similar appearance in crowded scenarios. We show that automatically detecting and tracking targets in a single framework can help resolve the ambiguities due to frequent occlusion and heavy articulation of targets. Our experiments involve challenging yet distinct datasets and show that our method can achieve results better than the state-of-art

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)