59,681 research outputs found

    κ΄‘μ—­ 닀쀑 λ³΄ν–‰μž 좔적을 μœ„ν•œ 계측적 ꢀ적 맀칭 기법

    Get PDF
    ν•™μœ„λ…Όλ¬Έ (박사) -- μ„œμšΈλŒ€ν•™κ΅ λŒ€ν•™μ› : κ³΅κ³ΌλŒ€ν•™ 전기·컴퓨터곡학뢀, 2020. 8. μ΅œμ§„μ˜.The purpose of wide-area tracking problem is to track pedestrians that appear on cameras that overlap or do not overlap, regardless of the time interval or person density. In a single camera tracking, data association using overlapping of the detection boxes is used to solve the tracking problem, but still has appearance ambiguity issues. However, wide-area tracking requires a tracking scheme that focuses on the appearance similarity of humans, without the use of overlapping of detection boxes. In this dissertation, we propose the tracking scheme for the Wide-area Multi-Pedestrian Tracking (WaMuPeT). To achieve the WaMuPeT, we propose the trajectory matching in overlapping camera settings (Ch. 3), non-overlapping camera settings (Ch. 4) and robust trajectory matching in dense scene settings (Ch. 5). In trajectory matching in overlapping camera settings (Ch. 3), we propose a novel deep-learning architecture for accurate 3-D localization and tracking of a pedestrian using multiple cameras. The deep-learning network is composed of two networks: detection network and localization network. The detection network yields the pedestrian detections and the localization network estimates the ground position of a pedestrian within its detection box. In addition, an attentional pass filter is introduced to effectively connect the two networks. Using the detection proposals and their 2-D grounding positions obtained from the two networks, multi-camera multi-target 3-D localization and tracking algorithm is developed through min-cost network flow approach. In the experiments, it is shown that the proposed method improves the performance of 3-D localization and tracking. In trajectory matching in non-overlapping camera settings (Ch. 4), we propose a novel re-ranking method using a ranking-reflected metric to measure the similarity between two ordered sets of KK-nearest neighbors (OKNN). The proposed metric for ranking-reflected similarity (RSS) reflects the ranking of the shared elements between the two OKNNs. Using RSS, a re-ranking procedure is proposed that prioritizes galleries having neighbors similar to a probe's neighbor in the perspective of ranking order. In the experiment, we show that the proposed method improves the Re-ID accuracy by add-on to the state-of-the-art methods. In robust trajectory matching in dense scene settings (Ch. 5), we propose a novel framework for multi-pedestrian tracking to generate robust trajectories in dense scene. In the proposed tracking method, we propose the tracking method based on the trajectory matching by the strategy of divide and conquer method. In this strategy, short-term, mid-term and long-term trajectories are generated by each trajectory merging stages, respectively. Also we propose a novel deep-feature matching method called stable boundary selection (SBS). In SBS matching, the detections are clustered by the group similarity of deep features, so that robust trajectories can be generated. With the smoothing algorithms and the detection restoration algorithm, the proposed tracking method shows the state-of-the-art tracking accuracy in three public tracking dataset.κ΄‘μ—­ 좔적 문제의 λͺ©μ μ€ μ‹œκ°„ κ°„κ²©μ΄λ‚˜ μ‚¬λžŒ 밀도에 관계없이 κ²ΉμΉ˜κ±°λ‚˜ κ²ΉμΉ˜μ§€ μ•ŠλŠ” 카메라에 λ‚˜νƒ€λ‚˜λŠ” λ³΄ν–‰μžλ₯Ό μΆ”μ ν•˜λŠ” 것이닀. 단일 카메라 μΆ”μ μ—μ„œ 감지 μƒμžμ˜ 겹침을 μ‚¬μš©ν•˜λŠ” 데이터 연결은 좔적 문제λ₯Ό ν•΄κ²°ν•˜λŠ” 데 μ‚¬μš©λ˜μ§€λ§Œ μ—¬μ „νžˆ λͺ¨μ–‘ λͺ¨ν˜Έμ„± λ¬Έμ œκ°€ μžˆλ‹€. κ·ΈλŸ¬λ‚˜ κ΄‘μ—­ μΆ”μ μ—λŠ” 감지 μƒμžμ˜ 겹침을 μ‚¬μš©ν•˜μ§€ μ•Šκ³  μ‚¬λžŒμ˜ μ™Έν˜• μœ μ‚¬μ„±μ— 쀑점을 λ‘” 좔적 체계가 ν•„μš”ν•˜λ‹€. 이 λ…Όλ¬Έμ—μ„œλŠ” κ΄‘μ—­ 닀쀑 λ³΄ν–‰μž 좔적 (WaMuPeT)에 λŒ€ν•œ 좔적 체계λ₯Ό μ œμ•ˆν•œλ‹€. WaMuPeTλ₯Ό λ‹¬μ„±ν•˜κΈ° μœ„ν•΄ κ²ΉμΉ˜λŠ” 카메라 μ„€μ • (3 μž₯), κ²ΉμΉ˜μ§€ μ•ŠλŠ” 카메라 μ„€μ • (4 μž₯) μ—μ„œμ˜ ꢀ적 일치 그리고 λΉ½λΉ½ν•œ μž₯λ©΄ μ„€μ • (5 μž₯)μ—μ„œ κ°•μΈν•œ ꢀ적 μΌμΉ˜μ— λŒ€ν•΄μ„œ μ œμ•ˆν•œλ‹€. κ²ΉμΉ˜λŠ” 카메라 μ„€μ •μ—μ„œμ˜ ꢀ적 맀칭 (3 μž₯)μ—μ„œλŠ” μ—¬λŸ¬ 카메라λ₯Ό μ‚¬μš©ν•˜μ—¬ λ³΄ν–‰μžλ₯Ό μ •ν™•ν•˜κ²Œ 3D μ§€μ—­ν™”ν•˜κ³  μΆ”μ ν•˜κΈ°μœ„ν•œ μƒˆλ‘œμš΄ λ”₯ λŸ¬λ‹ μ•„ν‚€ν…μ²˜λ₯Ό μ œμ•ˆν•œλ‹€. λ”₯ λŸ¬λ‹ λ„€νŠΈμ›Œν¬λŠ” 감지 λ„€νŠΈμ›Œν¬μ™€ λ‘œμ»¬λΌμ΄μ œμ΄μ…˜ λ„€νŠΈμ›Œν¬μ˜ 두 가지 λ„€νŠΈμ›Œν¬λ‘œ κ΅¬μ„±λœλ‹€. 탐지 λ„€νŠΈμ›Œν¬λŠ” λ³΄ν–‰μž 탐지λ₯Ό μ œκ³΅ν•˜κ³  ν˜„μ§€ν™” λ„€νŠΈμ›Œν¬λŠ” 탐지 μƒμž λ‚΄μ—μ„œ λ³΄ν–‰μžμ˜ 지상 μœ„μΉ˜λ₯Ό μΆ”μ •ν•œλ‹€. λ˜ν•œ 두 개의 λ„€νŠΈμ›Œν¬λ₯Ό 효과적으둜 μ—°κ²°ν•˜κΈ° μœ„ν•΄μ£Όμ˜ 패슀 ν•„ν„°κ°€ λ„μž…λ˜μ—ˆλ‹€. 두 λ„€νŠΈμ›Œν¬μ—μ„œ 얻은 탐지 μ œμ•ˆ 및 2D 접지 μœ„μΉ˜λ₯Ό μ‚¬μš©ν•˜μ—¬ μ΅œμ†Œ λΉ„μš©μ˜ λ„€νŠΈμ›Œν¬ 흐름 μ ‘κ·Ό 방식을 톡해 닀쀑 카메라 닀쀑 λŒ€μƒ 3D 지역화 및 좔적 μ•Œκ³ λ¦¬μ¦˜μ΄ κ°œλ°œλœλ‹€. μ‹€ν—˜μ—μ„œ μ œμ•ˆ 된 방법이 3D 지역화 및 좔적 μ„±λŠ₯을 ν–₯μƒμ‹œν‚€λŠ” κ²ƒμœΌλ‘œ λ‚˜νƒ€λ‚¬λ‹€. κ²ΉμΉ˜μ§€ μ•ŠλŠ” 카메라 μ„€μ •μ—μ„œμ˜ ꢀ적 일치 (4 μž₯)μ—μ„œ, μš°λ¦¬λŠ” μˆœμœ„κ°€ 반영된 λ©”νŠΈλ¦­μ„ μ‚¬μš©ν•˜μ—¬ λ‘κ°œμ˜ μˆœμ„œκ°€ μ§€μ •λœ KK-졜근 μ ‘ 이웃 (OKNN) μ„ΈνŠΈ μ‚¬μ΄μ˜ μœ μ‚¬μ„±μ„ μΈ‘μ •ν•œλ‹€. μˆœμœ„ 반영 μœ μ‚¬μ„± (RSS)에 λŒ€ν•΄ μ œμ•ˆ 된 λ©”νŠΈλ¦­μ€ 두 OKNN μ‚¬μ΄μ˜ 곡유 μš”μ†Œμ˜ μˆœμœ„λ₯Ό λ°˜μ˜ν•©λ‹ˆλ‹€. RSSλ₯Ό μ‚¬μš©ν•˜μ—¬, μˆœμœ„ μˆœμ„œμ˜ κ΄€μ μ—μ„œ ν”„λ‘œλΈŒμ˜ 이웃과 μœ μ‚¬ν•œ 이웃을 κ°–λŠ” 가러리λ₯Ό μš°μ„  μˆœμœ„ ν™”ν•˜λŠ” μž¬μˆœμœ„ μ ˆμ°¨κ°€ μ œμ•ˆλœλ‹€. μ‹€ν—˜μ—μ„œ μ œμ•ˆ 된 방법이 μ΅œμ‹  방법에 μΆ”κ°€λ˜μ–΄ Re-ID 정확도가 ν–₯상됨을 보여쀀닀. 고밀도 μž₯λ©΄ μ„€μ •μ—μ„œ κ°•λ ₯ν•œ ꢀ적 일치 (5 μž₯)μ—μ„œ, μš°λ¦¬λŠ” 고밀도 μž₯λ©΄μ—μ„œ κ°•λ ₯ν•œ ꢀ적을 μƒμ„±ν•˜κΈ° μœ„ν•΄ 닀쀑 λ³΄ν–‰μž 좔적을 μœ„ν•œ μƒˆλ‘œμš΄ ν”„λ ˆμž„ μ›Œν¬λ₯Ό μ œμ•ˆν•œλ‹€. μ œμ•ˆλœ 좔적 λ°©λ²•μ—μ„œλŠ” λΆ„ν•  및 정볡 방법 μ „λž΅μ— λ”°λ₯Έ ꢀ적 맀칭을 기반으둜 좔적 방법을 μ œμ•ˆν•œλ‹€. 이 μ „λž΅μ—μ„œ, 단기, 쀑기 및 μž₯κΈ° ꢀ적은 각각의 ꢀ적 병합 단계에 μ˜ν•΄ μƒμ„±λœλ‹€. λ˜ν•œ SBS (Stable Boundary Selection)λΌλŠ” μƒˆλ‘œμš΄ κΈ°λŠ₯ 맀칭 기법을 μ œμ•ˆν•œλ‹€. SBS λ§€μΉ­μ—μ„œ, νƒμ§€λŠ” κΉŠμ€ νŠΉμ§•μ˜ κ·Έλ£Ή μœ μ‚¬μ„±μ— μ˜ν•΄ κ΅°μ§‘ν™”λ˜μ–΄, κ°•λ ₯ν•œ ꢀ적이 생성 될 수 μžˆλ‹€. μ œμ•ˆ 된 좔적 방법은 ν‰ν™œ μ•Œκ³ λ¦¬μ¦˜κ³Ό 탐지 볡원 μ•Œκ³ λ¦¬μ¦˜μ„ 톡해 3 개의 곡개 좔적 데이터 μ„ΈνŠΈμ—μ„œ μ΅œμ²¨λ‹¨ 좔적 정확도λ₯Ό 보여쀀닀.Chapter 1 Introduction 1 1.1 Background 1 1.2 Related Works 4 1.2.1 Localization of Pedestrian Detection 4 1.2.2 Pedestrian Feature from Person Re-identification 5 1.2.3 Multi-Pedestrian Tracking 8 1.3 Contributions 8 1.4 Thesis Organization 10 Chapter 2 Problem Statements 11 2.1 Trajectory Matching in Overlapping Camera Settings 11 2.1.1 Challenges 11 2.1.2 Approach for the challenges 13 2.2 Trajectory Matching in Non-Overlapping Camera Settings 13 2.2.1 Challenges 13 2.2.2 Approach for the challenges 14 2.3 Robust Trajectory Matching in Dense Scene Settings 16 2.3.1 Challenges 16 2.3.2 Approach for the challenges 18 Chapter 3 Trajectory Matching in Overlapping Camera Settings 19 3.1 Overall Scheme 19 3.2 Network Design 20 3.3 MCMTT with Proposed Network 22 Chapter 4 Trajectory Matching in Non-overlapping Camera Settings 25 4.1 Overall Scheme 25 4.2 Proposed Method 30 4.2.1 Proposed Similarity Metric 30 4.2.2 Selection of A 31 4.2.3 Re-ranking Procedure 32 Chapter 5 Robust Trajectory Matching in Dense Scene Settings 35 5.1 Overall Scheme 35 5.2 Similarity Matrix Generation 39 5.3 Stable Boundary Selection 40 5.4 Trajectory Smoothing 42 5.5 Detection Restoration 46 5.6 Trajectory Merging Process 48 Chapter 6 Experiments 51 6.1 Dataset and Evaluation Metric 51 6.1.1 Trajectory Matching in Overlapping Camera Settings 51 6.1.2 Trajectory Matching in Non-overlapping Camera Settings 52 6.1.3 Robust Trajectory Matching in Dense Scene Settings 53 6.2 Results and Discussion 56 6.2.1 Trajectory Matching in Overlapping Camera Settings 56 6.2.2 Trajectory Matching in Non-overlapping Camera Settings 56 6.2.3 Robust Trajectory Matching in Dense Scene Settings 62 Chapter 7 Conclusions and Future Works 81 7.1 Concluding Remarks 81 7.2 Future Works 83 Abstract 97Docto

    Deep Adaptive Feature Embedding with Local Sample Distributions for Person Re-identification

    Full text link
    Person re-identification (re-id) aims to match pedestrians observed by disjoint camera views. It attracts increasing attention in computer vision due to its importance to surveillance system. To combat the major challenge of cross-view visual variations, deep embedding approaches are proposed by learning a compact feature space from images such that the Euclidean distances correspond to their cross-view similarity metric. However, the global Euclidean distance cannot faithfully characterize the ideal similarity in a complex visual feature space because features of pedestrian images exhibit unknown distributions due to large variations in poses, illumination and occlusion. Moreover, intra-personal training samples within a local range are robust to guide deep embedding against uncontrolled variations, which however, cannot be captured by a global Euclidean distance. In this paper, we study the problem of person re-id by proposing a novel sampling to mine suitable \textit{positives} (i.e. intra-class) within a local range to improve the deep embedding in the context of large intra-class variations. Our method is capable of learning a deep similarity metric adaptive to local sample structure by minimizing each sample's local distances while propagating through the relationship between samples to attain the whole intra-class minimization. To this end, a novel objective function is proposed to jointly optimize similarity metric learning, local positive mining and robust deep embedding. This yields local discriminations by selecting local-ranged positive samples, and the learned features are robust to dramatic intra-class variations. Experiments on benchmarks show state-of-the-art results achieved by our method.Comment: Published on Pattern Recognitio

    Relation Discovery from Web Data for Competency Management

    Get PDF
    This paper describes a technique for automatically discovering associations between people and expertise from an analysis of very large data sources (including web pages, blogs and emails), using a family of algorithms that perform accurate named-entity recognition, assign different weights to terms according to an analysis of document structure, and access distances between terms in a document. My contribution is to add a social networking approach called BuddyFinder which relies on associations within a large enterprise-wide "buddy list" to help delimit the search space and also to provide a form of 'social triangulation' whereby the system can discover documents from your colleagues that contain pertinent information about you. This work has been influential in the information retrieval community generally, as it is the basis of a landmark system that achieved overall first place in every category in the Enterprise Search Track of TREC2006

    A Multi-task Deep Network for Person Re-identification

    Get PDF
    Person re-identification (ReID) focuses on identifying people across different scenes in video surveillance, which is usually formulated as a binary classification task or a ranking task in current person ReID approaches. In this paper, we take both tasks into account and propose a multi-task deep network (MTDnet) that makes use of their own advantages and jointly optimize the two tasks simultaneously for person ReID. To the best of our knowledge, we are the first to integrate both tasks in one network to solve the person ReID. We show that our proposed architecture significantly boosts the performance. Furthermore, deep architecture in general requires a sufficient dataset for training, which is usually not met in person ReID. To cope with this situation, we further extend the MTDnet and propose a cross-domain architecture that is capable of using an auxiliary set to assist training on small target sets. In the experiments, our approach outperforms most of existing person ReID algorithms on representative datasets including CUHK03, CUHK01, VIPeR, iLIDS and PRID2011, which clearly demonstrates the effectiveness of the proposed approach.Comment: Accepted by AAAI201
    • …
    corecore