9 research outputs found

    No Spare Parts: Sharing Part Detectors for Image Categorization

    Get PDF
    This work aims for image categorization using a representation of distinctive parts. Different from existing part-based work, we argue that parts are naturally shared between image categories and should be modeled as such. We motivate our approach with a quantitative and qualitative analysis by backtracking where selected parts come from. Our analysis shows that in addition to the category parts defining the class, the parts coming from the background context and parts from other image categories improve categorization performance. Part selection should not be done separately for each category, but instead be shared and optimized over all categories. To incorporate part sharing between categories, we present an algorithm based on AdaBoost to jointly optimize part sharing and selection, as well as fusion with the global image representation. We achieve results competitive to the state-of-the-art on object, scene, and action categories, further improving over deep convolutional neural networks

    Discriminative Probabilistic Pattern Mining using Graph for Electronic Health Records

    Get PDF
    ν•™μœ„λ…Όλ¬Έ(석사)--μ„œμšΈλŒ€ν•™κ΅ λŒ€ν•™μ› :κ³΅κ³ΌλŒ€ν•™ 컴퓨터곡학뢀,2019. 8. κΉ€μ„ .μ „μžμ˜λ£ŒκΈ°λ‘(Electronic Health Records)의 μž„μƒ λ…ΈνŠΈμ—λŠ” ν™˜μžμ˜ 병λ ₯에 λŒ€ν•œ μœ μš©ν•œ 정보가 많이 ν¬ν•¨λ˜μ–΄ μžˆλ‹€. κ·ΈλŸ¬λ‚˜ μž„μƒ λ…ΈνŠΈλŠ” μ²΄κ³„ν™”λ˜μ§€ μ•Šμ€ 데이터이며 κ·Έ 양은 λ‚˜λ‚ μ΄ μ¦κ°€ν•˜κ³  μžˆλ‹€. λ”°λΌμ„œ μž„μƒ λ…ΈνŠΈλ₯Ό κ·Έλ£Ήν™”ν•˜κ³  λΆ„λ₯˜ν•˜κΈ° μœ„ν•œ μ‹ λ’°ν•  수 μžˆλŠ” 데이터 λ§ˆμ΄λ‹ 기술이 ν•„μš”ν•˜λ‹€. 기쑴의 데이터 λ§ˆμ΄λ‹ κΈ°μˆ μ€ ν‚€μ›Œλ“œμ˜ λΉˆλ„λ₯Ό 기반으둜 μƒμ„±λœ 빈발 νŒ¨ν„΄(frequent patterns)을 μ΄μš©ν•˜μ—¬ κ·Έλ£Ή λΆ„λ₯˜ μž‘μ—…(classification)을 μˆ˜ν–‰ν•œλ‹€. ν•˜μ§€λ§Œ μ΄λŸ¬ν•œ 빈발 νŒ¨ν„΄μ€ μ „μžμ˜λ£ŒκΈ°λ‘μ˜ μž„μƒ λ…ΈνŠΈμ™€ 같이 λ³΅μž‘ν•œ λ°μ΄ν„°μ˜ λΆ„λ₯˜λ₯Ό μœ„ν•΄ ν•„μš”ν•œ μΆ©λΆ„νžˆ κ°•λ ₯ν•˜κ³  λͺ…ν™•ν•˜κ²Œ κ΅¬λ³„λ˜λŠ” νŠΉμ§•μ„ κ°–κ³  μžˆμ§€ μ•Šλ‹€. λ˜ν•œ 빈발 νŒ¨ν„΄ 기반 κΈ°μˆ μ€ λŒ€κ·œλͺ¨ μ „μžμ˜λ£ŒκΈ°λ‘ 데이터에 적용될 λ•Œ ν™•μž₯μ„±κ³Ό 계산 λΉ„μš©μ˜ λ¬Έμ œμ— μ§λ©΄ν•œλ‹€. λ”°λΌμ„œ λ³Έ μ—°κ΅¬μ—μ„œλŠ” μ΄λŸ¬ν•œ λ¬Έμ œμ μ„ ν•΄κ²°ν•˜κΈ° μœ„ν•΄ ν™•λ₯ μ  νŒλ³„ νŒ¨ν„΄ λ§ˆμ΄λ‹(discriminative probabilistic pattern mining) μ•Œκ³ λ¦¬μ¦˜μ„ μ†Œκ°œν•œλ‹€. ν™•λ₯ μ  νŒλ³„ νŒ¨ν„΄ λ§ˆμ΄λ‹ μ•Œκ³ λ¦¬μ¦˜μ—μ„œλŠ” μ „μžμ˜λ£ŒκΈ°λ‘μ˜ μž„μƒ λ…ΈνŠΈλ₯Ό λΆ„λ₯˜ν•˜κΈ° μœ„ν•΄ κ·Έλž˜ν”„ ꡬ쑰λ₯Ό λ„μž…ν•˜μ—¬ 빈발 νŒ¨ν„΄μ˜ λΆ€λΆ„ κ·Έλž˜ν”„λ₯Ό μƒμ„±ν•˜κ²Œ λœλ‹€. λ³Έ μ—°κ΅¬μ—μ„œλŠ” νŒλ³„λ ₯을 높이기 μœ„ν•΄ κ°œλ³„ ν‚€μ›Œλ“œλ₯Ό μ‚¬μš©ν•˜λŠ” λŒ€μ‹  이진 νŠΉμ„± μ‘°ν•©μ—μ„œμ˜ λ™μ‹œ μΆœν˜„(co-occurrence)을 μ‚¬μš©ν•˜μ—¬ μž„μƒ λ…ΈνŠΈ λΆ„λ₯˜λ₯Ό μœ„ν•œ 빈발 νŒ¨ν„΄ κ·Έλž˜ν”„λ₯Ό κ΅¬μ„±ν•œλ‹€. 각각의 λ™μ‹œ μΆœν˜„μ€ νŒλ³„λ ₯(discriminative power)에 λ”°λ₯Έ log-odds κ°’μœΌλ‘œ κ·Έ κ°€μ€‘μΉ˜λ₯Ό κ°–λŠ”λ‹€. μž„μƒ λ…ΈνŠΈμ˜ λ³Έμ§ˆμ„ λ°˜μ˜ν•˜λŠ” κ·Έλž˜ν”„λ₯Ό μ°ΎκΈ° μœ„ν•΄ ν™•λ₯ μ  νŒλ³„ λΆ€λΆ„ κ·Έλž˜ν”„ 검색을 μˆ˜ν–‰ν•˜λ©° κ·Έλž˜ν”„μ˜ ν—ˆλΈŒ(hub) λ…Έλ“œμ—μ„œ μ‹œμž‘ν•˜μ—¬ 동적 ν”„λ‘œκ·Έλž˜λ°(dynamic programming)을 μ‚¬μš©ν•˜μ—¬ 경둜λ₯Ό μ°ΎλŠ”λ‹€. μ΄λŸ¬ν•œ λ°©λ²•μœΌλ‘œ κ²€μƒ‰ν•œ 빈발 λΆ€λΆ„ κ·Έλž˜ν”„λ₯Ό μ΄μš©ν•˜μ—¬ μ „μžμ˜λ£ŒκΈ°λ‘μ˜ μž„μƒ λ…ΈνŠΈμ— λŒ€ν•œ λΆ„λ₯˜ μž‘μ—…μ„ μˆ˜ν–‰ν•˜κ²Œ λœλ‹€.Electronic Health Records (EHR) contains plenty of useful information about patients medical history. However, EHR is highly unstructured data and amount of it is growing continuously, that is why there is a need in a reliable data mining technique to group and categorize clinical notes. Although, many existing data mining techniques for group classification use frequent patterns generated based on frequencies of keywords, these patterns do not possess strong enough distinguishing characteristics to show the difference between datasets to classify complex data such as clinical notes in EHR. Also, these techniques encounter scalability and computational cost problems when used on large EHR dataset. To address these issues, we introduce discriminative probabilistic pattern mining algorithm that uses a graph (DPPMG) to generate the subgraphs of frequent patterns for classification in electronic health records. We use co-occurrence, a combination of binary features, which is more discriminative than individual keywords to construct discriminative probabilistic frequent patterns graph for clinical notes classification. Each co-occurrence has a weight of log-odds score that is associated with its discriminative power. The graph, which reflects the essence of clinical notes is searched to find discriminative probabilistic frequent subgraphs. To discover the discriminative frequent subgraphs, we start from a hub node in the graph and use dynamic programming to find a path. The discriminative probabilistic frequent subgraphs discovered by this approach are later used to classify clinical notes of electronic health records.Chapter 1 Introduction and Motivation 1 Chapter 2 Background 4 2.1 Frequent Pattern Based Classification 4 2.2 Discriminative Pattern Mining 5 2.3 Electronic Health Records 6 Chapter 3 Related Work 8 Chapter 4 Overview and Design 10 Chapter 5 Implementation 12 5.1 Dataset 12 5.2 Keyword Extraction and Filtering 15 5.3 Co-occurrence Generation and Graph Construction 16 5.4 Dynamic Programming to Discover Optimal Path 17 Chapter 6 Results and Evaluation 20 6.1 Choosing Starting Hub Node 20 6.2 Qualitative Analysis 22 6.3 Discriminative Power of the Probabilistic Frequent Patterns 24 Chapter 7 Conclusion 26 Bibliography 28 μš”μ•½ 33Maste

    Discovering Latent Clusters from Geotagged Beach Images

    Full text link
    Abstract. This paper studies the problem of estimating geographical locations of images. To build reliable geographical estimators, an impor-tant question is to find distinguishable geographical clusters in the world. Those clusters cover general geographical regions and are not limited to landmarks. The geographical clusters provide more training samples and hence lead to better recognition accuracy. Previous approaches build geographical clusters using heuristics or arbitrary map grids, and can-not guarantee the effectiveness of the geographical clusters. This paper develops a new framework for geographical cluster estimation, and em-ploys latent variables to estimate the geographical clusters. To solve this problem, this paper employs the recent progress in object detection, and builds an efficient solver to find the latent clusters. The results on beach datasets validate the success of our method.

    Mining Mid-level Features for Image Classification

    No full text
    International audienceMid-level or semi-local features learnt using class-level information are potentially more distinctive than the traditional low-level local features constructed in a purely bottom-up fashion. At the same time they preserve some of the robustness properties with respect to occlusions and image clutter. In this paper we propose a new and effective scheme for extracting mid-level features for image classification, based on relevant pattern mining. In par- ticular, we mine relevant patterns of local compositions of densely sampled low-level features. We refer to the new set of obtained patterns as Frequent Local Histograms or FLHs. During this process, we pay special attention to keeping all the local histogram information and to selecting the most relevant reduced set of FLH patterns for classification. The careful choice of the visual primitives and an extension to exploit both local and global spatial information allow us to build powerful bag-of-FLH-based image representations. We show that these bag-of-FLHs are more discriminative than traditional bag-of-words and yield state-of-the-art results on various image classification benchmarks, including Pascal VOC

    RelCom: Relational combinatorics features for rapid object detection

    Full text link

    Advanced machine learning approaches for target detection, tracking and recognition

    Get PDF
    This dissertation addresses the key technical components of an Automatic Target Recognition (ATR) system namely: target detection, tracking, learning and recognition. Novel solutions are proposed for each component of the ATR system based on several new advances in the field of computer vision and machine learning. Firstly, we introduce a simple and elegant feature, RelCom, and a boosted feature selection method to achieve a very low computational complexity target detector. Secondly, we present a particle filter based target tracking algorithm that uses a quad histogram based appearance model along with online feature selection. Further, we improve the tracking performance by means of online appearance learning where appearance learning is cast as an Adaptive Kalman filtering (AKF) problem which we formulate using both covariance matching and, for the first time in a visual tracking application, the recent autocovariance least-squares (ALS) method. Then, we introduce an integrated tracking and recognition system that uses two generative models to accommodate the pose variations and maneuverability of different ground targets. Specifically, a tensor-based generative model is used for multi-view target representation that can synthesize unseen poses, and can be trained from a small set of signatures. In addition, a target-dependent kinematic model is invoked to characterize the target dynamics. Both generative models are integrated in a graphical framework for joint estimation of the target's kinematics, pose, and discrete valued identity. Finally, for target recognition we advocate the concept of a continuous identity manifold that captures both inter-class and intra-class shape variability among training targets. A hemispherical view manifold is used for modeling the view-dependent appearance. In addition to being able to deal with arbitrary view variations, this model can determine the target identity at both class and sub-class levels, for targets not present in the training data. The proposed components of the ATR system enable us to perform low computational complexity target detection with low false alarm rates, robust tracking of targets under challenging circumstances and recognition of target identities at both class and sub-class levels. Experiments on real and simulated data confirm the performance of the proposed components with promising results

    Mining compositional features for boosting

    Get PDF
    The selection of weak classifiers is critical to the success of boosting techniques. Poor weak classifiers do not perform better than random guess, thus cannot help decrease the training error during the boosting process. Therefore, when constructing the weak classifier pool, we prefer the quality rather than the quantity of the weak classifiers. In this paper, we present a data mining-driven approach to discovering compositional features from a given and possibly small feature pool. Compared with individual features (e.g. weak decision stumps) which are of limited discriminative ability, the mined compositional features have guaranteed power in terms of the descriptive and discriminative abilities, as well as bounded training error. To cope with the combinatorial cost of discovering compositional features, we apply data mining methods (frequent itemset mining) to efficiently find qualified compositional features of any possible order. These weak classifiers are further combined through a multi-class AdaBoost method for final multi-class classification. Experiments on a challenging 10-class event recognition problem show that boosting compositional features can lead to faster decrease of training error and significantly higher accuracy compared to conventional boosting decision stumps. 1
    corecore