81 research outputs found

    Multi-feature data repository development and analytics for image cosegmentation in high-throughput plant phenotyping

    Cosegmentation is a newly emerging computer vision technique used to segment an object from the background by processing multiple images at the same time. Traditional plant phenotyping analysis uses thresholding segmentation methods which result in high segmentation accuracy. Although there are proposed machine learning and deep learning algorithms for plant segmentation, predictions rely on the specific features being present in the training set. The need for a multi-featured dataset and analytics for cosegmentation becomes critical to better understand and predict plants’ responses to the environment. High-throughput phenotyping produces an abundance of data that can be leveraged to improve segmentation accuracy and plant phenotyping. This paper introduces four datasets consisting of two plant species, Buckwheat and Sunflower, each split into control and drought conditions. Each dataset has three modalities (Fluorescence, Infrared, and Visible) with 7 to 14 temporal images that are collected in a high-throughput facility at the University of Nebraska-Lincoln. The four datasets (which will be collected under the CosegPP data repository in this paper) are evaluated using three cosegmentation algorithms: Markov random fields-based, Clustering-based, and Deep learning-based cosegmentation, and one commonly used segmentation approach in plant phenotyping. The integration of CosegPP with advanced cosegmentation methods will be the latest benchmark in comparing segmentation accuracy and finding areas of improvement for cosegmentation methodology

    CG2Real: Improving the Realism of Computer Generated Images using a Large Collection of Photographs

    Computer Graphics (CG) has achieved a high level of realism, producing strikingly vivid images. This realism, however, comes at the cost of long and often expensive manual modeling, and most often humans can still distinguish between CG images and real images. We present a novel method to make CG images look more realistic that is simple and accessible to novice users. Our system uses a large collection of photographs gathered from online repositories. Given a CG image, we retrieve a small number of real images with similar global structure. We identify corresponding regions between the CG and real images using a novel mean-shift cosegmentation algorithm. The user can then automatically transfer color, tone, and texture from matching regions to the CG image. Our system only uses image processing operations and does not require a 3D model of the scene, making it fast and easy to integrate into digital content creation workflows. Results of a user study show that our improved CG images appear more realistic than the originals

    Toward Large Scale Semantic Image Understanding and Retrieval

    Semantic image retrieval is a multifaceted, highly complex problem. Not only does the solution to this problem require advanced image processing and computer vision techniques, but it also requires knowledge beyond what can be inferred from the image content alone. In contrast, traditional image retrieval systems are based upon keyword searches on filenames or metadata tags, e.g. Google image search, Flickr search, etc. These conventional systems do not analyze the image content and their keywords are not guaranteed to represent the image. Thus, there is significant need for a semantic image retrieval system that can analyze and retrieve images based upon the content and relationships that exist in the real world.
    In this thesis, I present a framework that moves towards advancing semantic image retrieval in large scale datasets. At a conceptual level, semantic image retrieval requires the following steps: viewing an image, understanding the content of the image, indexing the important aspects of the image, connecting the image concepts to the real world, and finally retrieving the images based upon the index concepts or related concepts. My proposed framework addresses each of these components in my ultimate goal of improving image retrieval. The first task is the essential task of understanding the content of an image. Unfortunately, the only data a computer algorithm typically uses when analyzing images is the low-level pixel data. However, to achieve human level comprehension, a machine must overcome the semantic gap, the disparity that exists between the image data and human understanding. This translation of the low-level information into a high-level representation is an extremely difficult problem that requires more than the image pixel information. I describe my solution to this problem through the use of an online knowledge acquisition and storage system. This system utilizes the extensible, visual, and interactive properties of Scalable Vector Graphics (SVG) combined with online crowd sourcing tools to collect high level knowledge about visual content.
    I further describe the utilization of knowledge and semantic data for image understanding. Specifically, I seek to incorporate knowledge in various algorithms that cannot be inferred from the image pixels alone. This information comes from related images or structured data (in the form of hierarchies and ontologies) to improve the performance of object detection and image segmentation tasks. These understanding tasks are crucial intermediate steps towards retrieval and semantic understanding. However, typical object detection and segmentation tasks require an abundance of training data for machine learning algorithms. The prior training information indicates what patterns and visual features the algorithm should look for when processing an image. In contrast, my algorithm utilizes semantically related images to extract the visual properties of an object and also to decrease the search space of my detection algorithm. Furthermore, I demonstrate the use of related images in the image segmentation process. Again, without the use of prior training data, I present a method for foreground object segmentation by finding the shared area that exists in a set of images. I demonstrate the effectiveness of my method on structured image datasets that have defined relationships between classes, i.e. parent-child or sibling classes.
    Finally, I introduce my framework for semantic image retrieval. I enhance the proposed knowledge acquisition and image understanding techniques with semantic knowledge through linked data and web semantic languages. This is an essential step in semantic image retrieval. For example, an image processing algorithm not enhanced by external knowledge that classifies a car would have no idea that a car is a type of vehicle, highly related to a truck and less related to other transportation methods such as a train. However, a query for modes of human transportation should return all of the mentioned classes. Thus, I demonstrate how to integrate information from both image processing algorithms and semantic knowledge bases to perform interesting queries that would otherwise be impossible. The key component of this system is a novel property reasoner that is able to translate low level image features into semantically relevant object properties. I use a combination of XML based languages such as SVG, RDF, and OWL in order to link to existing ontologies available on the web. My experiments demonstrate an efficient data collection framework and novel utilization of semantic data for image analysis and retrieval on datasets of people and landmarks collected from sources such as IMDB and Flickr. Ultimately, my thesis presents improvements to the state of the art in visual knowledge representation/acquisition and computer vision algorithms such as detection and segmentation toward the goal of enhanced semantic image retrieval

    No Spare Parts: Sharing Part Detectors for Image Categorization

    This work aims for image categorization using a representation of distinctive parts. Different from existing part-based work, we argue that parts are naturally shared between image categories and should be modeled as such. We motivate our approach with a quantitative and qualitative analysis by backtracking where selected parts come from. Our analysis shows that in addition to the category parts defining the class, the parts coming from the background context and parts from other image categories improve categorization performance. Part selection should not be done separately for each category, but instead be shared and optimized over all categories. To incorporate part sharing between categories, we present an algorithm based on AdaBoost to jointly optimize part sharing and selection, as well as fusion with the global image representation. We achieve results competitive to the state-of-the-art on object, scene, and action categories, further improving over deep convolutional neural networks

    Analysis of Sub-Cortical Morphology in Benign Epilepsy with Centrotemporal Spikes

    ABSTRACT In Canada, epilepsy affects approximately 5 to 8 children per 3222 aged from 2 to 37 years in the overall population. Fifteen to 47% of these children have benign epilepsy with centrotemporal spikes (BECTS), making BECTS the most common benign childhood focal epileptic syndrome. Initially, BECTS was considered benign among other epilepsies since it was generally reported that cognitive abilities were preserved or brought back to normal during remission. However, some studies have found cognitive and behavioral deficits, which may well persist even after remission. Given neurocognitive differences between children with BECTS and normal controls, the question is whether subtle morphometric variations in brain structures are also present in these patients, and whether they explain variations in cognitive indices. In fact, despite the accumulating evidence of a neurodevelopmental etiology in BECTS, little is known about underlying structural alterations. In this respect, proposing advanced neuroimaging methods will allow for quantitative assessment of variations in brain morphology associated with this neurological disorder. In addition, studying the brain morphological development and its relationship with cognition may help elucidate the neuroanatomical basis of cognitive deficits. Therefore, the focus of this thesis is to provide a set of tools for analyzing the subtle sub-cortical morphological alterations in different diseases, such as benign epilepsy with centrotemporal spikes. The methodology adopted in this thesis addresses three specific research objectives. The first step develops a new automated framework for segmenting subcortical structures on MR images. 
The second step designs a new approach based on spectral correspondence to precisely capture shape variability in epileptic individuals. The third step finds the association between brain morphological changes and cognitive indices. The first contribution aims more specifically at automatic segmentation of sub-cortical structures in a groupwise multi-atlas coregistration and cosegmentation process. Contrary to the standard multi-atlas segmentation approaches, the proposed method obtains the final segmentation using a population-wise registration, while Convolutional Neural Network (CNN)-based priors are incorporated in the energy formulation as a discriminative image representation. Thus, this method exploits more sophisticated learned representations to drive the coregistration process. Furthermore, given a set of target volumes, the developed method computes the segmentation probabilities individually, and then segments all the volumes simultaneously. Therefore, the burden of providing an appropriate ground truth subset to perform multi-atlas segmentation is removed. Promising results demonstrate the potential of our method on two different datasets, containing annotations of sub-cortical structures. The importance of reliable label estimations is also highlighted, motivating the use of deep neural nets to replace ground truth annotations in coregistration with minimal loss in performance. The second contribution intends to capture shape variability between two populations of surfaces using groupwise morphological analysis. The proposed method exploits spectral representation for establishing surface correspondences, since matching is simpler in the spectral domain than in the conventional Euclidean space. The designed framework integrates mean curvature-based spectral matching into a groupwise subcortical shape analysis pipeline. 
Experimental analysis on a real clinical dataset showed that the extracted group differences were consistent with the findings of other clinical studies, while the shape analysis outputs were created in a computationally efficient manner. Finally, the third contribution establishes the association between sub-cortical morphological alterations in children with benign epilepsy and cognitive indices. This study detects putamen and caudate changes in children with left, right, or bilateral BECTS relative to age- and gender-matched healthy individuals. In addition, the association of structural volumetric and shape differences with cognition is investigated. The findings confirm putamen and caudate shape alterations in children with BECTS. Also, our results suggest that variation in sub-cortical shape affects cognitive functions. More importantly, this study demonstrates that shape alterations and their relation with cognition depend on the side of epilepsy focus. This project enabled us to investigate whether new methods would make it possible to automatically process neuroimaging information from children afflicted with BECTS and detect subtle morphological variations in their sub-cortical structures. In addition, the results obtained in this thesis allowed us to conclude that an association exists between morphological variations and cognition with respect to the side of seizure focus

    A brief survey of visual saliency detection
