1,470 research outputs found

    Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment, Refinement and Retrieval

    Get PDF
    Where previous reviews on content-based image retrieval emphasize on what can be seen in an image to bridge the semantic gap, this survey considers what people tag about an image. A comprehensive treatise of three closely linked problems, i.e., image tag assignment, refinement, and tag-based image retrieval is presented. While existing works vary in terms of their targeted tasks and methodology, they rely on the key functionality of tag relevance, i.e. estimating the relevance of a specific tag with respect to the visual content of a given image and its social context. By analyzing what information a specific method exploits to construct its tag relevance function and how such information is exploited, this paper introduces a taxonomy to structure the growing literature, understand the ingredients of the main works, clarify their connections and difference, and recognize their merits and limitations. For a head-to-head comparison between the state-of-the-art, a new experimental protocol is presented, with training sets containing 10k, 100k and 1m images and an evaluation on three test sets, contributed by various research groups. Eleven representative works are implemented and evaluated. Putting all this together, the survey aims to provide an overview of the past and foster progress for the near future.Comment: to appear in ACM Computing Survey

    Image Understanding by Socializing the Semantic Gap

    Get PDF
    Several technological developments like the Internet, mobile devices and Social Networks have spurred the sharing of images in unprecedented volumes, making tagging and commenting a common habit. Despite the recent progress in image analysis, the problem of Semantic Gap still hinders machines in fully understand the rich semantic of a shared photo. In this book, we tackle this problem by exploiting social network contributions. A comprehensive treatise of three linked problems on image annotation is presented, with a novel experimental protocol used to test eleven state-of-the-art methods. Three novel approaches to annotate, under stand the sentiment and predict the popularity of an image are presented. We conclude with the many challenges and opportunities ahead for the multimedia community

    Experiments on the Use of Feature Selection and Machine Learning Methods in Automatic Malay Text Categorization

    Get PDF
    AbstractDue to the rapid growth of documents in digital form, research in automatic text categorization into predefined categories has witnessed a booming interest. Although, there is a wide range of supervised machine learning methods have been applied to categorize English, relatively, only a few studies have been done on Malay text categorization. This paper reports our comparative evaluation of three machine learning methods on Malay text categorization. Two feature selection methods (Information gain (IG) and Chi-square) and three machine learning methods (K-Nearest Neighbor (k-NN), Naive Bayes (NB) and N-gram) were investigated. The three supervised machine learning models were evaluated on categorized Malay corpus, and experimental results showed that the k- NN with the Chi-square feature selection gave the best performance (Macro-F1 = 96.14)

    Supervised and unsupervised segmentation of textured images by efficient multi-level pattern classification

    Get PDF
    This thesis proposes new, efficient methodologies for supervised and unsupervised image segmentation based on texture information. For the supervised case, a technique for pixel classification based on a multi-level strategy that iteratively refines the resulting segmentation is proposed. This strategy utilizes pattern recognition methods based on prototypes (determined by clustering algorithms) and support vector machines. In order to obtain the best performance, an algorithm for automatic parameter selection and methods to reduce the computational cost associated with the segmentation process are also included. For the unsupervised case, the previous methodology is adapted by means of an initial pattern discovery stage, which allows transforming the original unsupervised problem into a supervised one. Several sets of experiments considering a wide variety of images are carried out in order to validate the developed techniques.Esta tesis propone metodologías nuevas y eficientes para segmentar imágenes a partir de información de textura en entornos supervisados y no supervisados. Para el caso supervisado, se propone una técnica basada en una estrategia de clasificación de píxeles multinivel que refina la segmentación resultante de forma iterativa. Dicha estrategia utiliza métodos de reconocimiento de patrones basados en prototipos (determinados mediante algoritmos de agrupamiento) y máquinas de vectores de soporte. Con el objetivo de obtener el mejor rendimiento, se incluyen además un algoritmo para selección automática de parámetros y métodos para reducir el coste computacional asociado al proceso de segmentación. Para el caso no supervisado, se propone una adaptación de la metodología anterior mediante una etapa inicial de descubrimiento de patrones que permite transformar el problema no supervisado en supervisado. Las técnicas desarrolladas en esta tesis se validan mediante diversos experimentos considerando una gran variedad de imágenes

    Mission Dependency Index of Air Force Built Infrastructure: Knowledge Discovery with Machine Learning

    Get PDF
    Mission Dependency Index (MDI) is a metric developed to capture the relative criticality of infrastructure assets with respect to organizational missions. The USAF adapted the MDI metric from the United States Navy’s MDI methodology. Unlike the Navy’s MDI data collection process, the USAF adaptation of the MDI metric employs generic facility category codes (CATCODEs) to assign MDI values. This practice introduces uncertainty into the MDI assignment process with respect to specific missions and specific infrastructure assets. The uncertainty associated with USAF MDI values necessitated the MDI adjudication process. The MDI adjudication process provides a mechanism for installation civil engineer personnel to lobby for accurate MDI values for specific infrastructure assets. The MDI adjudication process requires manual identification of MDI discrepancies, documentation, and extensive coordination between organizations. Given the existing uncertainty with USAF MDI values and the effort required for the MDI adjudication process, this research pursues machine learning and the knowledge discovery in databases (KDD) process to identify and understand relationships between real property data and mission critical infrastructure. Furthermore, a decision support tool is developed for the MDI adjudication process. Specifically, supervised learning techniques are employed to develop a classifier that can identify potential MDI discrepancies. This automation effort serves to minimize the manual MDI review process by identifying a subset of facilities for potential adjudication

    A comprehensible analysis of the efficacy of Ensemble Models for Bug Prediction

    Full text link
    The correctness of software systems is vital for their effective operation. It makes discovering and fixing software bugs an important development task. The increasing use of Artificial Intelligence (AI) techniques in Software Engineering led to the development of a number of techniques that can assist software developers in identifying potential bugs in code. In this paper, we present a comprehensible comparison and analysis of the efficacy of two AI-based approaches, namely single AI models and ensemble AI models, for predicting the probability of a Java class being buggy. We used two open-source Apache Commons Project's Java components for training and evaluating the models. Our experimental findings indicate that the ensemble of AI models can outperform the results of applying individual AI models. We also offer insight into the factors that contribute to the enhanced performance of the ensemble AI model. The presented results demonstrate the potential of using ensemble AI models to enhance bug prediction results, which could ultimately result in more reliable software systems

    Effective segmentation of sclera, iris and pupil in noisy eye images

    Get PDF
    In today’s sensitive environment, for personal authentication, iris recognition is the most attentive technique among the various biometric technologies. One of the key steps in the iris recognition system is the accurate iris segmentation from its surrounding noises including pupil and sclera of a captured eye-image. In our proposed method, initially input image is preprocessed by using bilateral filtering. After the preprocessing of images contour based features such as, brightness, color and texture features are extracted. Then entropy is measured based on the extracted contour based features to effectively distinguishing the data in the images. Finally, the convolution neural network (CNN) is used for the effective sclera, iris and pupil parts segmentations based on the entropy measure. The proposed results are analyzed to demonstrate the better performance of the proposed segmentation method than the existing methods.
    corecore