3,870 research outputs found

    Digital Image Access & Retrieval

    Get PDF
    The 33th Annual Clinic on Library Applications of Data Processing, held at the University of Illinois at Urbana-Champaign in March of 1996, addressed the theme of "Digital Image Access & Retrieval." The papers from this conference cover a wide range of topics concerning digital imaging technology for visual resource collections. Papers covered three general areas: (1) systems, planning, and implementation; (2) automatic and semi-automatic indexing; and (3) preservation with the bulk of the conference focusing on indexing and retrieval.published or submitted for publicatio

    A Decision Support System For The Intelligence Satellite Analyst

    Get PDF
    The study developed a decision support system known as Visual Analytic Cognitive Model (VACOM) to support the Intelligence Analyst (IA) in satellite information processing task within a Geospatial Intelligence (GEOINT) domain. As a visual analytics, VACOM contains the image processing algorithms, a cognitive network of the IA mental model, and a Bayesian belief model for satellite information processing. A cognitive analysis tool helps to identify eight knowledge levels in a satellite information processing. These are, spatial, prototypical, contextual, temporal, semantic, pragmatic, intentional, and inferential knowledge levels, respectively. A cognitive network was developed for each knowledge level with data input from the subjective questionnaires that probed the analysts’ mental model. VACOM interface was designed to allow the analysts have a transparent view of the processes, including, visualization model, and signal processing model applied to the images, geospatial data representation, and the cognitive network of expert beliefs. VACOM interface allows the user to select a satellite image of interest, select each of the image analysis methods for visualization, and compare ‘ground-truth’ information against the recommendation of VACOM. The interface was designed to enhance perception, cognition, and even comprehension to the multi and complex image analyses by the analysts. A usability analysis on VACOM showed many advantages for the human analysts. These include, reduction in cognitive workload as a result of less information search, the IA can conduct an interactive experiment on each of his/her belief space and guesses, and selection of best image processing algorithms to apply to an image context

    Computer animation data management: Review of evolution phases and emerging issues

    Get PDF
    The computer animation industry has been booming and prospering in recent thirty years. One of the significant changes faced by this industry is the evolution of computer-animation data and, yet, extant literature has offered very little insights into the evolution process and management issues pertinent to computer-animation data. Hence, many questions have surfaced in the extant literature of computer-animation data management. For example, to what extent has the data content expanded in terms of quantity and quality? To what extent has the information technology used to store and process the data changed? To what extent have the user and the community groups diversified in terms of their nature and number? Knowledge pertaining to these issues can provide new research directions to academics and also insights to practitioners for more effective and innovative management of computer-animation data. This conceptual paper, therefore, takes the pioneering step to address these issues by proposing four factors prudent for examining the evolution phases associated with computer-animation data management: technology, content, users, and community. Next, this paper presents a conceptual framework illustrating the inter-dependent relationships between these four factors together with associated theoretical and managerial issues. This paper, albeit limited by its conceptual nature, advances the extant literature of computer animation, information system, and open-product model

    Clue: Cross-modal Coherence Modeling for Caption Generation

    Full text link
    We use coherence relations inspired by computational models of discourse to study the information needs and goals of image captioning. Using an annotation protocol specifically devised for capturing image--caption coherence relations, we annotate 10,000 instances from publicly-available image--caption pairs. We introduce a new task for learning inferences in imagery and text, coherence relation prediction, and show that these coherence annotations can be exploited to learn relation classifiers as an intermediary step, and also train coherence-aware, controllable image captioning models. The results show a dramatic improvement in the consistency and quality of the generated captions with respect to information needs specified via coherence relations.Comment: Accepted as a long paper to ACL 202

    CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines

    Get PDF
    Based on the information provided by European projects and national initiatives related to multimedia search as well as domains experts that participated in the CHORUS Think-thanks and workshops, this document reports on the state of the art related to multimedia content search from, a technical, and socio-economic perspective. The technical perspective includes an up to date view on content based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmark inititiatives to measure the performance of multimedia search engines. From a socio-economic perspective we inventorize the impact and legal consequences of these technical advances and point out future directions of research

    Deformable Prototypes for Encoding Shape Categories in Image Databases

    Full text link
    We describe a method for shape-based image database search that uses deformable prototypes to represent categories. Rather than directly comparing a candidate shape with all shape entries in the database, shapes are compared in terms of the types of nonrigid deformations (differences) that relate them to a small subset of representative prototypes. To solve the shape correspondence and alignment problem, we employ the technique of modal matching, an information-preserving shape decomposition for matching, describing, and comparing shapes despite sensor variations and nonrigid deformations. In modal matching, shape is decomposed into an ordered basis of orthogonal principal components. We demonstrate the utility of this approach for shape comparison in 2-D image databases.Office of Naval Research (Young Investigator Award N00014-06-1-0661

    Seeing the Intangible: Surveying Automatic High-Level Visual Understanding from Still Images

    Full text link
    The field of Computer Vision (CV) was born with the single grand goal of complete image understanding: providing a complete semantic interpretation of an input image. What exactly this goal entails is not immediately straightforward, but theoretical hierarchies of visual understanding point towards a top level of full semantics, within which sits the most complex and subjective information humans can detect from visual data. In particular, non-concrete concepts including emotions, social values and ideologies seem to be protagonists of this "high-level" visual semantic understanding. While such "abstract concepts" are critical tools for image management and retrieval, their automatic recognition is still a challenge, exactly because they rest at the top of the "semantic pyramid": the well-known semantic gap problem is worsened given their lack of unique perceptual referents, and their reliance on more unspecific features than concrete concepts. Given that there seems to be very scarce explicit work within CV on the task of abstract social concept (ASC) detection, and that many recent works seem to discuss similar non-concrete entities by using different terminology, in this survey we provide a systematic review of CV work that explicitly or implicitly approaches the problem of abstract (specifically social) concept detection from still images. Specifically, this survey performs and provides: (1) A study and clustering of high level visual understanding semantic elements from a multidisciplinary perspective (computer science, visual studies, and cognitive perspectives); (2) A study and clustering of high level visual understanding computer vision tasks dealing with the identified semantic elements, so as to identify current CV work that implicitly deals with AC detection
    • 

    corecore