4,866 research outputs found

    Image objects detection based on boosting neural network

    Get PDF
    This paper discusses the problem of object area detection of video frames. The goal is to design a pixel accurate detector for grass, which could be used for object adaptive video enhancement. A boosting neural network is used for creating such a detector. The resulted detector uses both textural features and color features of the frames

    Evaluation of the color image and video processing chain and visual quality management for consumer systems

    Get PDF
    With the advent of novel digital display technologies, color processing is increasingly becoming a key aspect in consumer video applications. Today’s state-of-the-art displays require sophisticated color and image reproduction techniques in order to achieve larger screen size, higher luminance and higher resolution than ever before. However, from color science perspective, there are clearly opportunities for improvement in the color reproduction capabilities of various emerging and conventional display technologies. This research seeks to identify potential areas for improvement in color processing in a video processing chain. As part of this research, various processes involved in a typical video processing chain in consumer video applications were reviewed. Several published color and contrast enhancement algorithms were evaluated, and a novel algorithm was developed to enhance color and contrast in images and videos in an effective and coordinated manner. Further, a psychophysical technique was developed and implemented for performing visual evaluation of color image and consumer video quality. Based on the performance analysis and visual experiments involving various algorithms, guidelines were proposed for the development of an effective color and contrast enhancement method for images and video applications. It is hoped that the knowledge gained from this research will help build a better understanding of color processing and color quality management methods in consumer video

    Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting

    Full text link
    Images contain rich relational knowledge that can help machines understand the world. Existing methods on visual knowledge extraction often rely on the pre-defined format (e.g., sub-verb-obj tuples) or vocabulary (e.g., relation types), restricting the expressiveness of the extracted knowledge. In this work, we take a first exploration to a new paradigm of open visual knowledge extraction. To achieve this, we present OpenVik which consists of an open relational region detector to detect regions potentially containing relational knowledge and a visual knowledge generator that generates format-free knowledge by prompting the large multimodality model with the detected region of interest. We also explore two data enhancement techniques for diversifying the generated format-free visual knowledge. Extensive knowledge quality evaluations highlight the correctness and uniqueness of the extracted open visual knowledge by OpenVik. Moreover, integrating our extracted knowledge across various visual reasoning applications shows consistent improvements, indicating the real-world applicability of OpenVik.Comment: Accepted to NeurIPS 202

    Semantic interpretation of events in lifelogging

    Get PDF
    The topic of this thesis is lifelogging, the automatic, passive recording of a person’s daily activities and in particular, on performing a semantic analysis and enrichment of lifelogged data. Our work centers on visual lifelogged data, such as taken from wearable cameras. Such wearable cameras generate an archive of a person’s day taken from a first-person viewpoint but one of the problems with this is the sheer volume of information that can be generated. In order to make this potentially very large volume of information more manageable, our analysis of this data is based on segmenting each day’s lifelog data into discrete and non-overlapping events corresponding to activities in the wearer’s day. To manage lifelog data at an event level, we define a set of concepts using an ontology which is appropriate to the wearer, applying automatic detection of concepts to these events and then semantically enriching each of the detected lifelog events making them an index into the events. Once this enrichment is complete we can use the lifelog to support semantic search for everyday media management, as a memory aid, or as part of medical analysis on the activities of daily living (ADL), and so on. In the thesis, we address the problem of how to select the concepts to be used for indexing events and we propose a semantic, density- based algorithm to cope with concept selection issues for lifelogging. We then apply activity detection to classify everyday activities by employing the selected concepts as high-level semantic features. Finally, the activity is modeled by multi-context representations and enriched by Semantic Web technologies. The thesis includes an experimental evaluation using real data from users and shows the performance of our algorithms in capturing the semantics of everyday concepts and their efficacy in activity recognition and semantic enrichment

    Seismotectonic, structural, volcanologic, and geomorphic study of New Zealand; indigenous forest assessment in New Zealand; mapping, land use, and environmental studies in New Zealand, volume 2

    Get PDF
    The author has identified the following significant results. Ship detection via LANDSAT MSS data was demonstrated. In addition, information on ship size, orientation, and movement was obtained. Band 7 was used for the initial detection followed by confirmation on other MSS bands. Under low turbidity, as experienced in open seas, the detection of ships 100 m long was verified and detection of ships down to 30 m length theorized. High turbidity and sea state inhibit ship detection by decreasing S/N ratios. The radiance effect from snow of local slope angles and orientation was also studied. Higher radiance values and even overloading in three bands were recorded for the sun-facing slope. Local hot spots from solar reflection appear at several locations along transect D-C in Six Mile Creek Basin during September 1976

    Skylab/EREP application to ecological, geological, and oceanographic investigations of Delaware Bay

    Get PDF
    Skylab/EREP S190A and S190B film products were optically enhanced and visually interpreted to extract data suitable for; (1) mapping coastal land use; (2) inventorying wetlands vegetation; (3) monitoring tidal conditions; (4) observing suspended sediment patterns; (5) charting surface currents; (6) locating coastal fronts and water mass boundaries; (7) monitoring industrial and municipal waste dumps in the ocean; (8) determining the size and flow direction of river, bay and man-made discharge plumes; and (9) observing ship traffic. Film products were visually analyzed to identify and map ten land-use and vegetation categories at a scale of 1:125,000. Digital tapes from the multispectral scanner were used to prepare thematic maps of land use. Classification accuracies obtained by comparison of derived thematic maps of land-use with USGS-CARETS land-use maps in southern Delaware ranged from 44 percent to 100 percent

    Study of spacecraft direct readout meteorological systems

    Get PDF
    Characteristics are defined of the next generation direct readout meteorological satellite system with particular application to Tiros N. Both space and ground systems are included. The recommended space system is composed of four geosynchronous satellites and two low altitude satellites in sun-synchronous orbit. The goesynchronous satellites transmit to direct readout ground stations via a shared S-band link, relayed FOFAX satellite cloud cover pictures (visible and infrared) and weather charts (WEFAX). Basic sensor data is transmitted to regional Data Utilization Stations via the same S-band link. Basic sensor data consists of 0.5 n.m. sub-point resolution data in the 0.55 - 0.7 micron spectral region, and 4.0 n.m. resolution data in the 10.5 - 12.6 micron spectral region. The two low altitude satellites in sun-synchronous orbit provide data to direct readout ground stations via a 137 MHz link, a 400 Mhz link, and an S-band link
    corecore