Image objects detection based on boosting neural network
This paper discusses the problem of detecting object areas in video frames. The goal is to design a pixel-accurate detector for grass that could be used for object-adaptive video enhancement. A boosting neural network is used to create such a detector. The resulting detector uses both textural and color features of the frames.
Evaluation of the color image and video processing chain and visual quality management for consumer systems
With the advent of novel digital display technologies, color processing is increasingly becoming a key aspect of consumer video applications. Today’s state-of-the-art displays require sophisticated color and image reproduction techniques in order to achieve larger screen size, higher luminance and higher resolution than ever before. However, from a color science perspective, there are clearly opportunities for improvement in the color reproduction capabilities of various emerging and conventional display technologies. This research seeks to identify potential areas for improvement in color processing in a video processing chain. As part of this research, the processes involved in a typical video processing chain in consumer video applications were reviewed. Several published color and contrast enhancement algorithms were evaluated, and a novel algorithm was developed to enhance color and contrast in images and videos in an effective and coordinated manner. Further, a psychophysical technique was developed and implemented for the visual evaluation of color image and consumer video quality. Based on the performance analysis and visual experiments involving various algorithms, guidelines were proposed for the development of an effective color and contrast enhancement method for image and video applications. It is hoped that the knowledge gained from this research will help build a better understanding of color processing and color quality management methods in consumer video.
Annotated Bibliography of Techniques for Image Enhancement and Interpretation in Remote Sensing
The purpose of this annotated bibliography is to provide the user of the Remote Sensing Information Subsystem (RSIS) with brief descriptions of recent research techniques of image enhancement and their applications to specific image interpretation problems. Table 2 of the May 1979 ASVT/RSIS Technical Report entitled "Functional Design Narrative Descriptions" listed digital image processing requirements of the RSIS. The references in this bibliography were chosen because they describe these processing requirements. The format of that table was modified slightly and used as the outline for Section One of this bibliography.
The bibliography is not intended to be an exhaustive compilation of all pertinent articles. Such a collection would be outdated as soon as it was printed. It does, however, contain a broad sampling of the recent remote sensing literature. We tried not to include multiple references to the same technique, but some repetition was necessary in order to fully describe some procedures of image enhancement and interpretation.
Bureau of Economic Geology
Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting
Images contain rich relational knowledge that can help machines understand
the world. Existing methods on visual knowledge extraction often rely on the
pre-defined format (e.g., sub-verb-obj tuples) or vocabulary (e.g., relation
types), restricting the expressiveness of the extracted knowledge. In this
work, we take a first step toward a new paradigm of open visual knowledge
extraction. To achieve this, we present OpenVik, which consists of an open
relational region detector to detect regions potentially containing relational
knowledge and a visual knowledge generator that generates format-free knowledge
by prompting the large multimodality model with the detected region of
interest. We also explore two data enhancement techniques for diversifying the
generated format-free visual knowledge. Extensive knowledge quality evaluations
highlight the correctness and uniqueness of the open visual knowledge extracted
by OpenVik. Moreover, integrating our extracted knowledge across various visual
reasoning applications shows consistent improvements, indicating the real-world
applicability of OpenVik.
Comment: Accepted to NeurIPS 202
Semantic interpretation of events in lifelogging
The topic of this thesis is lifelogging, the automatic, passive recording of a person’s daily activities, and in particular the semantic analysis and enrichment of lifelogged data. Our work centers on visual lifelogged data, such as that taken from wearable cameras. Such cameras generate an archive of a person’s day from a first-person viewpoint, but one problem with this is the sheer volume of information that can be generated. To make this potentially very large volume of information more manageable, our analysis is based on segmenting each day’s lifelog data into discrete, non-overlapping events corresponding to activities in the wearer’s day. To manage lifelog data at the event level, we define a set of concepts using an ontology appropriate to the wearer, apply automatic concept detection to these events, and then semantically enrich each detected lifelog event so that the detected concepts serve as an index into the events. Once this enrichment is complete, the lifelog can be used to support semantic search for everyday media management, as a memory aid, or as part of medical analysis of the activities of daily living (ADL), and so on. In the thesis, we address the problem of how to select the concepts used for indexing events, and we propose a semantic, density-based algorithm to cope with concept selection issues in lifelogging. We then apply activity detection to classify everyday activities, employing the selected concepts as high-level semantic features. Finally, the activity is modeled by multi-context representations and enriched using Semantic Web technologies. The thesis includes an experimental evaluation using real data from users and shows the performance of our algorithms in capturing the semantics of everyday concepts and their efficacy in activity recognition and semantic enrichment.
Seismotectonic, structural, volcanologic, and geomorphic study of New Zealand; indigenous forest assessment in New Zealand; mapping, land use, and environmental studies in New Zealand, volume 2
The author has identified the following significant results. Ship detection via LANDSAT MSS data was demonstrated. In addition, information on ship size, orientation, and movement was obtained. Band 7 was used for the initial detection, followed by confirmation on other MSS bands. Under low turbidity, as experienced in open seas, the detection of ships 100 m long was verified, and detection of ships down to 30 m in length was theorized. High turbidity and sea state inhibit ship detection by decreasing S/N ratios. The effect of local slope angle and orientation on radiance from snow was also studied. Higher radiance values, and even overloading in three bands, were recorded for the sun-facing slope. Local hot spots from solar reflection appeared at several locations along transect D-C in Six Mile Creek Basin during September 1976.
Skylab/EREP application to ecological, geological, and oceanographic investigations of Delaware Bay
Skylab/EREP S190A and S190B film products were optically enhanced and visually interpreted to extract data suitable for: (1) mapping coastal land use; (2) inventorying wetlands vegetation; (3) monitoring tidal conditions; (4) observing suspended sediment patterns; (5) charting surface currents; (6) locating coastal fronts and water mass boundaries; (7) monitoring industrial and municipal waste dumps in the ocean; (8) determining the size and flow direction of river, bay, and man-made discharge plumes; and (9) observing ship traffic. Film products were visually analyzed to identify and map ten land-use and vegetation categories at a scale of 1:125,000. Digital tapes from the multispectral scanner were used to prepare thematic maps of land use. Classification accuracies, obtained by comparing the derived thematic land-use maps with USGS-CARETS land-use maps in southern Delaware, ranged from 44 percent to 100 percent.
Study of spacecraft direct readout meteorological systems
Characteristics of the next-generation direct readout meteorological satellite system are defined, with particular application to Tiros N. Both space and ground systems are included. The recommended space system is composed of four geosynchronous satellites and two low altitude satellites in sun-synchronous orbit. The geosynchronous satellites transmit to direct readout ground stations, via a shared S-band link, relayed FOFAX satellite cloud cover pictures (visible and infrared) and weather charts (WEFAX). Basic sensor data is transmitted to regional Data Utilization Stations via the same S-band link. Basic sensor data consists of 0.5 n.m. sub-point resolution data in the 0.55 - 0.7 micron spectral region, and 4.0 n.m. resolution data in the 10.5 - 12.6 micron spectral region. The two low altitude satellites in sun-synchronous orbit provide data to direct readout ground stations via a 137 MHz link, a 400 MHz link, and an S-band link.