Image objects detection based on boosting neural network

Author: Hegt J.A.
Liang N.
Mladenov V.M.
Publication venue: Institute of Electrical and Electronics Engineers
Publication date: 01/01/2010

This paper discusses the problem of object area detection in video frames. The goal is to design a pixel-accurate detector for grass, which could be used for object-adaptive video enhancement. A boosting neural network is used to create such a detector. The resulting detector uses both textural and color features of the frames.

Repository TU/e

Crossref

Pure OAI Repository

Evaluation of the color image and video processing chain and visual quality management for consumer systems

Author: Sarkar Abhijit
Publication venue: RIT Scholar Works
Publication date: 01/05/2008

With the advent of novel digital display technologies, color processing is increasingly becoming a key aspect in consumer video applications. Today’s state-of-the-art displays require sophisticated color and image reproduction techniques in order to achieve larger screen size, higher luminance and higher resolution than ever before. However, from a color science perspective, there are clearly opportunities for improvement in the color reproduction capabilities of various emerging and conventional display technologies. This research seeks to identify potential areas for improvement in color processing in a video processing chain. As part of this research, various processes involved in a typical video processing chain in consumer video applications were reviewed. Several published color and contrast enhancement algorithms were evaluated, and a novel algorithm was developed to enhance color and contrast in images and videos in an effective and coordinated manner. Further, a psychophysical technique was developed and implemented for performing visual evaluation of color image and consumer video quality. Based on the performance analysis and visual experiments involving various algorithms, guidelines were proposed for the development of an effective color and contrast enhancement method for images and video applications. It is hoped that the knowledge gained from this research will help build a better understanding of color processing and color quality management methods in consumer video.

RIT Scholar Works

Annotated Bibliography of Techniques for Image Enhancement and Interpretation in Remote Sensing

Author: Baumgardner Jr., Robert W.
Finley Robert J.
Publication date: 01/01/1979

The purpose of this annotated bibliography is to provide the user of the Remote Sensing Information Subsystem (RSIS) with brief descriptions of recent research techniques of image enhancement and their applications to specific image interpretation problems. Table 2 of the May 1979 ASVT/RSIS Technical Report entitled "Functional Design Narrative Descriptions" listed digital image processing requirements of the RSIS. The references in this bibliography were chosen because they describe these processing requirements. The format of that table was modified slightly and used as the outline for Section One of this bibliography. The bibliography is not intended to be an exhaustive compilation of all pertinent articles. Such a collection would be outdated as soon as it was printed. It does, however, contain a broad sampling of the recent remote sensing literature. We tried not to include multiple references to the same technique, but some repetition was necessary in order to fully describe some procedures of image enhancement and interpretation. Bureau of Economic Geolog

Texas ScholarWorks

Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting

Author: Cui Hejie
Fang Xinyu
Kan Xuan
Li Manling
Liu Xin
Song Yangqiu
Xu Ran
Yang Carl
Yu Yue
Zhang Zihan
Publication date: 28/10/2023

Images contain rich relational knowledge that can help machines understand the world. Existing methods for visual knowledge extraction often rely on a pre-defined format (e.g., sub-verb-obj tuples) or vocabulary (e.g., relation types), restricting the expressiveness of the extracted knowledge. In this work, we make a first exploration of a new paradigm of open visual knowledge extraction. To achieve this, we present OpenVik, which consists of an open relational region detector to detect regions potentially containing relational knowledge and a visual knowledge generator that generates format-free knowledge by prompting the large multimodality model with the detected region of interest. We also explore two data enhancement techniques for diversifying the generated format-free visual knowledge. Extensive knowledge quality evaluations highlight the correctness and uniqueness of the open visual knowledge extracted by OpenVik. Moreover, integrating our extracted knowledge across various visual reasoning applications shows consistent improvements, indicating the real-world applicability of OpenVik. Comment: Accepted to NeurIPS 202

arXiv.org e-Print Archive

Semantic interpretation of events in lifelogging

Author: Wang Peng
Publication venue: Dublin City University. CLARITY: The Centre for Sensor Web Technologies
Publication date: 01/03/2012

The topic of this thesis is lifelogging, the automatic, passive recording of a person’s daily activities, and in particular the semantic analysis and enrichment of lifelogged data. Our work centers on visual lifelogged data, such as that taken from wearable cameras. Such wearable cameras generate an archive of a person’s day taken from a first-person viewpoint, but one of the problems with this is the sheer volume of information that can be generated. In order to make this potentially very large volume of information more manageable, our analysis of this data is based on segmenting each day’s lifelog data into discrete and non-overlapping events corresponding to activities in the wearer’s day. To manage lifelog data at an event level, we define a set of concepts using an ontology which is appropriate to the wearer, applying automatic detection of concepts to these events and then semantically enriching each of the detected lifelog events, making them an index into the events. Once this enrichment is complete we can use the lifelog to support semantic search for everyday media management, as a memory aid, or as part of medical analysis on the activities of daily living (ADL), and so on. In the thesis, we address the problem of how to select the concepts to be used for indexing events and we propose a semantic, density-based algorithm to cope with concept selection issues for lifelogging. We then apply activity detection to classify everyday activities by employing the selected concepts as high-level semantic features. Finally, the activity is modeled by multi-context representations and enriched by Semantic Web technologies. The thesis includes an experimental evaluation using real data from users and shows the performance of our algorithms in capturing the semantics of everyday concepts and their efficacy in activity recognition and semantic enrichment.

Irish Universities

DCU Online Research Access Service

Seismotectonic, structural, volcanologic, and geomorphic study of New Zealand; indigenous forest assessment in New Zealand; mapping, land use, and environmental studies in New Zealand, volume 2

Author: Mcgreevy M. G.
Probine M. C.
Stirling I. F.
Suggate R. P.

The author has identified the following significant results. Ship detection via LANDSAT MSS data was demonstrated. In addition, information on ship size, orientation, and movement was obtained. Band 7 was used for the initial detection, followed by confirmation on other MSS bands. Under low turbidity, as experienced in open seas, the detection of ships 100 m long was verified and detection of ships down to 30 m length theorized. High turbidity and sea state inhibit ship detection by decreasing S/N ratios. The effect of local slope angle and orientation on the radiance from snow was also studied. Higher radiance values and even overloading in three bands were recorded for the sun-facing slope. Local hot spots from solar reflection appear at several locations along transect D-C in Six Mile Creek Basin during September 1976.

NASA Technical Reports Server

Skylab/EREP application to ecological, geological, and oceanographic investigations of Delaware Bay

Author: Bartlett D. S.
Klemas V.
Philpot W. D.
Reed L. E.
Rogers R. H.

Skylab/EREP S190A and S190B film products were optically enhanced and visually interpreted to extract data suitable for: (1) mapping coastal land use; (2) inventorying wetlands vegetation; (3) monitoring tidal conditions; (4) observing suspended sediment patterns; (5) charting surface currents; (6) locating coastal fronts and water mass boundaries; (7) monitoring industrial and municipal waste dumps in the ocean; (8) determining the size and flow direction of river, bay and man-made discharge plumes; and (9) observing ship traffic. Film products were visually analyzed to identify and map ten land-use and vegetation categories at a scale of 1:125,000. Digital tapes from the multispectral scanner were used to prepare thematic maps of land use. Classification accuracies obtained by comparison of derived thematic maps of land use with USGS-CARETS land-use maps in southern Delaware ranged from 44 percent to 100 percent.

NASA Technical Reports Server

Study of spacecraft direct readout meteorological systems

Author: Bartlett R.
Elam W.
Hoedemaker R.

Characteristics of the next-generation direct readout meteorological satellite system are defined, with particular application to Tiros N. Both space and ground systems are included. The recommended space system is composed of four geosynchronous satellites and two low altitude satellites in sun-synchronous orbit. The geosynchronous satellites transmit to direct readout ground stations via a shared S-band link, relayed FOFAX satellite cloud cover pictures (visible and infrared) and weather charts (WEFAX). Basic sensor data is transmitted to regional Data Utilization Stations via the same S-band link. Basic sensor data consists of 0.5 n.m. sub-point resolution data in the 0.55 - 0.7 micron spectral region, and 4.0 n.m. resolution data in the 10.5 - 12.6 micron spectral region. The two low altitude satellites in sun-synchronous orbit provide data to direct readout ground stations via a 137 MHz link, a 400 MHz link, and an S-band link.

NASA Technical Reports Server