
    Hierarchical superpixel-to-pixel dense image matching

    In this paper, we propose a novel matching method that automatically establishes dense correspondences between two images in a hierarchical superpixel-to-pixel (HSP2P) manner. Our method first estimates dense superpixel pairings between the two images at the coarse-grained level to overcome large patch displacements, and then uses the superpixel-level pairings to drive the matching at the pixel level to recover fine texture details. To compensate for the influence of color and illumination variations, we apply a regularization technique that rectifies the images with a color transfer function. Experimental validation on benchmark datasets demonstrates that our approach achieves better visual quality than state-of-the-art dense matching algorithms.
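The color-rectification step can be illustrated with a simple global color-transfer function. The sketch below is an assumption for illustration only, since the abstract does not specify the exact transfer model; it matches the per-channel mean and standard deviation of one image to the other, a common minimal form of color transfer:

```python
import numpy as np

def color_transfer(source, target):
    """Match the per-channel mean and standard deviation of `source`
    to those of `target` (a simple global color-transfer function).
    Both inputs are float arrays of shape (H, W, 3) in [0, 255]."""
    src = source.astype(np.float64)
    tgt = target.astype(np.float64)
    out = np.empty_like(src)
    for c in range(3):
        s_mu, s_sigma = src[..., c].mean(), src[..., c].std()
        t_mu, t_sigma = tgt[..., c].mean(), tgt[..., c].std()
        scale = t_sigma / s_sigma if s_sigma > 0 else 1.0
        # shift and rescale so channel statistics match the target
        out[..., c] = (src[..., c] - s_mu) * scale + t_mu
    return np.clip(out, 0.0, 255.0)
```

After such a rectification, corresponding patches in the two images have comparable brightness statistics, which makes appearance-based matching costs more reliable.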

    Digital Stack Photography and Its Applications

    This work centers on digital stack photography and its applications. A stack of images refers, in a broader sense, to an ensemble of associated images taken with variation in one or more parameters of the system configuration or setting. An image stack captures and contains potentially more information than any of its constituent images. Digital stack photography (DST) techniques exploit this rich information to render a synthesized image that oversteps the limitations of a digital camera's capabilities.

    This work considers in particular two basic DST problems, both previously challenging, and their applications. One is high-dynamic-range (HDR) imaging of non-stationary dynamic scenes, in which the stacked images vary in exposure conditions. The other is large-scale panorama composition from multiple images. In this case, the image components are related to each other by the spatial relation among the subdomains of the same scene that they jointly cover and capture. We consider the non-conventional, practical, and challenging situations where the spatial overlap among the sub-images is sparse (S), irregular in geometry and imprecise with respect to the designed geometry (I), and the captured data over the overlap zones are noisy (N) or lacking in features. We refer to these conditions simply as the S.I.N. conditions.

    There are common challenging issues in both problems. For example, both face the dominant problem of image alignment for seamless and artifact-free image composition. Our solutions to the common problems are manifested differently in each particular problem, as a result of adaptation to the specific properties of each type of image ensemble.

    For the exposure stack, existing alignment approaches struggled to overcome three main challenges: inconsistency in brightness, large displacement in dynamic scenes, and pixel saturation. We exploit solutions in the following three aspects. First, we introduce a model that addresses and admits changes in both geometric configuration and optical conditions, while following the traditional optical flow description. Previous models treated these two types of changes as mutually exclusive, addressing one or the other. Next, we extend the pixel-based optical flow model to a patch-based model. The advantages are two-fold: a patch has texture and local content that individual pixels fail to present, and it also creates opportunities for faster processing, such as via two-scale or multi-scale processing. The extended model is then solved efficiently with an EM-like algorithm, which is reliable in the presence of large displacement. Third, we present a generative model for reducing or eliminating the typical artifacts that arise as a side effect of inadequate alignment of clipped pixels. A patch-based texture synthesis is combined with the patch-based alignment to achieve an artifact-free result.

    For large-scale panorama composition under the S.I.N. conditions, we have developed an effective solution scheme that significantly reduces both processing time and artifacts. Previously existing approaches can be roughly categorized as either geometry-based or feature-based composition. The former relies on precise knowledge of the system geometry, by design and/or calibration. It works well with a far-away scene, in which case there is only limited variation in projective geometry among the sub-images. However, the system geometry is not invariant to physical conditions such as thermal and stress variation. Composition with this approach is typically done in the spatial domain. The other approach is more robust to geometric and optical conditions. It works surprisingly well with feature-rich and stationary scenes, but not in the absence of recognizable features. Composition based on feature matching is typically done in the spatial-gradient domain. In short, both approaches are challenged by the S.I.N. conditions. With certain snapshot data sets obtained and contributed by Brady et al., these methods either fail in composition or render images with visually disturbing artifacts. To overcome the S.I.N. conditions, we have reconciled the two approaches, making successful and complementary use of both the prior, approximate information about the geometric system configuration and the feature information in the image data. We also designed and developed a software architecture with careful extraction of primitive function modules that can be efficiently implemented and executed in parallel. In addition to a much faster processing speed, the resulting images are clearer and sharper in the overlapping zones, without typical ghosting artifacts.
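As background for the exposure-stack problem, the sketch below shows the standard HDR merging step that such pipelines end with, not the dissertation's patch-based EM alignment, which it presupposes. It assumes a linear camera response and known exposure times, both assumptions of this illustration:

```python
import numpy as np

def merge_exposures(images, times):
    """Merge a linear exposure stack into an HDR radiance map.
    `images` is a list of float arrays in [0, 1] (assumed linear
    sensor response); `times` are the exposure durations.
    Each pixel's radiance estimates (value / time) are averaged with
    a hat weight that de-emphasises under- and over-exposed pixels."""
    num = np.zeros_like(images[0], dtype=np.float64)
    den = np.zeros_like(images[0], dtype=np.float64)
    for img, t in zip(images, times):
        w = 1.0 - np.abs(2.0 * img - 1.0)  # hat function: 1 at mid-grey, 0 at clip
        num += w * img / t
        den += w
    return num / np.maximum(den, 1e-8)
```

The weighting explains why alignment matters: a misaligned pixel contributes a radiance estimate from the wrong scene point, which the average cannot undo, hence the ghosting artifacts the dissertation targets.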

    The state of the art in HDR image deghosting and an objective deghosting quality metric for HDR images

    Despite the emergence of new HDR acquisition methods, the multiple exposure technique (MET) is still the most popular one. Applying MET to dynamic scenes is a challenging task due to the diversity of motion patterns and uncontrollable factors such as sensor noise, scene occlusion, and performance concerns on platforms with limited computational capability. Currently, more than 50 deghosting algorithms have already been proposed for artifact-free HDR imaging of dynamic scenes, and this number is expected to grow. Given the large number of algorithms, conducting subjective experiments to benchmark newly proposed algorithms is difficult and time-consuming. In this thesis, first, a taxonomy of HDR deghosting methods and the key characteristics of each group of algorithms are introduced. Next, the potential artifacts frequently observed in the outputs of HDR deghosting algorithms are defined, and an objective HDR image deghosting quality metric is presented. The proposed metric is found to correlate well with human preferences, and it may be used as a reference for benchmarking current and future HDR image deghosting algorithms.
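The thesis does not reproduce its metric here, so the sketch below is purely a hypothetical illustration of what an objective deghosting score can look like, not the proposed metric: it re-exposes the deghosted radiance map with each input exposure time and penalizes disagreement with the captured frames over well-exposed pixels.

```python
import numpy as np

def ghosting_score(hdr, images, times):
    """Crude objective deghosting score (illustrative only, NOT the
    thesis metric): re-expose the deghosted HDR radiance map with each
    input exposure time and measure its mean absolute disagreement
    with the corresponding captured image over well-exposed pixels.
    Lower is better; inputs are linear floats in [0, 1]."""
    total, count = 0.0, 0
    for img, t in zip(images, times):
        pred = np.clip(hdr * t, 0.0, 1.0)    # simulated exposure
        valid = (img > 0.05) & (img < 0.95)  # ignore clipped pixels
        total += np.abs(pred - img)[valid].sum()
        count += int(valid.sum())
    return total / max(count, 1)
```

A score like this only captures residual inconsistency; the thesis metric also targets specific artifact classes, which is what makes it correlate with human preference.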

    Variational image fusion

    The main goal of this work is the fusion of multiple images to a single composite that offers more information than the individual input images. We approach those fusion tasks within a variational framework. First, we present iterative schemes that are well-suited for such variational problems and related tasks. They lead to efficient algorithms that are simple to implement and well-parallelisable. Next, we design a general fusion technique that aims for an image with optimal local contrast. This is the key for a versatile method that performs well in many application areas such as multispectral imaging, decolourisation, and exposure fusion. To handle motion within an exposure set, we present the following two-step approach: First, we introduce the complete rank transform to design an optic flow approach that is robust against severe illumination changes. Second, we eliminate remaining misalignments by means of brightness transfer functions that relate the brightness values between frames. Additional knowledge about the exposure set enables us to propose the first fully coupled method that jointly computes an aligned high dynamic range image and dense displacement fields. Finally, we present a technique that infers depth information from differently focused images. In this context, we additionally introduce a novel second order regulariser that adapts to the image structure in an anisotropic way.
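The idea of weighting inputs by local contrast can be sketched in a much-simplified, single-scale form. This is a crude stand-in for the variational model described above, and the specific weighting terms (Laplacian magnitude, a Gaussian well-exposedness term) are illustrative assumptions:

```python
import numpy as np

def fuse_exposures(images, eps=1e-8):
    """Single-scale contrast-driven exposure fusion sketch: each pixel
    of the result is a weighted average of the inputs, weighted by
    local contrast (Laplacian magnitude) and well-exposedness.
    Inputs are float (H, W, 3) arrays in [0, 1]."""
    weights = []
    for img in images:
        gray = img.mean(axis=2)
        # discrete Laplacian as a local-contrast measure
        lap = np.abs(
            -4 * gray
            + np.roll(gray, 1, 0) + np.roll(gray, -1, 0)
            + np.roll(gray, 1, 1) + np.roll(gray, -1, 1)
        )
        # favour pixels near mid-grey (neither under- nor over-exposed)
        well_exposed = np.exp(-0.5 * ((gray - 0.5) / 0.2) ** 2)
        weights.append(lap * well_exposed + eps)
    w = np.stack(weights)
    w /= w.sum(axis=0, keepdims=True)  # normalise weights per pixel
    return sum(wi[..., None] * img for wi, img in zip(w, images))
```

A variational formulation instead optimises contrast globally with regularisation, which avoids the halo and seam artifacts that per-pixel weighting like this can produce.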


    Sensitive-video analysis

    Advisors: Anderson de Rezende Rocha, Siome Klein Goldenstein. Doctoral thesis, Universidade Estadual de Campinas, Instituto de Computação.
    Abstract: Sensitive video can be defined as any motion picture that may pose threats to its audience. Typical representatives include, but are not limited to, pornography, violence, child abuse, cruelty to animals, etc.
    Nowadays, with the ever more pervasive role of digital data in our lives, sensitive-content analysis represents a major concern to law enforcers, companies, tutors, and parents, due to the potential harm of such contents over minors, students, workers, etc. Notwithstanding, the employment of human mediators for constantly analyzing huge troves of sensitive data often leads to stress and trauma, justifying the search for computer-aided analysis. In this work, we tackle this problem in two ways. In the first one, we aim at deciding whether or not a video stream presents sensitive content, which we refer to as sensitive-video classification. In the second one, we aim at finding the exact moments a stream starts and ends displaying sensitive content, at frame level, which we refer to as sensitive-content localization. For both cases, we aim at designing and developing effective and efficient methods, with low memory footprint and suitable for deployment on mobile devices. In this vein, we provide four major contributions. The first one is a novel Bag-of-Visual-Words-based pipeline for efficient time-aware sensitive-video classification. The second is a novel high-level multimodal fusion pipeline for sensitive-content localization. The third, in turn, is a novel spatio-temporal video interest point detector and video content descriptor. Finally, the fourth contribution comprises a frame-level annotated 140-hour pornographic video dataset, which is the first one in the literature that is appropriate for pornography localization. An important aspect of the first three contributions is their generalization nature, in the sense that they can be employed, without step modifications, to the detection of diverse sensitive content types, such as the previously mentioned ones. For validation, we choose pornography and violence (two of the commonest types of inappropriate material) as target representatives of sensitive content.
    We therefore perform classification and localization experiments, and report results for both types of content. The proposed solutions present an accuracy of 93% in pornography classification, and allow the correct localization of 91% of pornographic content within a video stream. The results for violence are also compelling: with the proposed approaches, we reached second place in an international competition on violent scene detection. Putting both in perspective, we learned that pornography detection is easier than its violence counterpart, opening several opportunities for additional investigations by the research community. The main reason for such difference is related to the distinct levels of subjectivity that are inherent to each concept. While pornography is usually more explicit, violence presents a broader spectrum of possible manifestations.
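The Bag-of-Visual-Words encoding underlying the first contribution can be sketched as follows. This shows the generic technique only, not the authors' time-aware pipeline; the visual vocabulary is assumed to be precomputed (e.g. by k-means over training descriptors):

```python
import numpy as np

def bovw_histogram(descriptors, vocabulary):
    """Bag-of-Visual-Words encoding sketch: assign each local
    descriptor to its nearest visual word and return the normalised
    word-count histogram that summarises the whole video.
    `descriptors` is (N, D), `vocabulary` is (K, D)."""
    # squared distances between every descriptor and every visual word
    d2 = ((descriptors[:, None, :] - vocabulary[None, :, :]) ** 2).sum(-1)
    words = d2.argmin(axis=1)                  # hard assignment
    hist = np.bincount(words, minlength=len(vocabulary)).astype(float)
    return hist / max(hist.sum(), 1.0)         # normalise to sum to 1
```

The fixed-length histogram is what makes videos of arbitrary duration comparable, so a standard classifier (e.g. a linear SVM) can then separate sensitive from non-sensitive content.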