Search CORE

7 research outputs found

Sample4Geo: Hard Negative Sampling For Cross-View Geo-Localisation

Author: Deuser Fabian
Habel Konrad
Oswald Norbert
Publication venue
Publication date: 29/08/2023
Field of study

Cross-View Geo-Localisation is still a challenging task where additional modules, specific pre-processing or zooming strategies are necessary to determine accurate positions of images. Since different views have different geometries, pre-processing like polar transformation helps to merge them. However, this results in distorted images which then have to be rectified. Adding hard negatives to the training batch could improve the overall performance but with the default loss functions in geo-localisation it is difficult to include them. In this article, we present a simplified but effective architecture based on contrastive learning with symmetric InfoNCE loss that outperforms current state-of-the-art results. Our framework consists of a narrow training pipeline that eliminates the need of using aggregation modules, avoids further pre-processing steps and even increases the generalisation capability of the model to unknown regions. We introduce two types of sampling strategies for hard negatives. The first explicitly exploits geographically neighboring locations to provide a good starting point. The second leverages the visual similarity between the image embeddings in order to mine hard negative samples. Our work shows excellent performance on common cross-view datasets like CVUSA, CVACT, University-1652 and VIGOR. A comparison between cross-area and same-area settings demonstrate the good generalisation capability of our model

arXiv.org e-Print Archive

Orientation-Guided Contrastive Learning for UAV-View Geo-Localisation

Author: Deuser Fabian
Habel Konrad
Oswald Norbert
Werner Martin
Publication venue
Publication date: 02/08/2023
Field of study

Retrieving relevant multimedia content is one of the main problems in a world that is increasingly data-driven. With the proliferation of drones, high quality aerial footage is now available to a wide audience for the first time. Integrating this footage into applications can enable GPS-less geo-localisation or location correction. In this paper, we present an orientation-guided training framework for UAV-view geo-localisation. Through hierarchical localisation orientations of the UAV images are estimated in relation to the satellite imagery. We propose a lightweight prediction module for these pseudo labels which predicts the orientation between the different views based on the contrastive learned embeddings. We experimentally demonstrate that this prediction supports the training and outperforms previous approaches. The extracted pseudo-labels also enable aligned rotation of the satellite image as augmentation to further strengthen the generalisation. During inference, we no longer need this orientation module, which means that no additional computations are required. We achieve state-of-the-art results on both the University-1652 and University-160k datasets

arXiv.org e-Print Archive

NeRFtrinsic Four: An End-To-End Trainable NeRF Jointly Optimizing Diverse Intrinsic and Extrinsic Camera Parameters

Author: Deuser Fabian
Egger Bernhard
Oswald Norbert
Roth Daniel
Schieber Hannah
Publication venue
Publication date: 26/10/2023
Field of study

Novel view synthesis using neural radiance fields (NeRF) is the state-of-the-art technique for generating high-quality images from novel viewpoints. Existing methods require a priori knowledge about extrinsic and intrinsic camera parameters. This limits their applicability to synthetic scenes, or real-world scenarios with the necessity of a preprocessing step. Current research on the joint optimization of camera parameters and NeRF focuses on refining noisy extrinsic camera parameters and often relies on the preprocessing of intrinsic camera parameters. Further approaches are limited to cover only one single camera intrinsic. To address these limitations, we propose a novel end-to-end trainable approach called NeRFtrinsic Four. We utilize Gaussian Fourier features to estimate extrinsic camera parameters and dynamically predict varying intrinsic camera parameters through the supervision of the projection error. Our approach outperforms existing joint optimization methods on LLFF and BLEFF. In addition to these existing datasets, we introduce a new dataset called iFF with varying intrinsic camera parameters. NeRFtrinsic Four is a step forward in joint optimization NeRF-based view synthesis and enables more realistic and flexible rendering in real-world scenarios with varying camera parameters

arXiv.org e-Print Archive

Kinetosis Analyzation of the Symptoms Occurrence in combination with Eye Tracking

Author: Deuser Fabian
Lecon Carsten
Schieber Hannah
Publication venue
Publication date: 01/01/2019
Field of study

OPUS - Hochschulschriftenserver der Hochschule Aalen

Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model

Author: Deuser Fabian
Habel Konrad
Oswald Norbert
Rösch Philipp J.
Publication venue
Publication date: 10/06/2022
Field of study

Current architectures for multi-modality tasks such as visual question answering suffer from their high complexity. As a result, these architectures are difficult to train and require high computational resources. To address these problems we present a CLIP-based architecture that does not require any fine-tuning of the feature extractors. A simple linear classifier is used on the concatenated features of the image and text encoder. During training an auxiliary loss is added which operates on the answer types. The resulting classification is then used as an attention gate on the answer class selection. On the VizWiz 2022 Visual Question Answering Challenge we achieve 60.15 % accuracy on Task 1: Predict Answer to a Visual Question and AP score of 83.78 % on Task 2: Predict Answerability of a Visual Question.Comment: VizWiz Grand Challenge: Describing Images and Videos Taken by Blind People (CVPR Workshop 2022

arXiv.org e-Print Archive

SoccerNet 2023 challenges results

Author: Abdelaziz Amr
Abdelwahed Mohamed
Alahi Alexandre
Ardö Håkan
Baikulov Ruslan
Barnich Olivier
Be'Ery Ishay
Chen Chen
Chen Ruilong
Chen Shimin
Choi Gyusik
Cioppa Anthony
Clapés Albert
Dai Wei
de Vleeschouwer Christophe
Deliège Adrien
Denize Julien
Deuser Fabian
Ding Shouhong
Escalera Sergio
Fahrudin Hasby
Falaleev Nikolay
Fu Jiajun
Fukushima Ryuto
Gan Yiyang
Ghanem Bernard
Giancola Silvio
Guo Hao
Habel Konrad
Held Jan
Hinojosa Carlos
Huang Zhijian
Hérault Romain
Jia Qiong
Jiao Licheng
Joo Yeeun
Kamal Abdullah
Kim Hankyul
Kim Juntae
Kobayashi Kenji
Koguchi Hidenari
Lee Jeongae
Lee Seungcheon
Li Junjie
Li Menglong
Li Tianjiao
Li Wei
Li Zhiheng
Liashuha Mykola
Lim Byoungkwon
Liu Bin
Liu Ruixuan
Luo Weixin
Ma Lin
Ma Yanbiao
Magera Floriane
Maglo Adrien
Mansourian Amir
Meng Ziyu
Miralles Pierre
Mkhallati Hassan
Moeslund Thomas
Muhammad Iftikar
Nakajima Kota
Nang Jongho
Nasr Mohamed
Orcesi Astrid
Oswald Norbert
Peng Rui
Pham Quoc-Cuong
Rabarisoa Jaonary
Ruan Zheng
Salah Ibrahim
Scott Atom
Shen Wei
Shitrit Gal
Somers Vladimir
Someya Taiga
Song Ran
Synowiec Kamil
Uchida Ikuma
van Droogenbroeck Marc
Wang Guanshuo
Wang Lizhi
Wang Luping
Xarles Artur
Xu Jinghang
Yan Feng
Yang Xinquan
Yerushalmy Ido
Yin Jianqin
Yu Fufu
Zeng Yingsen
Zhang Junpei
Zhang Kexin
Zhang Wei
Zhang Wenjie
Zhao Wending
Zhong Yujie
Zhou Mengying
Zhou Xin
Zhu Yongqiang
Publication venue: arXiv
Publication date: 01/01/2023
Field of study

SoccerNet 2023 Challenges ResultsThe SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, focusing on retrieving all timestamps related to global actions in soccer, (2) ball action spotting, focusing on retrieving all timestamps related to the soccer ball change of state, and (3) dense video captioning, focusing on describing the broadcast with natural language and anchored timestamps. The second theme, field understanding, relates to the single task of (4) camera calibration, focusing on retrieving the intrinsic and extrinsic camera parameters from images. The third and last theme, player understanding, is composed of three low-level tasks related to extracting information about the players: (5) re-identification, focusing on retrieving the same players across multiple views, (6) multiple object tracking, focusing on tracking players and the ball through unedited video streams, and (7) jersey number recognition, focusing on recognizing the jersey number of players from tracklets. Compared to the previous editions of the SoccerNet challenges, tasks (2-3-7) are novel, including new annotations and data, task (4) was enhanced with more data and annotations, and task (6) now focuses on end-to-end approaches. More information on the tasks, challenges, and leaderboards are available on https://www.soccer-net.org. Baselines and development kits can be found on https://github.com/SoccerNet

HAL - Normandie Université

HAL-CEA

Stratospheric ozone: An introduction to its study

Author: Ackerman
Ackerman
Ackerman
Ackerman
Ackerman
Ackerman
Ackerman
Anastasi
Anderson
Anderson
Arvesen
Barth
Basco
Bates
Bates
Bates
Bates
Bates
Bates
Bauer
Baulch
Baulch
Becker
Becker
Bemand
Bemand
Biaumé
Boland
Brasseur
Breen
Brewer
Brewer
Broadfoot
Brown
Brueckner
Cadle
Callis
Callis
Calvert
Campbell
Carver
Cashion
Chalonge
Chameides
Chameides
Chapman
Chapman
Chapman
Chappuis
Chappuis
Cicerone
Cicerone
Cicerone
Cieslik
Clyne
Clyne
Clyne
Clyne
Clyne
Clyne
Clyne
Clyne
Cornu
Cox
Cox
Cox
Craig
Crutzen
Crutzen
Crutzen
Crutzen
Crutzen
Crutzen
Crutzen
Cvetanovic
Davis
Davis
Davis
Davis
Davis
Davis
Davis
Davis
Davis
de la Rive
DeLuisi
DeMore
DeMore
Detwiler
Deuser
Dobson
Dobson
Dobson
Dobson
Dobson
Dobson
Dobson
Dobson
Donnelly
Doucet
Downie
Durie
Dutsch
Dütsch
Dütsch
Dütsch
Dütsch
Dütsch
Ehhalt
Ehhalt
Ehhalt
Engleman
Fabian
Fabry
Fabry
Fabry
Fabry
Fabry
Farmer
Foley
Foner
Fontanella
Fowler
Fried
Friedman
Gaedtke
Garvin
Georgii
Glänzer
Golde
Goldsmith
Goodeve
Gorse
Gorse
Gotz
Gray
Greenberg
Greiner
Greiner
Greiner
Griggs
Götz
Götz
Hack
Hahn
Hale
Hampson
Hampson
Hampson
Hampson
Harker
Harries
Hartley
Hartley
Heath
Heidner
Heidner
Hering
Herron
Herzberg
Hesstvedt
Hinteregger
Hippler
Hochanadel
Holt
Houzeau
Huggins
Huie
Huie
Hunt
Husain
Jayanty
Jayanty
Johns
Johnston
Johnston
Johnston
Johnston
Johnston
Johnston
Johnston
Johnston
Johnston
Johnston
Jordan
Junge
Kaufman
Kaufman
Kaufman
Klemm
Krezenski
Krueger
Krueger
Kuis
Kulcke
Kurylo
Labs
Lazrus
Lazrus
Leovy
Levy
Levy
Levy
Lin
Lin
Lin
Lloyd
Loewenstein
London
Lovelock
Lovelock
Lovelock
Lovelock
Lovelock
Luther
Mack
Makhover
Marcel Nicolet
Margitan
Marsh
Mastenbrook
Mastenbrook
McCarthy
McConnell
McConnell
McConnell
McConnell
McCrumb
McElroy
McElroy
McGrath
McQuigg
Mecke
Meinel
Meira
Milstein
Molina
Molina
Molina
Moore
Morley
Morris
Morris
Murcray
Murcray
Murray
Myer
Nash
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Nicolet
Niki
Nishi
Nishi
Norton
Ogawa
Paetzold
Parkes
Parkinson
Pate
Patel
Paukert
Payne
Phillips
Piaget
Piaget
Pitts
Pitts
Prabhakara
Regener
Ridley
Ridley
Robinson
Romand
Rottman
Rowland
Ruderman
Russell
Ryan
Sandoval
Sanhueza
Savage
Schmidt
Schönbein
Schönbein
Schütz
Seery
Seiler
Seiler
Sie
Sie
Simmonds
Simon
Simon
Simonaitis
Simonaitis
Simonaitis
Simonaitis
Simonaitis
Simonaitis
Simonaitis
Simonaitis
Slagle
Slanger
Slanger
Smith
Smith
Smith
Sperling
Spicer
Stedman
Stedman
Stolarski
Strobel
Strobel
Strobel
Su
Thekaekara
Thekaekara
Tisone
Trainor
Trainor
Tsang
Urey
Vassy
Warneck
Warneck
Warneck
Washida
Washida
Watson
Welge
Westenberg
Westenberg
Westenberg
Westenberg
Westenberg
Westenberg
Widing
Wilkniss
Wilkniss
Wilkniss
Willson
Wofsy
Wofsy
Wofsy
Wofsy
Wong
Wulf
Wulf
Wulf
Zahniser
Zellner
Publication venue: 'American Geophysical Union (AGU)'
Publication date: 01/01/1975
Field of study

info:eu-repo/semantics/publishe

Crossref

Archivsystem Ask23

DI-fusion