Search CORE

44 research outputs found

Implicit 3D Orientation Learning for 6D Object Detection from RGB Images

Author: BT Phong
P Vincent
S Hinterstoisser
S Hinterstoisser
S Hinterstoisser
SR Richter
T Hodaň
T-Y Lin
W Kehl
W Liu
Y Movshovitz-Attias
Z Zhang
Publication venue
Publication date: 10/09/2018
Field of study

We propose a real-time RGB-based pipeline for object detection and 6D pose estimation. Our novel 3D orientation estimation is based on a variant of the Denoising Autoencoder that is trained on simulated views of a 3D model using Domain Randomization. This so-called Augmented Autoencoder has several advantages over existing methods: It does not require real, pose-annotated training data, generalizes to various test sensors and inherently handles object and view symmetries. Instead of learning an explicit mapping from input images to object poses, it provides an implicit representation of object orientations defined by samples in a latent space. Our pipeline achieves state-of-the-art performance on the T-LESS dataset both in the RGB and RGB-D domain. We also evaluate on the LineMOD dataset where we can compete with other synthetically trained approaches. We further increase performance by correcting 3D orientation estimates to account for perspective errors when the object deviates from the image center and show extended results.Comment: Code available at: https://github.com/DLR-RM/AugmentedAutoencode

arXiv.org e-Print Archive

Institute of Transport Research:Publications

Crossref

Aneurysm Identification in Cerebral Models with Multiview Convolutional Neural Network

Author: A Lauric
BT Phong
GJE Rinkel
JW Hop
MC Villa-Uriol
S Kobashi
S Suniaga
SR Aylward
T Jerman
T Nakao
X Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/05/2020
Field of study

Stroke is the third most common cause of death and a major contributor to long-term disability worldwide. Severe stroke is most often caused by the rupture of a cerebral aneurysm, a weakened area in a blood vessel. The detection and quantification of cerebral aneurysms are essential for the prevention and treatment of aneurysmal rupture and cerebral infarction. Here, we propose a novel aneurysm detection method in a three-dimensional (3D) cerebrovascular model based on convolutional neural networks (CNNs). The multiview method is used to obtain a sequence of 2D images on the cerebral vessel branch model. The pretrained CNN is used with transfer learning to overcome the small training sample problem. The data augmentation strategy with rotation, mirroring and flipping helps improve the performance dramatically, particularly on our small datasets. The hyperparameter of the view number is determined in the task. We have applied the labeling task on 56 3D mesh models with aneurysms (positive) and 65 models without aneurysms (negative). The average accuracy of individual projected images is 87.86%, while that of the model is 93.4% with the best view number. The framework is highly effective with quick training efficiency that can be widely extended to detect other organ anomalies

Crossref

The University of Manchester - Institutional Repository

White Rose Research Online

Solving Uncalibrated Photometric Stereo using Total Variation

Author: A Chambolle
A Chambolle
A Chambolle
A Hertzmann
AS Georghiades
BKP Horn
BT Phong
C Hernández
François Lauze
H Attouch
H Hayakawa
I Horovitz
J Fan
J Oliensis
JD Durou
Jean-Denis Durou
JJ Koenderink
K Lee
LI Rudin
PN Belhumeur
R Kozera
R Onn
RJ Woodham
RT Frankot
S Barsky
T Simchony
X Bresson
Yvain Quéau
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

International audienceEstimating the shape and appearance of an object, given one or several images, is still an open and challenging research problem called 3D-reconstruction. Among the different techniques available, photometric stereo (PS) produces highly accurate results when the lighting conditions have been identified. When these conditions are unknown, the problem becomes the so-called uncalibrated PS problem, which is ill-posed. In this paper, we will show how total variation can be used to reduce the ambiguities of uncalibrated PS, and we will study two methods for estimating the parameters of the generalized bas-relief ambiguity. These methods will be evaluated through the 3D-reconstruction of real-world objects

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

Copenhagen University Research Information System

3D-Visualisierung Von Grauwertvoxelräumen

Author: BT Phong
HE Rushmeier
JF Blinn
JT Kajiya
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1988
Field of study

Crossref

Mechanisms on Graphical Core Variables in the Design of Cartographic 3D City Presentations

Author: AM MacEachren
BT Phong
G Hake
J Bertin
M Nienhaus
T Isenberg
T Strothotte
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Crossref

Phong Reflectance Model

Author: A Ngan
BT Phong
D Shreiner
GJ Ward
JF Blinn
M Ashikmin
RL Cook
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Brain and Neck Visualization Techniques

Author: BT Phong
L Neumann
T Hachaj
T Hachaj
T Hachaj
T Hachaj
WE Lorensen
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Screen Space Ambient Occlusion Based Multiple Importance Sampling for Real-Time Rendering

Author: A Owen
BT Phong
C Willmott
F Hernell
H Landis
J Amanatides
P Cox
RY Rubinstein
S Paris
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

International audienceWe propose a new approximation technique for accelerating the Global Illumination algorithm for real-time rendering. The proposed approach is based on the Screen-Space Ambient Occlusion (SSAO) method, which approximates the global illumination for large, fully dynamic scenes at interactive frame rates. Current algorithms that are based on the SSAO method suffer from difficulties due to the large number of samples that are required. In this paper, we propose an improvement to the SSAO technique by integrating it with a Multiple Importance Sampling technique that combines a stratified sampling method with an importance sampling method, with the objective of reducing the number of samples. Experimental evaluation demonstrates that our technique can produce high-quality images in real time and is significantly faster than traditional techniques

Crossref

HAL-INSU

HAL-OBSPM

Flood-prone area mapping using machine learning techniques: a case study of Quang Binh province, Vietnam

Author: Bui QD
Costache R
Luu C
Nguyen LT
Nguyen TT
Pham BT
Van Le H
Van Phong T
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/05/2022
Field of study

Vietnam’s central coastal region is the most vulnerable and always at flood risk, severely affecting people’s livelihoods and socio-economic development. In particular, Quang Binh province is often affected by floods and storms over the year. However, it still lacks studies on flood hazard estimation and prediction tools in this area. This study aims to develop a flooding susceptibility assessment tool using various machine learning (ML) techniques namely alternating decision tree (AD Tree), logistic model tree (LM Tree), reduced-error pruning tree (REP Tree), J48 decision tree (J48) and Naïve Bayes tree (NB Tree); historical flood marks; and available data of topography, hydrology, geology, and environment considering Quang Binh province as a study area. We used flood mark locations of major flooding events in the years 2007, 2010, and 2016; and ten flood conditioning factors to construct and validate the ML models. Various validation methods, including area under the ROC curve (AUC), were used to validate and compare the models. The result of the models’ validation suggests that all models have good performance: AD Tree (AUC = 0.968), LM Tree (AUC = 0.967), REP Tree (AUC = 0.897), J48 (AUC = 0.953), and NB Tree (AUC = 0.986). Out of these, NB Tree managed to achieve the best performance in terms of flood prediction with an accuracy higher than 92 %. The final flood susceptibility map highlights 6,265 km2 (78.8 % area) with a very low flooding hazard, 391 km2 (4.9 % area) with a low flooding hazard, 224 km2 (2.8 % area) with a moderate flooding hazard, 243 km2 (3.1 %) with a high flooding hazard, and 829 km2 (10.4 % area) with very high flooding hazard. The final flooding susceptibility assessment map could add a valuable source for flood risk reduction and management activities of Quang Binh province

OPUS - University of Technology Sydney

Simulating Non-Lambertian Phenomena Involving Linearly-Varying Luminaires

Author: BT Phong
DR Baum
JR Howell
L Lewin
M Berger
M Chen
M Schreiber
M Spivak
MM Stark
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref