44 research outputs found
Implicit 3D Orientation Learning for 6D Object Detection from RGB Images
We propose a real-time RGB-based pipeline for object detection and 6D pose
estimation. Our novel 3D orientation estimation is based on a variant of the
Denoising Autoencoder that is trained on simulated views of a 3D model using
Domain Randomization. This so-called Augmented Autoencoder has several
advantages over existing methods: It does not require real, pose-annotated
training data, generalizes to various test sensors and inherently handles
object and view symmetries. Instead of learning an explicit mapping from input
images to object poses, it provides an implicit representation of object
orientations defined by samples in a latent space. Our pipeline achieves
state-of-the-art performance on the T-LESS dataset both in the RGB and RGB-D
domain. We also evaluate on the LineMOD dataset where we can compete with other
synthetically trained approaches. We further increase performance by correcting
3D orientation estimates to account for perspective errors when the object
deviates from the image center and show extended results.Comment: Code available at: https://github.com/DLR-RM/AugmentedAutoencode
Aneurysm Identification in Cerebral Models with Multiview Convolutional Neural Network
Stroke is the third most common cause of death and a major contributor to long-term disability worldwide. Severe stroke is most often caused by the rupture of a cerebral aneurysm, a weakened area in a blood vessel. The detection and quantification of cerebral aneurysms are essential for the prevention and treatment of aneurysmal rupture and cerebral infarction. Here, we propose a novel aneurysm detection method in a three-dimensional (3D) cerebrovascular model based on convolutional neural networks (CNNs). The multiview method is used to obtain a sequence of 2D images on the cerebral vessel branch model. The pretrained CNN is used with transfer learning to overcome the small training sample problem. The data augmentation strategy with rotation, mirroring and flipping helps improve the performance dramatically, particularly on our small datasets. The hyperparameter of the view number is determined in the task. We have applied the labeling task on 56 3D mesh models with aneurysms (positive) and 65 models without aneurysms (negative). The average accuracy of individual projected images is 87.86%, while that of the model is 93.4% with the best view number. The framework is highly effective with quick training efficiency that can be widely extended to detect other organ anomalies
Solving Uncalibrated Photometric Stereo using Total Variation
International audienceEstimating the shape and appearance of an object, given one or several images, is still an open and challenging research problem called 3D-reconstruction. Among the different techniques available, photometric stereo (PS) produces highly accurate results when the lighting conditions have been identified. When these conditions are unknown, the problem becomes the so-called uncalibrated PS problem, which is ill-posed. In this paper, we will show how total variation can be used to reduce the ambiguities of uncalibrated PS, and we will study two methods for estimating the parameters of the generalized bas-relief ambiguity. These methods will be evaluated through the 3D-reconstruction of real-world objects
Screen Space Ambient Occlusion Based Multiple Importance Sampling for Real-Time Rendering
International audienceWe propose a new approximation technique for accelerating the Global Illumination algorithm for real-time rendering. The proposed approach is based on the Screen-Space Ambient Occlusion (SSAO) method, which approximates the global illumination for large, fully dynamic scenes at interactive frame rates. Current algorithms that are based on the SSAO method suffer from difficulties due to the large number of samples that are required. In this paper, we propose an improvement to the SSAO technique by integrating it with a Multiple Importance Sampling technique that combines a stratified sampling method with an importance sampling method, with the objective of reducing the number of samples. Experimental evaluation demonstrates that our technique can produce high-quality images in real time and is significantly faster than traditional techniques
Flood-prone area mapping using machine learning techniques: a case study of Quang Binh province, Vietnam
Vietnamâs central coastal region is the most vulnerable and always at flood risk, severely affecting peopleâs livelihoods and socio-economic development. In particular, Quang Binh province is often affected by floods and storms over the year. However, it still lacks studies on flood hazard estimation and prediction tools in this area. This study aims to develop a flooding susceptibility assessment tool using various machine learning (ML) techniques namely alternating decision tree (AD Tree), logistic model tree (LM Tree), reduced-error pruning tree (REP Tree), J48 decision tree (J48) and NaĂŻve Bayes tree (NB Tree); historical flood marks; and available data of topography, hydrology, geology, and environment considering Quang Binh province as a study area. We used flood mark locations of major flooding events in the years 2007, 2010, and 2016; and ten flood conditioning factors to construct and validate the ML models. Various validation methods, including area under the ROC curve (AUC), were used to validate and compare the models. The result of the modelsâ validation suggests that all models have good performance: AD Tree (AUC = 0.968), LM Tree (AUC = 0.967), REP Tree (AUC = 0.897), J48 (AUC = 0.953), and NB Tree (AUC = 0.986). Out of these, NB Tree managed to achieve the best performance in terms of flood prediction with an accuracy higher than 92 %. The final flood susceptibility map highlights 6,265 km2 (78.8 % area) with a very low flooding hazard, 391 km2 (4.9 % area) with a low flooding hazard, 224 km2 (2.8 % area) with a moderate flooding hazard, 243 km2 (3.1 %) with a high flooding hazard, and 829 km2 (10.4 % area) with very high flooding hazard. The final flooding susceptibility assessment map could add a valuable source for flood risk reduction and management activities of Quang Binh province