2 research outputs found
Monocular Pedestrian Orientation Estimation Based on Deep 2D-3D Feedforward
Accurate pedestrian orientation estimation of autonomous driving helps the
ego vehicle obtain the intentions of pedestrians in the related environment,
which are the base of safety measures such as collision avoidance and
prewarning. However, because of relatively small sizes and high-level
deformation of pedestrians, common pedestrian orientation estimation models
fail to extract sufficient and comprehensive information from them, thus having
their performance restricted, especially monocular ones which fail to obtain
depth information of objects and related environment. In this paper, a novel
monocular pedestrian orientation estimation model, called FFNet, is proposed.
Apart from camera captures, the model adds the 2D and 3D dimensions of
pedestrians as two other inputs according to the logic relationship between
orientation and them. The 2D and 3D dimensions of pedestrians are determined
from the camera captures and further utilized through two feedforward links
connected to the orientation estimator. The feedforward links strengthen the
logicality and interpretability of the network structure of the proposed model.
Experiments show that the proposed model has at least 1.72% AOS increase than
most state-of-the-art models after identical training processes. The model also
has competitive results in orientation estimation evaluation on KITTI dataset.Comment: 29 pages, 12 figure
Amplifying the Anterior-Posterior Difference via Data Enhancement -- A More Robust Deep Monocular Orientation Estimation Solution
Existing deep-learning based monocular orientation estimation algorithms
faces the problem of confusion between the anterior and posterior parts of the
objects, caused by the feature similarity of such parts in typical objects in
traffic scenes such as cars and pedestrians. While difficult to solve, the
problem may lead to serious orientation estimation errors, and pose threats to
the upcoming decision making process of the ego vehicle, since the predicted
tracks of objects may have directions opposite to ground truths. In this paper,
we mitigate this problem by proposing a pretraining method. The method focuses
on predicting the left/right semicircle in which the orientation of the object
is located. The trained semicircle prediction model is then integrated into the
orientation angle estimation model which predicts a value in range .
Experiment results show that the proposed semicircle prediction enhances the
accuracy of orientation estimation, and mitigates the problem stated above.
With the proposed method, a backbone achieves similar state-of-the-art
orientation estimation performance to existing approaches with well-designed
network structures.Comment: 7 pages, 10 figure