12,902 research outputs found
Social Attention: Modeling Attention in Human Crowds
Robots that navigate through human crowds need to be able to plan safe,
efficient, and human predictable trajectories. This is a particularly
challenging problem as it requires the robot to predict future human
trajectories within a crowd where everyone implicitly cooperates with each
other to avoid collisions. Previous approaches to human trajectory prediction
have modeled the interactions between humans as a function of proximity.
However, that is not necessarily true as some people in our immediate vicinity
moving in the same direction might not be as important as other people that are
further away, but that might collide with us in the future. In this work, we
propose Social Attention, a novel trajectory prediction model that captures the
relative importance of each person when navigating in the crowd, irrespective
of their proximity. We demonstrate the performance of our method against a
state-of-the-art approach on two publicly available crowd datasets and analyze
the trained attention model to gain a better understanding of which surrounding
agents humans attend to, when navigating in a crowd
μμ¨ μ£Όνμ μν νμ΅ κΈ°λ°μ λ€μ€ κ΅ν΅ μ°Έμ¬μ κ²½λ‘μμΈ‘ λ°©λ²
νμλ
Όλ¬Έ (μμ¬)-- μμΈλνκ΅ λνμ : 곡과λν μ κΈ°Β·μ 보곡νλΆ, 2019. 2. μμΉμ°.μμ¨μ£Όνμ°¨λμ΄ μμ νλ©΄μλ, κ΅ν΅ νλ¦μ λ°©ν΄νμ§ μλ μΈκ° μμ€μ interactiveν μ£Όνμ μ€ννκΈ° μν΄μλ μ£Όλ³ μ΄μ μμ 보νμλ₯Ό ν¬ν¨ν κ΅ν΅ μ°Έμ¬μλ€μ μλλ₯Ό νμ
νλ κ²μ΄ νμμ μ΄λ€. λμΌν λλ‘ νκ²½μμλ μ£Όλ³μ κ΅ν΅ μ°Έμ¬μλ€μ΄ μ΄λ€ μλλ₯Ό κ°μ§κ³ μ΄λνκ³ μλκ°μ λ°λΌμ μ ν©ν μ£Όν μ λ΅μ λ§€λ² λ¬λΌμ§λ€. νΉν μ’μ 곡κ°μ λ§μ μμ΄μ νΈλ€μ΄ 곡μ νκ³ μλ 골λͺ©κΈΈ μν©μμλ κ° μμ΄μ νΈλ€μ μ νμ μ νλμ§λ§, κ° μμ΄μ νΈλ€μ μμ¬κ²°μ κ³Όμ μ κ·Έλ€κ°μ μνΈμμ©μ΄ κ³ λ €λμ΄ λ§€μ° λ³΅μ‘νλ€.μ΄λ° μν©μ λ§λ κ²½λ‘μμΈ‘μ κ³Όκ±°μ κΆ€μ , νμ¬ μΈμνκ³ μλ λλ‘ νκ²½, μ£Όλ³ κ΅ν΅ μ°Έμ¬μμ μν λ±μ κ³ λ €νμ¬ μνλμ΄μΌ νλ€. λν μΌλ°νλ λλΆλΆμ νκ²½μμ μνλλ €λ©΄, λΉ λ₯Έ λ¬λ νμκ³Ό λμμ κ³ λ €νλ κ΅ν΅ μ°Έμ¬μμ μ«μμ λν μ μ½μ΄ μ μ΄μΌ νλ€.
μ΄ λ
Όλ¬Έμμλ κ° κ΅ν΅ μ°Έμ¬μλ€μ κ³Όκ±° κΆ€μ κ³Ό νμ¬ μμΉ μν©, λλ‘ μν©μ λμμ κ³ λ €νμ¬ λͺ¨λ μ°Έμ¬μλ€μ μνΈμμ©μ΄ κ³ λ €λ κ²½λ‘μμΈ‘ λ°©λ²μ μ μνλ€. κ° λ¬Όμ²΄λ€μ κ³Όκ±° κΆ€μ κ³Ό λμμ μμ¨μ£Όν μ°¨λμ΄ μΈμνκ³ μλ λλ‘μ μ£Όλ³ λ¬Όμ²΄λ₯Ό μ
λ ₯μΌλ‘νκ³ , μ΄ λκ°μ μλ² λ© κ²°κ³Όλ₯Ό νΌν©νμ¬ λͺ¨λ 물체λ€μ λν κ²½λ‘ μμΈ‘μ λμμ μννλ€. μ΄ κ³Όμ μμ λ€νΈμν¬ κ΅¬μ‘° λ΄λΆμμ κ°κ° 물체λ€μ μλ² λ© κ²°κ³Όλ₯Ό μμΉ μ 보μ 맀μΉμν΄μΌλ‘μ¨ ν¨κ³Όμ μΌλ‘ μ£Όλ³ μν©κ³Ό κ³Όκ±°μ κΆ€μ μ λμμ κ³ λ €νλ κ²½λ‘ μμΈ‘μ νμ΅ν μ μλ ꡬ쑰λ₯Ό μ μνλ€.λͺκ°μ§ λ€νΈμν¬ κ΅¬μ‘°μ λ°λ₯Έ μ±λ₯ λΉκ΅μ ν¨κ», λ€μν μ£Όν νκ²½μμ μ λμ μΈ νκ°μ μ μ±μ μΈ νκ°λ‘ μ ν¨μ±μ μ
μ¦νλ€.λν 보νμ κ²½λ‘μμΈ‘ λ°μ΄ν°μ ν
μ€νΈλ₯Ό μ§νν¨μΌλ‘μ¨ ν μκ³ λ¦¬μ¦κ³Όμ μ±λ₯μ λΉκ΅νλ€.Recent autonomous driving research has shown remarkable and promising results. However, safe, sociable driving in an urban environment still has many challenges ahead. For realizing safe, interactive driving in complex alley scenario which shares a narrow area among traffic participants, It is essential to grasp each other's intention. Even in the same road environment, safe, and sociable driving policy may differ depending on the intention of the traffic participant agents around the ego vehicle. But understanding others intention and predicting their trajectories are complicated because each one basically considers multiple factorsroad environment, state of their surrounding traffic participants at the same time which realized as interaction.
In this thesis dissertation, we propose a new trajectory prediction algorithm that considers all the information that each of the traffic participants would consider when they make a decision. By combining both each of history trajectories and grid map of surroundings as a latent vector representation, it predicts all the future trajectories of traffic participant agents around ego vehicle at once.
This dissertation suggests two main module that fuses spatial and temporal information effectively. We verify the effectiveness of network structure by testing on the various driving scenario comparing with some network variants through quantitative and qualitative evaluation. Also, the proposed network is verified by applying it to public pedestrian trajectory prediction dataset to verify usability as a generalized methodology and to compare it with other SOTA algorithms.Abstract
Contents
List of Tables
List of Figures
1 Introduction
1.1 Background and Motivation
2 Related Work
2.1 Contributions of the Dissertation
3 Conditional Neural Process
3.1 Conditional Neural Process(CNP) Overview
3.2 Trajectory Prediction with Scene Information as CNP
3.2.1 Formulation
3.2.2 Loss and Training Algorithm
4 Efficient Network Architecture for Intention Prediction
4.1 Network Overview
4.2 Trajectory Encoder
4.2.1 Spatio-Temporal Representation
4.3 Scene Feature Extraction
4.3.1 Side Spatial Extraction
4.4 Trajectory Decoder
5 Experiment
5.1 Driving environment dataset
5.1.1 Data acquisition method
5.1.2 Overview
5.1.3 Alley scenario
5.1.4 Urban Scenario
5.2 Public Pedestrian Dataset
6 ConclusionMaste
Human Motion Trajectory Prediction: A Survey
With growing numbers of intelligent autonomous systems in human environments,
the ability of such systems to perceive, understand and anticipate human
behavior becomes increasingly important. Specifically, predicting future
positions of dynamic agents and planning considering such predictions are key
tasks for self-driving vehicles, service robots and advanced surveillance
systems. This paper provides a survey of human motion trajectory prediction. We
review, analyze and structure a large selection of work from different
communities and propose a taxonomy that categorizes existing methods based on
the motion modeling approach and level of contextual information used. We
provide an overview of the existing datasets and performance metrics. We
discuss limitations of the state of the art and outline directions for further
research.Comment: Submitted to the International Journal of Robotics Research (IJRR),
37 page
Dynamic Occupancy Grid Prediction for Urban Autonomous Driving: A Deep Learning Approach with Fully Automatic Labeling
Long-term situation prediction plays a crucial role in the development of
intelligent vehicles. A major challenge still to overcome is the prediction of
complex downtown scenarios with multiple road users, e.g., pedestrians, bikes,
and motor vehicles, interacting with each other. This contribution tackles this
challenge by combining a Bayesian filtering technique for environment
representation, and machine learning as long-term predictor. More specifically,
a dynamic occupancy grid map is utilized as input to a deep convolutional
neural network. This yields the advantage of using spatially distributed
velocity estimates from a single time step for prediction, rather than a raw
data sequence, alleviating common problems dealing with input time series of
multiple sensors. Furthermore, convolutional neural networks have the inherent
characteristic of using context information, enabling the implicit modeling of
road user interaction. Pixel-wise balancing is applied in the loss function
counteracting the extreme imbalance between static and dynamic cells. One of
the major advantages is the unsupervised learning character due to fully
automatic label generation. The presented algorithm is trained and evaluated on
multiple hours of recorded sensor data and compared to Monte-Carlo simulation
- β¦