12,902 research outputs found

    Social Attention: Modeling Attention in Human Crowds

    Full text link
    Robots that navigate through human crowds need to be able to plan safe, efficient, and human predictable trajectories. This is a particularly challenging problem as it requires the robot to predict future human trajectories within a crowd where everyone implicitly cooperates with each other to avoid collisions. Previous approaches to human trajectory prediction have modeled the interactions between humans as a function of proximity. However, that is not necessarily true as some people in our immediate vicinity moving in the same direction might not be as important as other people that are further away, but that might collide with us in the future. In this work, we propose Social Attention, a novel trajectory prediction model that captures the relative importance of each person when navigating in the crowd, irrespective of their proximity. We demonstrate the performance of our method against a state-of-the-art approach on two publicly available crowd datasets and analyze the trained attention model to gain a better understanding of which surrounding agents humans attend to, when navigating in a crowd

    자율 주행을 μœ„ν•œ ν•™μŠ΅ 기반의 닀쀑 ꡐ톡 μ°Έμ—¬μž 경둜예츑 방법

    Get PDF
    ν•™μœ„λ…Όλ¬Έ (석사)-- μ„œμšΈλŒ€ν•™κ΅ λŒ€ν•™μ› : κ³΅κ³ΌλŒ€ν•™ 전기·정보곡학뢀, 2019. 2. μ„œμŠΉμš°.μžμœ¨μ£Όν–‰μ°¨λŸ‰μ΄ μ•ˆμ „ν•˜λ©΄μ„œλ„, ꡐ톡 흐름을 λ°©ν•΄ν•˜μ§€ μ•ŠλŠ” 인간 μˆ˜μ€€μ˜ interactiveν•œ 주행을 μ‹€ν˜„ν•˜κΈ° μœ„ν•΄μ„œλŠ” μ£Όλ³€ μš΄μ „μžμ™€ λ³΄ν–‰μžλ₯Ό ν¬ν•¨ν•œ ꡐ톡 μ°Έμ—¬μžλ“€μ˜ μ˜λ„λ₯Ό νŒŒμ•…ν•˜λŠ” 것이 ν•„μˆ˜μ μ΄λ‹€. λ™μΌν•œ λ„λ‘œ ν™˜κ²½μ—μ„œλ„ μ£Όλ³€μ˜ ꡐ톡 μ°Έμ—¬μžλ“€μ΄ μ–΄λ–€ μ˜λ„λ₯Ό 가지고 μ΄λ™ν•˜κ³  μžˆλŠ”κ°€μ— λ”°λΌμ„œ μ ν•©ν•œ μ£Όν–‰ μ „λž΅μ€ 맀번 달라진닀. 특히 쒁은 곡간을 λ§Žμ€ μ—μ΄μ „νŠΈλ“€μ΄ κ³΅μœ ν•˜κ³  μžˆλŠ” 골λͺ©κΈΈ μƒν™©μ—μ„œλŠ” 각 μ—μ΄μ „νŠΈλ“€μ˜ 선택은 μ œν•œλ˜μ§€λ§Œ, 각 μ—μ΄μ „νŠΈλ“€μ˜ μ˜μ‚¬κ²°μ •κ³Όμ •μ€ κ·Έλ“€κ°„μ˜ μƒν˜Έμž‘μš©μ΄ κ³ λ €λ˜μ–΄ 맀우 λ³΅μž‘ν•˜λ‹€.이런 상황에 λ§žλŠ” κ²½λ‘œμ˜ˆμΈ‘μ€ 과거의 ꢀ적, ν˜„μž¬ μΈμ‹ν•˜κ³  μžˆλŠ” λ„λ‘œ ν™˜κ²½, μ£Όλ³€ ꡐ톡 μ°Έμ—¬μžμ˜ μƒνƒœ 등을 κ³ λ €ν•˜μ—¬ μˆ˜ν–‰λ˜μ–΄μ•Ό ν•œλ‹€. λ˜ν•œ μΌλ°˜ν™”λœ λŒ€λΆ€λΆ„μ˜ ν™˜κ²½μ—μ„œ μˆ˜ν–‰λ˜λ €λ©΄, λΉ λ₯Έ λŸ¬λ‹ νƒ€μž„κ³Ό λ™μ‹œμ— κ³ λ €ν•˜λŠ” ꡐ톡 μ°Έμ—¬μžμ˜ μˆ«μžμ— λŒ€ν•œ μ œμ•½μ΄ 적어야 ν•œλ‹€. 이 λ…Όλ¬Έμ—μ„œλŠ” 각 ꡐ톡 μ°Έμ—¬μžλ“€μ˜ κ³Όκ±° ꢀ적과 ν˜„μž¬ μœ„μΉ˜ 상황, λ„λ‘œ 상황을 λ™μ‹œμ— κ³ λ €ν•˜μ—¬ λͺ¨λ“  μ°Έμ—¬μžλ“€μ˜ μƒν˜Έμž‘μš©μ΄ 고렀된 경둜예츑 방법을 μ œμ•ˆν•œλ‹€. 각 λ¬Όμ²΄λ“€μ˜ κ³Όκ±° ꢀ적과 λ™μ‹œμ— μžμœ¨μ£Όν–‰ μ°¨λŸ‰μ΄ μΈμ‹ν•˜κ³  μžˆλŠ” λ„λ‘œμ™€ μ£Όλ³€ 물체λ₯Ό μž…λ ₯μœΌλ‘œν•˜κ³ , 이 λ‘κ°œμ˜ μž„λ² λ”© κ²°κ³Όλ₯Ό ν˜Όν•©ν•˜μ—¬ λͺ¨λ“  물체듀에 λŒ€ν•œ 경둜 μ˜ˆμΈ‘μ„ λ™μ‹œμ— μˆ˜ν–‰ν•œλ‹€. 이 κ³Όμ •μ—μ„œ λ„€νŠΈμ›Œν¬ ꡬ쑰 λ‚΄λΆ€μ—μ„œ 각각 λ¬Όμ²΄λ“€μ˜ μž„λ² λ”© κ²°κ³Όλ₯Ό μœ„μΉ˜ 정보에 λ§€μΉ­μ‹œν‚΄μœΌλ‘œμ¨ 효과적으둜 μ£Όλ³€ 상황과 과거의 ꢀ적을 λ™μ‹œμ— κ³ λ €ν•˜λŠ” 경둜 μ˜ˆμΈ‘μ„ ν•™μŠ΅ν•  수 μžˆλŠ” ꡬ쑰λ₯Ό μ œμ•ˆν•œλ‹€.λͺ‡κ°€μ§€ λ„€νŠΈμ›Œν¬ ꡬ쑰에 λ”°λ₯Έ μ„±λŠ₯ 비ꡐ와 ν•¨κ»˜, λ‹€μ–‘ν•œ μ£Όν–‰ ν™˜κ²½μ—μ„œ μ •λŸ‰μ μΈ 평가와 정성적인 ν‰κ°€λ‘œ μœ νš¨μ„±μ„ μž…μ¦ν•œλ‹€.λ˜ν•œ λ³΄ν–‰μž 경둜예츑 데이터에 ν…ŒμŠ€νŠΈλ₯Ό μ§„ν–‰ν•¨μœΌλ‘œμ¨ 타 μ•Œκ³ λ¦¬μ¦˜κ³Όμ˜ μ„±λŠ₯을 λΉ„κ΅ν•œλ‹€.Recent autonomous driving research has shown remarkable and promising results. However, safe, sociable driving in an urban environment still has many challenges ahead. For realizing safe, interactive driving in complex alley scenario which shares a narrow area among traffic participants, It is essential to grasp each other's intention. Even in the same road environment, safe, and sociable driving policy may differ depending on the intention of the traffic participant agents around the ego vehicle. But understanding others intention and predicting their trajectories are complicated because each one basically considers multiple factorsroad environment, state of their surrounding traffic participants at the same time which realized as interaction. In this thesis dissertation, we propose a new trajectory prediction algorithm that considers all the information that each of the traffic participants would consider when they make a decision. By combining both each of history trajectories and grid map of surroundings as a latent vector representation, it predicts all the future trajectories of traffic participant agents around ego vehicle at once. This dissertation suggests two main module that fuses spatial and temporal information effectively. We verify the effectiveness of network structure by testing on the various driving scenario comparing with some network variants through quantitative and qualitative evaluation. Also, the proposed network is verified by applying it to public pedestrian trajectory prediction dataset to verify usability as a generalized methodology and to compare it with other SOTA algorithms.Abstract Contents List of Tables List of Figures 1 Introduction 1.1 Background and Motivation 2 Related Work 2.1 Contributions of the Dissertation 3 Conditional Neural Process 3.1 Conditional Neural Process(CNP) Overview 3.2 Trajectory Prediction with Scene Information as CNP 3.2.1 Formulation 3.2.2 Loss and Training Algorithm 4 Efficient Network Architecture for Intention Prediction 4.1 Network Overview 4.2 Trajectory Encoder 4.2.1 Spatio-Temporal Representation 4.3 Scene Feature Extraction 4.3.1 Side Spatial Extraction 4.4 Trajectory Decoder 5 Experiment 5.1 Driving environment dataset 5.1.1 Data acquisition method 5.1.2 Overview 5.1.3 Alley scenario 5.1.4 Urban Scenario 5.2 Public Pedestrian Dataset 6 ConclusionMaste

    Human Motion Trajectory Prediction: A Survey

    Full text link
    With growing numbers of intelligent autonomous systems in human environments, the ability of such systems to perceive, understand and anticipate human behavior becomes increasingly important. Specifically, predicting future positions of dynamic agents and planning considering such predictions are key tasks for self-driving vehicles, service robots and advanced surveillance systems. This paper provides a survey of human motion trajectory prediction. We review, analyze and structure a large selection of work from different communities and propose a taxonomy that categorizes existing methods based on the motion modeling approach and level of contextual information used. We provide an overview of the existing datasets and performance metrics. We discuss limitations of the state of the art and outline directions for further research.Comment: Submitted to the International Journal of Robotics Research (IJRR), 37 page

    Dynamic Occupancy Grid Prediction for Urban Autonomous Driving: A Deep Learning Approach with Fully Automatic Labeling

    Full text link
    Long-term situation prediction plays a crucial role in the development of intelligent vehicles. A major challenge still to overcome is the prediction of complex downtown scenarios with multiple road users, e.g., pedestrians, bikes, and motor vehicles, interacting with each other. This contribution tackles this challenge by combining a Bayesian filtering technique for environment representation, and machine learning as long-term predictor. More specifically, a dynamic occupancy grid map is utilized as input to a deep convolutional neural network. This yields the advantage of using spatially distributed velocity estimates from a single time step for prediction, rather than a raw data sequence, alleviating common problems dealing with input time series of multiple sensors. Furthermore, convolutional neural networks have the inherent characteristic of using context information, enabling the implicit modeling of road user interaction. Pixel-wise balancing is applied in the loss function counteracting the extreme imbalance between static and dynamic cells. One of the major advantages is the unsupervised learning character due to fully automatic label generation. The presented algorithm is trained and evaluated on multiple hours of recorded sensor data and compared to Monte-Carlo simulation
    • …
    corecore