Details
Original language | English |
---|---|
Pages (from-to) | 253-266 |
Number of pages | 14 |
Journal | ISPRS Journal of Photogrammetry and Remote Sensing |
Volume | 172 |
Early online date | 14 Jan 2021 |
Publication status | Published - Feb 2021 |
Abstract
Keywords
- cs.CV, Trajectory prediction, Encoder, Generative model
ASJC Scopus subject areas
- Earth and Planetary Sciences(all)
- Computers in Earth Sciences
- Engineering(all)
- Engineering (miscellaneous)
- Physics and Astronomy(all)
- Atomic and Molecular Physics, and Optics
- Computer Science(all)
- Computer Science Applications
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
In: ISPRS Journal of Photogrammetry and Remote Sensing, Vol. 172, 02.2021, p. 253-266.
Research output: Contribution to journal › Article › Research › peer review
}
TY - JOUR
T1 - AMENet
T2 - Attentive Maps Encoder Network for Trajectory Prediction
AU - Cheng, Hao
AU - Liao, Wentong
AU - Yang, Michael Ying
AU - Rosenhahn, Bodo
AU - Sester, Monika
N1 - Funding Information: This work is supported by the German Research Foundation (DFG) through the Research Training Group SocialCars (GRK 1931).
PY - 2021/2
Y1 - 2021/2
N2 - Trajectory prediction is a crucial task in different communities, such as intelligent transportation systems, photogrammetry, computer vision, and mobile robot applications. However, there are many challenges to predict the trajectories of heterogeneous road agents (e.g. pedestrians, cyclists and vehicles) at a microscopical level. For example, an agent might be able to choose multiple plausible paths in complex interactions with other agents in varying environments, and the behavior of each agent is affected by the various behaviors of its neighboring agents. To this end, we propose an end-to-end generative model named Attentive Maps Encoder Network (AMENet) for accurate and realistic multi-path trajectory prediction. Our method leverages the target road user's motion information (i.e. movement in xy-axis in a Cartesian space) and the interaction information with the neighboring road users at each time step, which is encoded as dynamic maps that are centralized on the target road user. A conditional variational auto-encoder module is trained to learn the latent space of possible future paths based on the dynamic maps and then used to predict multiple plausible future trajectories conditioned on the observed past trajectories. Our method reports the new state-of-the-art performance (final/mean average displacement (FDE/MDE) errors 1.183/0.356 meters) on benchmark datasets and wins the first place in the open challenge of Trajnet.
AB - Trajectory prediction is a crucial task in different communities, such as intelligent transportation systems, photogrammetry, computer vision, and mobile robot applications. However, there are many challenges to predict the trajectories of heterogeneous road agents (e.g. pedestrians, cyclists and vehicles) at a microscopical level. For example, an agent might be able to choose multiple plausible paths in complex interactions with other agents in varying environments, and the behavior of each agent is affected by the various behaviors of its neighboring agents. To this end, we propose an end-to-end generative model named Attentive Maps Encoder Network (AMENet) for accurate and realistic multi-path trajectory prediction. Our method leverages the target road user's motion information (i.e. movement in xy-axis in a Cartesian space) and the interaction information with the neighboring road users at each time step, which is encoded as dynamic maps that are centralized on the target road user. A conditional variational auto-encoder module is trained to learn the latent space of possible future paths based on the dynamic maps and then used to predict multiple plausible future trajectories conditioned on the observed past trajectories. Our method reports the new state-of-the-art performance (final/mean average displacement (FDE/MDE) errors 1.183/0.356 meters) on benchmark datasets and wins the first place in the open challenge of Trajnet.
KW - cs.CV
KW - Trajectory prediction
KW - Encoder
KW - Generative model
UR - http://www.scopus.com/inward/record.url?scp=85100073403&partnerID=8YFLogxK
U2 - 10.1016/j.isprsjprs.2020.12.004
DO - 10.1016/j.isprsjprs.2020.12.004
M3 - Article
VL - 172
SP - 253
EP - 266
JO - ISPRS Journal of Photogrammetry and Remote Sensing
JF - ISPRS Journal of Photogrammetry and Remote Sensing
SN - 0924-2716
ER -