GC-GAT: Multimodal Vehicular Trajectory Prediction Using Graph Goal Conditioning and Cross-Context Attention

Gulzar, Mahir, Muhammad, Yar and Muhammad, Naveed (2025) GC-GAT: Multimodal Vehicular Trajectory Prediction Using Graph Goal Conditioning and Cross-Context Attention. IEEE Robotics and Automation Letters, 10 (8). 8316 - 8323. ISSN 2377-3766
Copy

Predicting future trajectories of surrounding vehicles heavily relies on what contextual information is given to a motion prediction model. The context itself can be static (lanes, regulatory elements, etc) or dynamic (traffic participants). This letter presents a lane graph-based motion prediction model that first predicts graph-based goal proposals and later fuses them with cross attention over multiple contextual elements. We follow the famous encoder-interactor-decoder architecture where the encoder encodes scene context using lightweight Gated Recurrent Units, the interactor applies cross-context attention over encoded scene features and graph goal proposals, and the decoder regresses multimodal trajectories via Laplacian Mixture Density Network from the aggregated encodings. Using cross-attention over graph-based goal proposals gives robust trajectory estimates since the model learns to attend to future goal-relevant scene elements for the intended agent. We evaluate our work on nuScenes motion prediction dataset, achieving state-of-the-art results.

picture_as_pdf

picture_as_pdf
GC_GAT_Multimodal_Vehicular_Trajectory_Prediction_using_Graph_Goal_Conditioning_and_Cross_context_Attention-2.pdf
subject
Published Version
Available under Creative Commons: BY 4.0

View Download

EndNote BibTeX Reference Manager Refer Atom Dublin Core RIOXX2 XML Data Cite XML METS MODS MPEG-21 DIDL OpenURL ContextObject ASCII Citation OpenURL ContextObject in Span HTML Citation
Export

Downloads