Emergency Evacuation Map Guided Navigation via Topological Alignment and VLM Reasoning

  • Canzhi Chen
  • , Weiqi Huang
  • , Jiaxin Li
  • , Zan Wang
  • , Huijun Di
  • , Wei Liang*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Traditional map-based navigation requires time-consuming mapping, while map-free methods often involve inefficient exploration. Both are ill-suited for time-critical scenarios such as emergency rescue. The readily available structural semantic map (eg, an evacuation map) is inherently well-suited for such scenarios, as it provides crucial geometric and semantic cues to support efficient navigation. However, applying the map to robot navigation tasks remains challenging due to the discrepancy between the map and the real environment, caused by the lack of metric information and inherent geometric distortions. To address these challenges, we propose ENAV, a unified framework that integrates room topology extraction, topology-based localization through alignment, and vision–language model (VLM)-guided planning to enable efficient navigation using evacuation maps. Specifically, given a target room, ENAV first extracts room topology from both the evacuation map and the real-time constructed metric map, and performs localization via topology alignment. It then employs a vision–language model (VLM) to generate intermediate sub-goals, and finally plans low-level actions to reach each sub-goal incrementally. Extensive experiments on our curated dataset demonstrate that our algorithm outperforms other baselines by a large margin in terms of SR and SPL metrics, highlighting the effectiveness and efficiency of the proposed framework.

Original languageEnglish
Title of host publicationPattern Recognition and Computer Vision - 8th Chinese Conference, PRCV 2025, Proceedings
EditorsJosef Kittler, Hongkai Xiong, Jian Yang, Xilin Chen, Jiwen Lu, Weiyao Lin, Jingyi Yu, Weishi Zheng
PublisherSpringer Science and Business Media Deutschland GmbH
Pages519-533
Number of pages15
ISBN (Print)9789819556786
DOIs
Publication statusPublished - 2026
Event8th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2025 - Shanghai, China
Duration: 15 Oct 202518 Oct 2025

Publication series

NameLecture Notes in Computer Science
Volume16277 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2025
Country/TerritoryChina
CityShanghai
Period15/10/2518/10/25

Fingerprint

Dive into the research topics of 'Emergency Evacuation Map Guided Navigation via Topological Alignment and VLM Reasoning'. Together they form a unique fingerprint.

Cite this