Abstract
Highlights: What are the main findings? A calibration-free, end-to-end graph-based framework is proposed for joint camera and subject registration in UGV swarms, operating robustly under severe occlusion and low inter-view co-visibility. A graph-based pose propagation module (GPPM) is proposed that enables global alignment via BFS-guided pose propagation along local co-visibility links, eliminating the need for full co-visibility with a root node or pre-calibrated extrinsics. What are the implications of the main findings? Despite being a purely vision-based solution, the method enables infrastructure-free perception in GPS-denied or complex environments by decoupling multi-view fusion from explicit agent pose estimation, therefore supporting dynamic deployment of UGV swarms with a central node for robust collaborative perception tasks. The method enables robust BEV scene representation and collective situational awareness for UGV swarms, even when inter-camera overlap is minimal and occlusions are severe. In unmanned ground vehicle (UGV) swarm systems, comprehensive environmental awareness is critical for coordinated operations. Yet they are frequently deployed in occlusion-rich, constrained environments where multi-agent visual fusion is essential. However, existing methods are critically limited by offline-calibrated extrinsic parameters, hindering flexible deployment, and by a strong co-visibility assumption, which fails under severe occlusion. To overcome these constraints, we introduce an end-to-end, calibration-free framework for the joint registration of cameras and subjects. Our approach begins with a single-view module that estimates subjects’ poses and appearance features. Subsequently, a novel graph-based pose propagation module (GPPM) treats UGVs’ cameras as nodes in a graph, connecting them with edges when they share co-visible subjects identified via appearance matching. Breadth-first search (BFS) then finds the shortest registration path from any camera to a designated root camera, enabling pose propagation via local co-visibility links and global alignment of all subjects into a unified bird’s-eye-view (BEV) space. This strategy relaxes the stringent requirement of full co-visibility with the root node. A multi-task loss function is proposed to jointly optimize pose estimation and feature matching. Trained and evaluated on a synthetic dataset with occlusions (CSRD-O) collected by a UGV swarm system, our framework achieves mean camera pose errors of 1.57 m/8.70° and mean subject pose errors of 1.40 m/9.14°. Furthermore, we demonstrate a scene monitoring task using a UGV swarm system. Experiments show that the proposed method generates robust BEV estimates even under severe occlusion and low inter-view overlap. This work presents a purely visual, self-calibrating multi-view fusion perception scheme, demonstrating its potential to support cooperative perception, task-oriented monitoring, and collective situational awareness in UGV swarm systems.
| Original language | English |
|---|---|
| Article number | 214 |
| Journal | Drones |
| Volume | 10 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - Mar 2026 |
Keywords
- bird’s-eye-view
- calibration-free
- multi-view fusion
- pose registration
- unmanned ground vehicle
Fingerprint
Dive into the research topics of 'UGV Swarm Multi-View Fusion Under Occlusion: A Graph-Based Calibration-Free Framework'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver