Abstract
In view of the high false alarm rate and multi-sensory task integration challenges faced by amphibious unmanned platforms in uncertain environments, in this study a multi-model joint environment perception method based on the segment anything model (SAM) was proposed, which achieved unified processing of obstacle detection and amphibious domain segmentation. Specifically, U-Net and YOLOv8 were combined with SAM. U-Net and YOLOv8 were responsible for obtaining the rough outline of the target, while SAM achieved fine segmentation through its encoding-decoding structure. In addition, a special fine-tuning strategy was designed to achieve joint training, which further improved the performance of the model. In this study, a proprietary dataset USV-Dataset was also constructed and a data engine was developed to improve the annotation efficiency. In order to enhance the generalization ability of the model, four public datasets were used for mixed training with USV-Dataset, covering a variety of scenarios and obstacle categories. Experimental results show that this method achieves 96.8% mPA segmentation accuracy and 10 FPS inference speed, showing good generalization ability and meeting the real-time environment perception needs of medium and low-speed amphibious unmanned platforms.
| Translated title of the contribution | 基于 SAM 的水陆两栖环境感知微调策略与应用 |
|---|---|
| Original language | English |
| Pages (from-to) | 20-28 |
| Number of pages | 9 |
| Journal | Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology |
| Volume | 46 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 2026 |
| Externally published | Yes |
Keywords
- amphibious platform
- environmental perception
- multi-model fusion
- SAM