Limitations With Existing Approaches for Undersea Imaging
1. Multi-Scale Marine Object Detection in Side-Scan Sonar Images
Model: BES-YOLO (modified YOLOv5 optimized for side-scan sonar)
Dataset: 1,584 labeled images across five categories
- Pipeline
- Shipwreck
- Rock
- Seabed mound
- Anchor
Performance:
-
mAP0.5: 92.4%
-
Baselines:
- YOLOv8n: 87.1%
- YOLOv5s: 71.4%
2. Underwater Garbage Detection with Improved YOLOv5
Model: Enhanced YOLOv5 for marine debris detection
Dataset: Trash_ICRA19
- 5,720 training images
- 820 testing images
- Categories: plastic debris, ROV/man-made objects, biological material
Performance:
-
Accuracy: 88.7%
-
mAP: 90.6%
3. Rethinking General Underwater Object Detection (RUOD)
Model: YOLOF baseline
Dataset: Large-scale, diverse underwater imagery
- 14,000 high-resolution images
- 74,903 labeled objects
- Ten classes (fish, turtle, diver, coral, jellyfish, scallop, cuttlefish, etc.)
Performance:
- mAP0.5:0.95: 50.1%
Common Limitation
Most existing datasets are narrowly scoped—either focusing on specific object types or constrained by limited resolution or size—which hampers model generalizability.
Potential Improvements
-
Optical Imaging Expansion Merge multiple optical datasets (e.g., Trash_ICRA19, RUOD) to create a broad, diverse training corpus capable of detecting varied underwater anomalies.
-
Sonar-Based Extension Enhance the BES-YOLO framework by aggregating additional side-scan sonar data—ideal for seabed inspection scenarios where optical imagery is unavailable.