Underwater instance segmentation is challenged by light attenuation, scattering, and color distortion. We propose BARD-ERA, a unified framework with BARDecoder for progressive boundary refinement, ERA for efficient adaptation to degradations with over 90% fewer parameters, and BACE loss for stronger boundary supervision.
We propose Segmentation-Augmented Differential Depth Estimation Regressor (SADDER), a lightweight module leveraging instance segmentation to correct residual errors, and UWSegDepth, a post-processing method that averages depths per segmented object to enhance object-level spatial structure.
While self-attention excels at modeling global information, it is less effective at capturing high frequencies (e.g., edges etc.) that deliver local information primarily, which is crucial for SISR. To tackle this, we propose a global-local awareness network (GLA-Net) to effectively capture global and local information to learn comprehensive features with low- and high-frequency information.
This research endeavors to introduce a novel architecture of generative adversarial network aimed at producing multimodal logos. Our focus lies in enhancing the model performance of compositional generative adversarial networks by integrating them with spatial transformation networks.