Endmember-Guided Nonlinear Mapping Learning for Hyperspectral Nearshore Underwater Target Detection

This project was inspired by the following two papers, I applied them to underwater target detection in remote sensing hyperspectral imagery. Due to time constraints, this work was not published.

[1] J. Jiao, Z. Gong and P. Zhong, Triplet Spectralwise Transformer Network for Hyperspectral Target Detection, in IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1-17, 2023.

[2] D. Hong et al., SpectralFormer: Rethinking Hyperspectral Image Classification With Transformers, in IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-15, 2022.

Hyperspectral underwater target detection is a promising and challenging task in remote sensing image processing. Existing methods face significant challenges when adapting to real nearshore environments, where cluttered backgrounds hinder the extraction of target signatures and exacerbate signal distortion.

We propose a two-stage method for underwater target detection in hyperspectral remote sensing images. In the first stage, the method utilizes a nonlinear mapping network to adaptively learn environmental parameters, aiming to replace the conventional bathymetric model. In the second stage, an improved triplet loss is employed to separate underwater target features in the image.

Problem Analysis

Hyperspectral underwater target detection requires identifying pixels containing the target from all pixels. However, due to the strong absorption of light by water, the spectral characteristics of the target are often 'submerged,' causing significant overlap between target samples and water background samples in most scenarios.

Our proposed method demonstrates a good capability for target feature separation.

(a) Before Training

(b) After Training

(c) Before Training

(d) After Training

Improved Triplet Loss

Suppose $x_{p}$, $x_{n}$, and $x_{a}$ are positive samples, negative samples, and anchor samples respectively. The cosine similarity between them is represented as:

$$ d_{pn} = cosine similarity(x_{p}, x_{n}) = \frac{x_{p} \cdot x_{n}}{|x_{p}| |x_{n}|} $$

$$ d_{pa} = cosine similarity(x_{p}, x_{a}) = \frac{x_{p} \cdot x_{a}}{|x_{p}| |x_{a}|} $$

$$ d_{an} = cosine similarity(x_{a}, x_{n}) = \frac{x_{a} \cdot x_{n}}{|x_{a}| |x_{n}|} $$

The improved triplet loss is divided into two parts; the first part is the standard triplet loss. A larger margin, denoted as margin2, is introduced to ensure that the similarity between the positive sample and the anchor is significantly higher than the similarity between the positive sample or the anchor and the negative sample:

$$ Loss_{Triplet1} = Max(d_{pn}, d_{an} - d_{pa} + \text{margin2}) $$

The second part of the loss is composed of $Loss_{ap}$, $Loss_{an}$ and $Loss_{pn}$:

$$ Loss_{ap} = Max(0, d_{pa} - \text{margin1} + \epsilon) $$

This term aims to increase the lower bound of the similarity between the positive sample and the anchor by setting a small margin, denoted as margin1.

$$ Loss_{an} = Max(0, \text{margin2} - d_{an}) $$

$$ Loss_{pn} = Max(0, \text{margin2} - d_{pn}) $$

$$ Loss_{Triplet2} = Loss_{ap} + Loss_{an} + Loss_{pn} $$

In summary, the improved triplet loss function effectively captures the complex relationships between samples by considering the relative similarities among the positive sample, anchor, and negative sample, while imposing distinct constraints and margins on these similarities. This approach is particularly effective in situations where subtle differences within the positive sample set are challenging to distinguish.

(a) RGB

(b) ground truth

(c) using normal triplet loss

(d) using improved triplet loss

Dataset

Due to the difficulty of deploying underwater targets and the high cost of data collection, research in this area has predominantly relied on simulated data. To advance the study of underwater target detection in real-world scenarios, we used a dataset of real underwater scenes and conducted experiments on this data. The deployed underwater target is an iron plate, and the target's prior spectral data were collected onshore.

The River Scene data sets were captured by Headwall Nano-Hyperspec imaging sensor equipped on DJI Matrice 300 RTK unmanned aerial vehicle, and it was collected at the Qianlu Lake Reservoir in Liuyang (28◦18′40.29′′ N, 113◦21′16.23′′ E), Hunan Province, China on July 31, 2021.

The Ningxiang data set was captured using the same equipment, and it was collected at the Meihua Reservoir, Ningxiang city (27◦ 56’59.72” N, 112◦ 8’50.45” E), Hunan Province , China on January 10, 2024.

Download the datasets from here, put it under the folder dataset.

Dataset format: mat
River Scene1: 242×341 pixels with 270 spectral bands
River Scene2: 255 × 261 pixels with 270 spectral bands
River Scene3: 137 × 178 pixels with 270 spectral bands
Ningxiang: The Ningxiang dataset was collected in a reservoir with high mineral content and significant sediment, which may make detection challenging.

Keys:

data: The hyperspectral imagery contains underwater targets
target: The target prior spectrum collected on land
gt: The ground truth of underwater target distribution

Training && Testing

Stage 1: Nonlinear mapping learning

Modify AE.py
Run python AE.py

Training for new dataset need to generate NDWI mask (If land areas are included).

NDWI Water Mask (require gdal): You can check out the water_mask\NDWI.py file in another project NUN-UTD

water -- 0

land -- 255

selected bands get from envi

GREEN.tif: green band 549.1280 nm

NIR.tif: near-infrared band 941.3450 nm

Stage 2: Feature separation network training

Modify TPL.py, using the trained weights of the nonlinear mapping network from the first stage.
Run python TPL.py

The trained weights and results are all stored in the result_* folders.

I am no longer conducting research in this field, this is the final research project of my master's degree. I sincerely hope that this work can be of assistance to you and contribute to the advancement of the community.

There has been limited research in this field, and many challenges remain in applying these methods to real-world scenarios. We sincerely hope that this work contributes positively to the field, despite the theoretical and practical limitations that still exist. If you have any concerns, please do not hesitate to contact [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
__pycache__		__pycache__
init_weight		init_weight
pics		pics
result_AE		result_AE
result_detection		result_detection
utils		utils
water_mask		water_mask
AE.py		AE.py
README.md		README.md
TPL.py		TPL.py
vit.py		vit.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Endmember-Guided Nonlinear Mapping Learning for Hyperspectral Nearshore Underwater Target Detection

Problem Analysis

Improved Triplet Loss

Dataset

Training && Testing

About

Releases

Packages

Languages

Jiaxuan2021/NML-UTD

Folders and files

Latest commit

History

Repository files navigation

Endmember-Guided Nonlinear Mapping Learning for Hyperspectral Nearshore Underwater Target Detection

Problem Analysis

Improved Triplet Loss

Dataset

Training && Testing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages