
Identify Contrails to Reduce Global Warming

My contribution to the 2023 Kaggle competition:

https://www.kaggle.com/competitions/google-research-identify-contrails-reduce-global-warming

My goal was to learn some advanced TensorFlow and to get some practice (see also statistics).

It was my first Kaggle competition and it was also an opportunity to explore Google Cloud's Vertex AI.

Submission notebook

The notebook I worked on for the competition is

notebooks/identify-contrails.ipynb

After the deadline, I continued with

notebooks/identify-contrails-multi-gpu.ipynb

Features

  • A data pipeline based on TFRecordDataset, since the Keras dataset was not efficient enough to keep the GPU properly fed; the TFRecord generation is parallelized with multiprocessing (see the input-pipeline sketch after this list)
  • Data augmentation: custom multi-frame image augmentation for segmentation that applies the same random transform to all time frames and to the segmentation mask, since the keras.layers preprocessing layers do not support this (see the augmentation sketch after this list)
  • Models:
    • Pseudo-3D ResNet (P3DResNet) implementation inspired by Keras' ResNet50 (see the P3D block sketch after this list)
    • U-Net with self-made encoder, ResNet50 or P3DResNet
    • DeepLabV3+ with ResNet50 or P3DResNet
  • Gradient accumulation (the implementation did not show any benefit)
  • Mixed-precision GPU training: results in a 2x speed-up
  • Multi-GPU support (added in a late submission): results in a 2x speed-up on 2x T4; see the <STRATEGY> tag in the code comments and the setup sketched under GPU tuning below
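
For illustration, here is a minimal sketch of what such a TFRecordDataset input pipeline can look like. The feature names ("frames", "mask"), tensor shapes and batch size are assumptions for the example, not the repository's actual schema.

```python
import tensorflow as tf

# Assumed serialization schema: each example stores the time frames and the
# segmentation mask as serialized tensors.
FEATURE_SPEC = {
    "frames": tf.io.FixedLenFeature([], tf.string),
    "mask": tf.io.FixedLenFeature([], tf.string),
}

def parse_example(serialized):
    example = tf.io.parse_single_example(serialized, FEATURE_SPEC)
    frames = tf.io.parse_tensor(example["frames"], out_type=tf.float32)
    mask = tf.io.parse_tensor(example["mask"], out_type=tf.float32)
    frames.set_shape([8, 256, 256, 3])  # (time, height, width, channels) -- assumed
    mask.set_shape([256, 256, 1])
    return frames, mask

def make_dataset(tfrecord_files, batch_size=32):
    return (
        tf.data.TFRecordDataset(tfrecord_files, num_parallel_reads=tf.data.AUTOTUNE)
        .map(parse_example, num_parallel_calls=tf.data.AUTOTUNE)
        .shuffle(1024)
        .batch(batch_size, drop_remainder=True)
        .prefetch(tf.data.AUTOTUNE)
    )
```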
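And a minimal sketch of multi-frame augmentation: one random decision drives the transform of every time frame and of the mask. The particular set of transforms (flip plus 90-degree rotation) is an assumption for the example.

```python
import tensorflow as tf

def augment(frames, mask):
    """frames: (time, H, W, C), mask: (H, W, 1); same random transform for both."""
    # Horizontal flip with probability 0.5, applied identically to frames and mask.
    flip = tf.random.uniform(()) > 0.5
    frames = tf.cond(flip, lambda: tf.reverse(frames, axis=[2]), lambda: frames)
    mask = tf.cond(flip, lambda: tf.reverse(mask, axis=[1]), lambda: mask)
    # Random 90-degree rotation, same k for all frames and the mask.
    k = tf.random.uniform((), minval=0, maxval=4, dtype=tf.int32)
    frames = tf.image.rot90(frames, k=k)  # 4-D input is treated as a batch of images
    mask = tf.image.rot90(mask, k=k)
    return frames, mask

# Usage: train_ds = train_ds.map(augment, num_parallel_calls=tf.data.AUTOTUNE)
```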
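The core idea of the pseudo-3D blocks is to factorize a 3x3x3 convolution into a 1x3x3 spatial convolution followed by a 3x1x1 temporal one. Below is a minimal sketch of such a block; the repository's actual P3DResNet blocks (strides, normalization, residual wiring) may differ.

```python
import tensorflow as tf
from tensorflow.keras import layers

def p3d_block(x, filters):
    """Pseudo-3D residual block; x has shape (batch, time, height, width, channels)."""
    shortcut = x
    # Spatial convolution (acts within each time frame).
    x = layers.Conv3D(filters, (1, 3, 3), padding="same", use_bias=False)(x)
    x = layers.BatchNormalization()(x)
    x = layers.Activation("relu")(x)
    # Temporal convolution (mixes information across time frames).
    x = layers.Conv3D(filters, (3, 1, 1), padding="same", use_bias=False)(x)
    x = layers.BatchNormalization()(x)
    # Project the shortcut if the channel count changes, then add the residual.
    if shortcut.shape[-1] != filters:
        shortcut = layers.Conv3D(filters, 1, padding="same")(shortcut)
    x = layers.Add()([x, shortcut])
    return layers.Activation("relu")(x)
```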

History

The history of my (still somewhat messy) experiments:

notebooks-logs/History.ipynb

A summary of my submissions:

| version | model | augm. | epochs | train | valid (threshold) | valid | submission | final score |
|---------|-------|-------|--------|-------|-------------------|-------|------------|-------------|
| v44 | UNet/Basic | light | 30 | 0.5008 | 0.4321 (0.3) | 0.5529 | 0.613 | 0.633 |
| v47 | DeepLab/ResNet50 | no | 50 | 0.7321 | 0.4098 (0.2) | 0.4578 | 0.509 | 0.540 |
| v48 | DeepLab/ResNet50 | light | 50 | 0.5126 | 0.3221 (0.2) | 0.4592 | 0.458 | 0.510 |
| v49 | DeepLab/P3DResNet | no | 32 | 0.5214 | 0.3475 (0.2) | 0.4546 | 0.499 | 0.518 |
| v45 | DeepLab/P3DResNet | light | 40 | 0.4329 | 0.3028 (0.2) | 0.4633 | 0.546 | 0.557 |

I struggled with slow iterations of my U-Net (still unexplained) and with data augmentation that did not improve generalization as expected. The latter is probably due to the misalignment of the masks with the images, as mentioned in this discussion.

GPU tuning

Benchmark

Measured on a small subset of the data; epoch 2 is the more representative timing, since epoch 1 includes graph tracing and pipeline warm-up.

| configuration | epoch 1 | epoch 2 |
|---------------|---------|---------|
| single GPU, tf dataset | 73s | 38s (100%) |
| multi GPU, tf dataset | 74s | 20s (60%) |
| multi GPU, distributed dataset | 76s | 21s |
| single GPU, tf dataset + mixed precision | 63s | 15s (100%) |
| multi GPU, tf dataset + mixed precision | 82s | 9s (60%) |
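
For reference, the mixed-precision and multi-GPU configurations above boil down to the standard Keras setup sketched below; the model factory, loss and batch size are placeholders, not the repository's actual choices.

```python
import tensorflow as tf

# Mixed precision: compute in float16, keep variables in float32.
tf.keras.mixed_precision.set_global_policy("mixed_float16")

# MirroredStrategy replicates the model on all visible GPUs (e.g. 2x T4) and
# splits each global batch across them.
strategy = tf.distribute.MirroredStrategy()
with strategy.scope():
    model = build_model()  # hypothetical model factory
    model.compile(optimizer="adam", loss="binary_crossentropy")

# model.fit() distributes a plain tf.data dataset automatically; wrapping it with
# strategy.experimental_distribute_dataset() is the "distributed dataset" variant.
train_ds = make_dataset(train_files, batch_size=64)  # hypothetical helper
model.fit(train_ds, epochs=30)
```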

Next

I got a first taste of Kaggle and I am excited to take part in another competition!

Next time I will certainly work more with scripts rather than only notebooks, and explore both the Kaggle and gcloud APIs.

See you soon :)

Published under the MIT License, see LICENSE in this repository.
