Skip to content

ipleiria-ciic/reconstructing-cvae

Repository files navigation

Reconstructing Spatiotemporal Data with C-VAEs



Description

The continuous representation of spatiotemporal data commonly relies on using abstract data types, such as moving regions, to represent entities whose shape and position continuously change over time. Creating this representation from discrete snapshots of real-world entities requires using interpolation methods to compute in-between data representations and estimate the position and shape of the object of interest at arbitrary temporal points. Existing region interpolation methods often fail to generate smooth and realistic representations of a region's evolution. However, recent advancements in deep learning techniques have revealed the potential of deep models trained on discrete observations to capture spatiotemporal dependencies through implicit feature learning.

In this work, we explore the capabilities of Conditional Variational Autoencoder (C-VAE) models to generate smooth and realistic representations of the spatiotemporal evolution of moving regions. We evaluate our proposed approach on a sparsely annotated dataset on the burnt area of a forest fire. We apply compression operations to sample from the dataset and use the C-VAE model and other commonly used interpolation algorithms to generate in-between region representations. To evaluate the performance of the methods, we compare their interpolation results with manually annotated data and regions generated by a U-Net model. We also assess the quality of generated data considering temporal consistency metrics.

Burned Area 2D Moving Region

Continuous representation model requires a method to recreate the spatiotemporal evolution of a region, such as the progression of the burned area.

C-VAE Architecture

Employed C-VAE Architecture. a) each region stored in WKT format is converted to raster image to be processed by the model b) a new image is generated conditioned by a label and converted to WKT format.

Summary of Results

Similarity Evaluation. Comparison of JI and HD for U-Net Samples and BurnedAreaUAV test set using periodic and distance-based sampling.

Temporal Consistency Comparison. Average temporal consistency across different algorithms for periodic and distance-based sampling.

Representation of the evolution burned area.

Comparison of Interpolation from Distance-Based Sampling

Comparison of Interpolation from Distance-Based Sampling

TL;DR

The C-VAE algorithm performed competitively against the best-performing algorithm (Shape-Based) in terms of similarity metrics and also achieved superior temporal consistency and generated a relatively realistic and smooth representation of the phenomenon evolution, suggesting that C-VAE models may be a viable alternative to modelling the spatiotemporal evolution of 2D moving regions.

Data Download

BurnedAreaUAV Dataset Citation

@misc{ba_uav_ribeiro_dataset,
  author       = {Ribeiro, Tiago F. R. and Silva, Fernando and Moreira, José and Costa, Rogério Luís de C.},
  title        = {BurnedAreaUAV Dataset (v1.1)},
  month        = may,
  year         = 2023,
  publisher    = {Zenodo},
  version      = {1.1},
  doi          = {10.5281/zenodo.7944963},
}

Preprint Citation

@misc{ribeiro2023preprint,
      title={Reconstructing Spatiotemporal Data with C-VAEs}, 
      author={Tiago F. R. Ribeiro and Fernando Silva and Rogério Luís de C. Costa},
      year={2023},
      eprint={2307.06243},
      archivePrefix={arXiv},
      primaryClass={cs.DB},
      doi = {https://doi.org/10.48550/arXiv.2307.06243}
}

Paper Citation

@InProceedings{ribeiro2023ADBIS,
author="Ribeiro, Tiago F. R.
and Silva, Fernando
and de C. Costa, Rog{\'e}rio Lu{\'i}s",
editor="Abell{\'o}, Alberto
and Vassiliadis, Panos
and Romero, Oscar
and Wrembel, Robert",
title="Reconstructing Spatiotemporal Data with C-VAEs",
booktitle="Advances in Databases and Information Systems",
year="2023",
publisher="Springer Nature Switzerland",
address="Cham",
pages="59--73",
isbn="978-3-031-42914-9"
}

Acknowledgements

This work is partially funded by FCT - Fundação para a Ciência e a Tecnologia, I.P., through projects MIT-EXPL/ACC/0057/2021 and UIDB/04524/2020, and under the Scientific Employment Stimulus - Institutional Call - CEE/CINST/00051/2018.