Transformer_Time_Series

DISCLAIMER: THIS IS NOT THE PAPER'S CODE. THIS DOES NOT HAVE SPARSITY. THIS IS TEACHER-FORCED LEARNING. This repository only tries to replicate the simple example, without sparsity, from Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting (NeurIPS 2019).
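For readers unfamiliar with the term, teacher forcing means that during training the model is fed the ground-truth value at each previous step rather than its own prediction. The sketch below illustrates the idea with NumPy only. It is not taken from this repository's notebooks, and the helper names `teacher_forcing_pairs` and `causal_mask` are hypothetical.

```python
import numpy as np


def teacher_forcing_pairs(series):
    """Split a 1-D series into (inputs, targets) shifted by one step.

    Hypothetical helper: the input at position i is the true value at step i,
    and the target at position i is the true value at step i + 1.
    """
    series = np.asarray(series, dtype=float)
    return series[:-1], series[1:]


def causal_mask(length):
    """Boolean lower-triangular mask: position i may attend to positions <= i."""
    return np.tril(np.ones((length, length), dtype=bool))


series = np.arange(6.0)                 # toy signal: 0, 1, 2, 3, 4, 5
inputs, targets = teacher_forcing_pairs(series)
mask = causal_mask(len(inputs))         # shape (5, 5)
```

The mask is what keeps a decoder from seeing future values even though the whole ground-truth sequence is supplied at once. At inference time there is no ground truth to feed in, so predictions have to be generated step by step instead.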

The results match those reported in the paper for the synthetic dataset, as shown in the table below.

The synthetic dataset was constructed as shown below.
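A minimal sketch of such a generator, assuming the piecewise sinusoid described in the paper's synthetic-data experiment: amplitudes A1, A2 and A3 are drawn uniformly from [0, 60], A4 = max(A1, A2), the offset is 72, the noise is unit-variance Gaussian, and the period changes over the last 24 steps. These constants are written from memory of the paper and should be checked against `Dataloader.py`. The function name `synthetic_series` and the default `t0=96` are illustrative assumptions, not this repository's API.

```python
import numpy as np


def synthetic_series(t0=96, n_series=4, seed=0):
    """Generate noisy piecewise sinusoids of length t0 + 24 (assumed recipe)."""
    rng = np.random.default_rng(seed)
    x = np.arange(t0 + 24)

    # One amplitude per segment and per series; the trailing axis of size 1
    # lets the amplitudes broadcast against the time axis.
    a1, a2, a3 = rng.uniform(0.0, 60.0, size=(3, n_series, 1))
    a4 = np.maximum(a1, a2)  # the forecast window depends on the first two segments

    # Segment boundaries: [0, 12), [12, 24), [24, t0), [t0, t0 + 24).
    amplitude = np.where(
        x < 12, a1, np.where(x < 24, a2, np.where(x < t0, a3, a4))
    )
    # The sine argument is pi*x/6 before t0 and pi*x/12 afterwards.
    divisor = np.where(x < t0, 6.0, 12.0)

    noise = rng.normal(0.0, 1.0, size=(n_series, x.size))
    y = amplitude * np.sin(np.pi * x / divisor) + 72.0 + noise
    return x, y


x, y = synthetic_series(t0=96, n_series=4, seed=0)
```

Because A4 depends on A1 and A2, predicting the last 24 steps well requires looking back at the very start of the series, which is what makes this a test of long-range dependency.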

A visualization of how the attention layers attend to the signal when predicting the last timestep, t = t0 + 24 - 1.
