This repository contains a notebook with simple prototypes that add an attention mechanism to an LSTM for sequence labeling. It is part of a presentation for the TensorFlow Meetup Buenos Aires, June 2018.
We also include scripts to visualize the attention weights obtained after training on the Named Entity Recognition task, using a portion of the CoNLL 2003 dataset.
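As a rough illustration of the idea (not the notebook's exact implementation), the following NumPy sketch computes dot-product attention over a sequence of LSTM hidden states; the scoring vector `w` stands in for a hypothetical learned parameter:

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_over_states(h, w):
    """Attention over LSTM hidden states.

    h: (timesteps, hidden) matrix of LSTM outputs.
    w: (hidden,) scoring vector (hypothetical learned parameter).
    Returns the attention weights and the resulting context vector.
    """
    scores = h @ w            # one relevance score per timestep
    alpha = softmax(scores)   # weights sum to 1 across timesteps
    context = alpha @ h       # weighted sum of the hidden states
    return alpha, context

# Toy example: 5 timesteps, hidden size 8
rng = np.random.RandomState(0)
h = rng.randn(5, 8)
w = rng.randn(8)
alpha, context = attention_over_states(h, w)
```

The weights `alpha` are what the visualization scripts plot: they show which timesteps the model focused on when producing a label.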
The original dataset was uploaded to Kaggle, along with a vanilla LSTM implementation. We have also hosted it on the UNC servers:
- Full dataset (150M)
- Sample dataset (14M)
There are also some trained models you can download, as training on the full dataset takes a while even with a GPU:
- Without attention
- With attention - Model 1
- With attention - Model 1 - Linear
- With attention - Model 2
- With attention - Model 2 - Linear
To run the network, we recommend using Python 3.5 and installing:
- Keras 2.1.5
- scikit-learn 0.19.1
- pandas 0.23.0
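For convenience, the pinned versions above can be installed in one step (assuming `pip` points at a Python 3.5 environment):

```shell
pip install Keras==2.1.5 scikit-learn==0.19.1 pandas==0.23.0
```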