Skip to content

Original code

Pre-release
Pre-release
Compare
Choose a tag to compare
@madelonhulsebos madelonhulsebos released this 09 Feb 11:57
· 106 commits to master since this release
6254a62

This release reflects the code that was used for the experiments in the paper "Sherlock: a deep learning approach to semantic data type detection" (link to the paper on arXiv). This release provides code for:

  • Download of the original train and test data used for the experiment results as reported in the paper.
  • Feature extraction to numerically represent new columns.
  • Evaluating a trained Sherlock model on unseen table columns.
  • Retraining the original Sherlock model.

This release consists inefficiencies and bugs, hence it is recommended to use the latest release of this project in production settings or new research projects. More about this project can be found on this website.