This repo hosts the classification library we used in MuLaN; if you find our work useful, consider citing us:
@inproceedings{ijcai2020-531,
title = {Mu{L}a{N}: Multilingual Label propagatio{N} for Word Sense Disambiguation},
author = {Barba, Edoardo and Procopio, Luigi and Campolungo, Niccolò and Pasini, Tommaso and Navigli, Roberto},
booktitle = {Proceedings of the Twenty-Ninth International Joint Conference on
Artificial Intelligence, {IJCAI-20}},
publisher = {International Joint Conferences on Artificial Intelligence Organization},
pages = {3837--3844},
year = {2020},
month = {7},
doi = {10.24963/ijcai.2020/531},
url = {https://doi.org/10.24963/ijcai.2020/531},
}
We are in the process of updating and improving several parts of this framework!
bash setup.sh
See this repo.
PYTHONPATH=src/ python src/scripts/model/train.py configurations/mbert/feature-based.jsonnet semcor-zero-shot --train data/datasets/train/SemCor/semcor --dev data/datasets/eval/semeval2007/semeval2007
We will soon release additional models, especially the MuLaN versions of the paper; for the time being, we release:
- semcor-mbert-zero-shot, corresponding to ∅-shot-SemCor in Table 2 of the paper
PYTHONPATH=src/ python src/scripts/eval/raganato_eval.py experiments/semcor-zero-shot/best.th multilingual-semeval13 --cut-senses --cuda-device 0 --batch-size 16
For any question either open an issue on github or contact:
All data and codes provided in this repository are subject to the Attribution-Non Commercial-ShareAlike 4.0 International license (CC BY-NC 4.0).