COMET: A Neural Framework for MT Evaluation
https://github.com/Unbabel/COMET
The available models are COMET using the reference-based wmt20-comet-da
model or the reference-free wmt20-comet-qe-da
model.
- COMET:
- Description: A machine translation evaluation metric.
- Name:
rei2020-comet
- Usage:
The
from repro.models.rei2020 import COMET model = COMET() # reference-based inputs = [ {"candidate": "The candidate to score", "sources": ["The source text"], "reference": ["The reference"]} ] macro, micro = model.predict_batch(inputs) # reference-free inputs = [ {"candidate": "The candidate to score", "sources": ["The source text"]} ] macro, micro = model.predict_batch(inputs)
macro
andmicro
are the averaged and input-level COMET scores. The reference-based key is"comet"
and the reference-free key is"comet-src"
.
Only 1 source document and 1 reference translation are supported.
- Image name:
danieldeutsch/rei2020:1.0
- Build command:
repro setup rei2020
- Requires network: Yes, the code still makes a network request even if the models are pre-cached.
repro setup rei2020
pytest models/rei2020/tests
- Regression unit tests pass
See here - Correctness unit tests pass
- Model runs on full test dataset
- Predictions approximately replicate results reported in the paper
- Predictions exactly replicate results reported in the paper