Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2023-ACL-Understanding and Improving the Robustness of Terminology Constraints in Neural Machine Translation #241

Open
thangk opened this issue Jun 24, 2024 · 2 comments
Assignees
Labels
literature-review Summary of the paper related to the work

Comments

@thangk
Copy link
Collaborator

thangk commented Jun 24, 2024

Link: ACL Anthology

Main problem

Existing test sets such as IATE, Wiktionary, and TICO employ oversimplified constraint settings and thus leave room for improvement on accuracy and translation quality.

Proposed method

Author proposes an approach which leverages a combination of two methods Place-Holder (PH) and Code-Switch (CS) which brings their advantages together to produce results that are high in accuracy and high in translation quality simultaneously-Robust Terminology Translation (RTT) model.

My Summary

The proposed method (RTT) yielded better performance than PH and CS which are either proficient in BLEU/COMET or SCA by achieving high translation quality and constraint accuracy at the same time. RTT achieves BLEU score of 40.2 in average, COMET score of 0.4866. In SCA test, RTT falls slightly behind PH but leads performance gap over CS of about 20%. In the ablation study, two components term embedding (TermE) and loss masking (LM) yielded best results. One of the major drawbacks of this method is due to its hard copy mechanism which may show performance inefficiency in more complex languages such as Arabic where phrases or terminologies often contain conjunctions or prepositions and thus this area is open to further studies.

Datasets

Custom owned test set derived from English-German (to make a challenging test set)
WMT16 English-German
WMT18 English-German (Europarl, News Commentary)
IATE
Wiktionary

@hosseinfani hosseinfani added the literature-review Summary of the paper related to the work label Jun 25, 2024
@hosseinfani
Copy link
Member

@thangk
thanks for the summary. But I'm not sure I understood the problem and how they address it, in a high-level. An example would clarify.

Also,

  • venue on the title,
  • codebase

btw, for our work, we need to show the effect of the translator we use for our task. So, it's good to see if the proposed translators can be executed within openmt. Otherwise, we have to use their code, which is difficult.

@thangk
Copy link
Collaborator Author

thangk commented Jun 25, 2024

@thangk thanks for the summary. But I'm not sure I understood the problem and how they address it, in a high-level. An example would clarify.

Also,

  • venue on the title,
  • codebase

btw, for our work, we need to show the effect of the translator we use for our task. So, it's good to see if the proposed translators can be executed within openmt. Otherwise, we have to use their code, which is difficult.

Sure, I'll make an update to this.

@thangk thangk changed the title Understanding and Improving the Robustness of Terminology Constraints in Neural Machine Translation (2023) 2023-ACL-Understanding and Improving the Robustness of Terminology Constraints in Neural Machine Translation Jun 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
literature-review Summary of the paper related to the work
Projects
None yet
Development

No branches or pull requests

2 participants