-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Disambiguate homonyms in glossaries / provide alternative translations? #97
Comments
Hi, thanks for your question! This is a bit tricky.
An example where it works (sorry I don't speak Russian) with English to German: "rain" in english can be the rain (German: Regen) or to rain (German: regnen). I can add a term 'rain => schütten' (only makes sense for the verb form) to my glossary, and it translates: "It was raining a lot that day. The rain really did not seem to stop." into "Es schüttete an diesem Tag sehr viel. Der Regen schien wirklich nicht aufzuhören." However, doing the same in your example fails. This will depend on the specific languages and its associated grammar, for example you can add infinitive markers ("to rain") or articles ("the rain") in your glossary definition, if they exist in the associated language. Glossary term: 'пропасть => to disappear' "Загляните в пропасть. Пропасть разверзлась подо мной." "Look into the disappear. The disappearance opened up beneath me." I flagged this to the team responsible for glossaries and we might fix this in the future. |
Hi Jan @JanEbbing, thanks a lot for your answer and for signaling the problem. Another problem are the homonyms of the same part of speech, e.g. plane "airplane" vs. plane as a term in geometry. |
I don't think we would want to add a POS tag as it doesn't fully solve the problem as you remarked (I would even guess that the majority of homonyms are between words of the same POS). I will check with our glossary team if adding support to distinguish between different word meanings in glossaries is on the roadmap, but can't give an estimate. |
Hello,
Is there a way to specify different translations for homonymous words in a glossary?
E.g. in a Russian > English translation, there is a verb 'пропасть' to which I would like to assign the translation "disappear".
But 'пропасть' can also be a noun meaning "abyss", and I don't want it to be bluntly replaced by "disappear" everywhere.
Is there a way to disambiguate two words like that in the input?
If not, it might help to be able to specify an alternative translation ranked lower than the first one, but I'm not sure how exactly that should work.
The text was updated successfully, but these errors were encountered: