Create Swedish to all other languages translation process #79

andrewtavis · 2024-02-24T13:35:00Z

Terms

I have searched open and closed feature requests
I agree to follow Scribe-Data's Code of Conduct

Description

The goal of this issue is to create a process whereby a single file is used to translate all words within Swedish/translations/words_to_translate.json to all other Scribe languages. To achieve this we'll be using m2m100_418M, with the output being a JSON file that has a string and keyed values for each language. This can then be transferred to an SQLite database table with each string in an index corresponding to a column value for each language.

Of specific importance is trying to get a metric of the accuracy of the translation and doing a cutoff such that we're no longer including low quality translations in Scribe applications :)

Contribution

Happy to work on this or support someone with interest in working on it!

Shorla · 2024-03-04T10:53:47Z

Hi @andrewtavis, My name is Olushola Ogunkelu. I am a new GSOC contributor. I went through this issue and I would like to work on it.

A bit of my background: I have experience contributing to open source and working with Python. But this is my first time working with machine translation.

I would appreciate some help on how to get started. Also, do I need to understand Swedish language to work on this?

andrewtavis · 2024-03-04T11:07:14Z

Hey @Shorla 👋 We'll be merging in an issue in a few days that will help you work on this. You'll be able to follow the code for the English translations that we have a PR from, and I'm sure that @henrikth93 would be willing to help us check the quality of some of the translations afterwards! I'll be in touch when you can start working on this, but for now I'll assign you 😊

Shorla · 2024-03-04T11:16:11Z

Thank you! I can't wait.

andrewtavis · 2024-03-18T01:09:43Z

Hey @Shorla 👋 The process has been set up and we're ready to implement here :) It's actually quite streamlined now. If you make a version of scribe_data/extract_transform/languages/English/translations/translate_words.py that replaces SRC_LANG with Swedish we should be good to go here 😊

Shorla · 2024-03-19T13:15:59Z

Thank you! I will get right to it

andrewtavis · 2024-03-20T22:12:12Z

Closed via #114 😊 Appreciate the support with this, @Shorla!

andrewtavis added feature New feature or request help wanted Extra attention is needed good first issue Good for newcomers labels Feb 24, 2024

andrewtavis assigned Shorla Mar 4, 2024

shashank-iitbhu mentioned this issue Mar 4, 2024

Add translation funcs to utils #88

Closed

1 task

andrewtavis added a commit to Shorla/Scribe-Data that referenced this issue Mar 20, 2024

scribe-org#79 ignore import order warning and add EOF line

49bfcf2

andrewtavis closed this as completed Mar 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create Swedish to all other languages translation process #79

Create Swedish to all other languages translation process #79

andrewtavis commented Feb 24, 2024

Shorla commented Mar 4, 2024 •

edited

Loading

andrewtavis commented Mar 4, 2024

Shorla commented Mar 4, 2024

andrewtavis commented Mar 18, 2024

Shorla commented Mar 19, 2024

andrewtavis commented Mar 20, 2024

Create Swedish to all other languages translation process #79

Create Swedish to all other languages translation process #79

Comments

andrewtavis commented Feb 24, 2024

Terms

Description

Contribution

Shorla commented Mar 4, 2024 • edited Loading

andrewtavis commented Mar 4, 2024

Shorla commented Mar 4, 2024

andrewtavis commented Mar 18, 2024

Shorla commented Mar 19, 2024

andrewtavis commented Mar 20, 2024

Shorla commented Mar 4, 2024 •

edited

Loading