Skip to content

Latest commit

 

History

History
50 lines (38 loc) · 3.31 KB

README.md

File metadata and controls

50 lines (38 loc) · 3.31 KB

Transliterate - Meitei Mayek

An android app for Transliteration from English to Meitei Mayek and Meitei Mayek to English.
The app uses a Tensorflow Model for the transliteration. The model is converted to TFLite for use in android.
A web version of this project can be found on https://transliterate.meiteimayek.tk

Note

Here, the term referring to is Transliteration not to be confused with Translation.

A quote from Google :

Transliteration refers to the method of mapping from one system of writing to another based on phonetic similarity.

And we all know what Translation means.

One thought to keep in mind.
Throughout this project Translation and Transliteration will be used interchangeably. And both will be referred to Transliteration.

Neural Machine Translation - Model

The model is initially used for Translation purpose but with some tweaking and hacking I managed to used it for Transliteration. I used a Translation model because in the context of Deep Learning, Transliteration and Translation are similar problems.
It is a Seq2Seq Model with Bahdanau Attention based on Google's Neural Machine Translation System.
The best part is that it is Multilingual which means that - provided enough dataset, it can transliterate from any given language to any language (by only one Model).
The dataset that is used for training this model was extracted from the web via Web Scraping and to experiment with, a small part of the dataset is generated by a Rule based Transliteration System, which in the future may be excluded.

Features to Implement

  • Support for Bengali Script.
  • Keyboard with real time Transliteration

Contribution

Anyone who are interested in this project can fork this repository, add some features and send a pull request.
When writing the code, contributors should try to go along with the coding style that is used by the Project as far as applicable (import the .idea configurations for the coding style guide).
And for any kind of data flow, the contributor should try to use either one of RxJava's observable sequences.
Contributors are requested to add their names to the section below.

Author

LICENSE

Transliterate - Meitei Mayek Copyright (C) 2019-2020 Gyanendro Kh

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.