See:
Install:
- Install Python3 if it is not installed.
- Install required packages:
pip3 install bottle==0.12.19 manga_ocr==0.1.4 Paste
. (Use terminal to run the commands.) - Download the model and unzip it into a folder.
- Run the server:
python3 server_manga_ocr.py
. - Unzip the plugin files in ImageTrans's
plugins
folder and restart ImageTrans. (optional for v1.9.0+)
For convenience, you can also use the Windows package:
- Download and unzip manga-ocr.
- Download the model and unzip it to manga-ocr's folder in the previous step. (if you do not download the model file, the program will download the model from the Internet.)
- Run
run.bat
and wait for the server to get ready.
If you need to enable GPU, please follow this guide.
The mangaOCR works great for speech bubbles like the following one:
But it may not work well for the following long text line image by default:
For such a case, you can select the long text mode of the plugin which will crop the long text line images into segments for the OCR engine to extract the text. It should better be used in combination with a scene text detector which detects the text lines accurately like this one: https://github.com/xulihang/ImageTrans_plugins/tree/master/mangaTranslatorOCR
mangaOCR does not detect text. It can be used together with other text detection methods like the following ones:
- Speech bubble detection models: https://github.com/xulihang/balloon-dataset
- manga image translator (detect text lines): https://github.com/xulihang/ImageTrans_plugins/tree/master/mangaTranslatorOCR
You may need to update the port if you also need to run other local OCR engines:
https://github.com/xulihang/ImageTrans_plugins/tree/master#notes-using-local-servers