Could you give me some suggestions for training dataset? #140

yangxiuwu · 2018-12-09T13:19:46Z

Hi, I have trained a Chinese OCR model with CRNN, using 300W (3 million) synthetic text images as the training dataset, but the model performs poorly on real-scene images. Could you give me some suggestions for the training dataset?
Does the dataset require a fixed aspect ratio?
Does the dataset need data augmentation, e.g. transforms, blur, different font colors, diverse backgrounds, and so on?
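
The two questions above (fixed aspect ratio, and augmentation such as blur and varied font/background colors) could be sketched roughly as follows. This is only an illustration in plain Python on grayscale images stored as nested lists. The helper names (`recolor`, `box_blur`, `resize_to_height`, `pad_to_width`, `augment`), the 32x128 target size, and the gray-level ranges are all assumptions made for the example, not settings taken from this repository. A real pipeline would more likely use OpenCV or Pillow.

```python
import random

def recolor(img, fg, bg):
    """Map black-on-white text (0 = ink, 255 = paper) to new ink/background gray levels."""
    return [[bg + (fg - bg) * (255 - p) // 255 for p in row] for row in img]

def box_blur(img, radius=1):
    """Blur a grayscale image (list of rows of 0-255 ints) with a box filter."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            # Average over the window, clipped at the image border.
            ys = range(max(0, y - radius), min(h, y + radius + 1))
            xs = range(max(0, x - radius), min(w, x + radius + 1))
            vals = [img[j][i] for j in ys for i in xs]
            row.append(sum(vals) // len(vals))
        out.append(row)
    return out

def resize_to_height(img, target_h=32):
    """Nearest-neighbour resize to a fixed height, keeping the aspect ratio."""
    h, w = len(img), len(img[0])
    target_w = max(1, round(w * target_h / h))
    return [[img[y * h // target_h][x * w // target_w] for x in range(target_w)]
            for y in range(target_h)]

def pad_to_width(img, target_w, fill=255):
    """Right-pad (or crop) every row so all samples share one width."""
    return [(row + [fill] * target_w)[:target_w] for row in img]

def augment(img, rng, target_h=32, target_w=128):
    """Hypothetical augmentation: random colors, optional blur, fixed output size."""
    fg = rng.randint(0, 80)      # assumed range for dark ink
    bg = rng.randint(160, 255)   # assumed range for light background
    out = recolor(img, fg, bg)
    if rng.random() < 0.5:       # blur roughly half of the samples
        out = box_blur(out, radius=1)
    out = resize_to_height(out, target_h)
    return pad_to_width(out, target_w, fill=bg)

# Toy 8x20 "text line": white paper with a black horizontal stroke.
sample = [[0 if 3 <= y <= 4 and 2 <= x <= 17 else 255 for x in range(20)]
          for y in range(8)]
result = augment(sample, random.Random(0))
```

Here the aspect ratio is preserved while resizing to a fixed height, and only the padding makes the width uniform, so the text itself is never stretched.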

Cocoalate · 2019-10-15T03:32:46Z

Hi, is your 300W synthetic text image dataset public? I'm working on receipt OCR now, but I don't have enough data.

yangxiuwu changed the title ~~请问训练数据集有什么建议吗？~~ Could you give some suggestions for training dataset? Dec 10, 2018

yangxiuwu changed the title ~~Could you give some suggestions for training dataset?~~ Could you give me some suggestions for training dataset? Dec 10, 2018
