Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Could you give me some suggestions for training dataset? #140

Open
yangxiuwu opened this issue Dec 9, 2018 · 1 comment
Open

Could you give me some suggestions for training dataset? #140

yangxiuwu opened this issue Dec 9, 2018 · 1 comment

Comments

@yangxiuwu
Copy link

yangxiuwu commented Dec 9, 2018

Hi, I have trained a Chinese OCR model by CRNN ( 300W synth text image as train dataset). but the model has poor result for the real scene. So could you give me some suggestions for training dataset:
Does the dataset require a fixed aspect ratio?
Does the dataset need some data augment, e.g. transform , blur, different font color and diverse background and so on?

@yangxiuwu yangxiuwu changed the title 请问训练数据集有什么建议吗? Could you give some suggestions for training dataset? Dec 10, 2018
@yangxiuwu yangxiuwu changed the title Could you give some suggestions for training dataset? Could you give me some suggestions for training dataset? Dec 10, 2018
@Cocoalate
Copy link

Hi, is your 300W synth text image data public? I'm working on receipt ocr now but my data hasn't been enough.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants