Replies: 1 comment 1 reply
-
Hi @deClot 👋🏼 , The detection models are trained on real data (custom mindee dataset - not public available). @odulcy-mindee how many samples are inside the detection dataset ? :) A possible synthetic approach: |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi everybody,
I'm retrain DBNet with my own document data obtained from FineReader or CRAFT but these methods are not working well on some complicate or noise documents. So I thought about synthetic document generation.
What do you use as your dataset? Real documents or some document generator? Could you give advise about how to get clean data to my dataset?
Beta Was this translation helpful? Give feedback.
All reactions