Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About Layout detection SFT dataset #128

Closed
hjbiao09 opened this issue Sep 12, 2024 · 2 comments
Closed

About Layout detection SFT dataset #128

hjbiao09 opened this issue Sep 12, 2024 · 2 comments

Comments

@hjbiao09
Copy link

hjbiao09 commented Sep 12, 2024

Hi, first of all, thank you so much for releasing such a good source.

It works pretty well on our dataset, but we got incorrect layout detection in a few cases and want to further fine-tune the model with the layoutlmv3-sft pretrained model.
So, I would like to understand the format of the datasets used in the actual layout detection, could you please provide a link to the dataset source or related materials?

In layoutlmv3_base_inference.yaml, the relevant dataset entry is publaynet/layout_scihub, did you train with that dataset?

Again, thanks for opening up your good source.

@hjbiao09 hjbiao09 changed the title About the About Layout detection SFT dataset Sep 12, 2024
@ouyanglinke
Copy link
Collaborator

Layoutlmv3 take COCO format data as input during the sft process in object detection. We use our own datasets for SFT, which would not be open-source. You can build your own sft dataset for your specific cases. Or you could provide those bad cases to us for further sft training. We will update our model once we collect enough bad cases.

@hjbiao09
Copy link
Author

Hi,

Thank you for providing the information regarding the dataset. With your help, we were able to resolve the issue, and I will close this issue now.

Once again, thank you for releasing such a great resource.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants