Bolded substring wrong placed #147

Open
Andrei19612015 opened this issue Mar 28, 2021 · 0 comments

Andrei19612015 commented Mar 28, 2021

I have many PDFs that look like this in Acrobat Reader:

"(кроме ипотеки) в размере: 34 139.33 р. в валюте по ОКВ: 643, в отношении должника (тип должника: физическое лицо): Иванов Иван Иванович, ИНН 352304018162, д.р."

After extraction with TikaOnDotNet, I got:
"(кроме ипотеки) р. в валюте по ОКВ: 643, в отношении должника (тип должника: физическое лицо):в размере: 34 139.33 Иванов Иван Иванович, ИНН 352304018162, д.р."

To be fair, when I select the text on screen and copy it with Ctrl-C, the clipboard contains the same misordered text.

But other utilities (e.g. from IronSoft) place the bolded substrings correctly.
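
A minimal sketch for pinpointing which words moved between the two strings quoted above. It assumes Python and its standard `difflib` module, which is not part of TikaOnDotNet; the function name `displaced_runs` is made up for illustration. It compares the text on a whitespace-token basis and lists every run that is not identical in both versions.

```python
import difflib

# The two strings quoted in this report.
expected = (
    "(кроме ипотеки) в размере: 34 139.33 р. в валюте по ОКВ: 643, "
    "в отношении должника (тип должника: физическое лицо): "
    "Иванов Иван Иванович, ИНН 352304018162, д.р."
)
extracted = (
    "(кроме ипотеки) р. в валюте по ОКВ: 643, "
    "в отношении должника (тип должника: физическое лицо):в размере: 34 139.33 "
    "Иванов Иван Иванович, ИНН 352304018162, д.р."
)


def displaced_runs(expected_text, extracted_text):
    """Return (tag, expected_run, extracted_run) for every non-matching token run.

    Hypothetical helper: tokens are whitespace-separated words, and the tags
    are difflib's opcode names ("delete", "insert", "replace").
    """
    a = expected_text.split()
    b = extracted_text.split()
    matcher = difflib.SequenceMatcher(None, a, b, autojunk=False)
    return [
        (tag, " ".join(a[i1:i2]), " ".join(b[j1:j2]))
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


runs = displaced_runs(expected, extracted)
for tag, old, new in runs:
    print(f"{tag}: expected={old!r} extracted={new!r}")
```

A run that is missing at one position and reappears at another points to reordering rather than lost text, which matches what the two strings show for the amount.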

Tika version: 1.17.1.
Кобзарь 2980.pdf
