[bug] validation of chars #273

Open
drwetter opened this issue May 18, 2022 · 4 comments

Comments

@drwetter

Hi,

I often paste content into the wiki that contains characters which are either outside the document's charset or not printable.

When I open the result in LibreOffice, a popup box appears, and once I acknowledge it, it closes.

image

It would be great if the DokuWiki plugin would either omit such characters or try to sanitize them. This bug may also have a security implication (http://cwe.mitre.org/data/definitions/19).

I worked around that by editing content.xml and putting the archive back together, but it's kind of tedious.
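
The manual workaround above can be scripted. Below is a minimal sketch in Python, assuming the offending characters are literal control characters inside content.xml (not numeric character references) and that simply dropping them is acceptable. `repack_odt` and the file names are hypothetical, and this is not code from the odt plugin.

```python
import re
import zipfile

# Characters that XML 1.0 does not allow in a document:
# C0 controls other than tab, LF and CR, plus U+FFFE and U+FFFF.
_FORBIDDEN_IN_XML = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f\ufffe\uffff]")


def repack_odt(src_path, dst_path):
    """Copy an ODT archive, stripping forbidden characters from content.xml."""
    with zipfile.ZipFile(src_path) as src, zipfile.ZipFile(dst_path, "w") as dst:
        # ODF packages expect "mimetype" to be the first entry, uncompressed.
        infos = sorted(src.infolist(), key=lambda i: i.filename != "mimetype")
        for info in infos:
            data = src.read(info.filename)
            if info.filename == "content.xml":
                # Assumption: undecodable bytes may be replaced by U+FFFD.
                text = data.decode("utf-8", errors="replace")
                data = _FORBIDDEN_IN_XML.sub("", text).encode("utf-8")
            if info.filename == "mimetype":
                compress = zipfile.ZIP_STORED
            else:
                compress = zipfile.ZIP_DEFLATED
            dst.writestr(info.filename, data, compress_type=compress)
```

Usage would be something like `repack_odt("export.odt", "export-clean.odt")`. It writes a new file instead of modifying the export in place, so the original stays available for comparison.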

Thanks, Dirk

@Klap-in
Collaborator

Klap-in commented May 18, 2022

Could you provide examples to make this reproducible for others?

DokuWiki should store it, show it where relevant, and not break on exotic input. I expect that everything that is UTF-8 is handled properly (is this assumption right?). So is my interpretation correct that DokuWiki itself does not need to change?

The trouble you mention occurs when the data is converted to another output format (ODT). So the direct solution would be for the odt plugin to sanitize the data more? And are other export plugins also affected?
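
On the UTF-8 assumption: a string can be valid UTF-8 and still contain characters that XML 1.0 does not allow, such as most C0 control characters. Below is a rough sketch of what sanitizing in an export step could look like. It is written in Python for illustration only; the function names are hypothetical and this is not the odt plugin's code.

```python
import re

# Negation of the XML 1.0 "Char" production:
# #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
_NOT_XML_CHAR = re.compile(
    r"[^\x09\x0a\x0d\x20-\ud7ff\ue000-\ufffd\U00010000-\U0010ffff]"
)


def find_invalid(text):
    """Return (index, code point) pairs for characters XML 1.0 forbids."""
    return [(m.start(), f"U+{ord(m.group()):04X}")
            for m in _NOT_XML_CHAR.finditer(text)]


def sanitize_for_xml(text, replacement="\ufffd"):
    """Replace forbidden characters; pass replacement="" to omit them."""
    return _NOT_XML_CHAR.sub(replacement, text)


if __name__ == "__main__":
    pasted = "header\x01\x0bbody"
    pasted.encode("utf-8")  # encodes without error: it is valid UTF-8
    print(find_invalid(pasted))
    print(sanitize_for_xml(pasted, ""))
```

Replacing with U+FFFD keeps a visible marker where something was removed, while an empty replacement corresponds to the "omit" option from the report. Which of the two fits an export better is a judgment call.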

@Klap-in
Collaborator

Klap-in commented May 18, 2022

Ah, this issue is already filed against the odt plugin. Sorry, I'm replying from my phone, so I have less of an overview.

@drwetter
Author

I can't tell whether it was UTF-8, as it was a binary display from an image in a client-side proxy. Encountering such chars is a normal use case for me. I wanted those hieroglyphs to show up in the final document. I assumed, though, that something in the chain of DokuWiki, odt plugin, and LibreOffice would clean the mess up.

Let me know if you want a hexdump of some sections.

@drwetter
Author

drwetter commented Jun 3, 2022

e.g.
image
