Improve regexes for parsing problem data #409

mgrabovsky · 2020-12-16T15:39:07Z

* Replace ranges with PCRE escapes, e.g. `a-zA-Z` → `\w`.
* Fix package name (NEV) parser to correctly handle epoch numbers. The epoch was previously assumed to be a prefix of the whole name (in ENV form), e.g. `1:findutils-4.7.0-4.fc33`. In order to be consistent with abrt, DNF and RPM, we switch to NEV form, that is `findutils-1:4.7.0-4.fc33`.
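
To illustrate the epoch placement described here, a minimal Python sketch follows. The pattern, the `parse_package` helper and the decision to reject the old epoch-first form are assumptions made for illustration; this is not the regex from the patch.

```python
import re

# Hypothetical pattern for the "name-epoch:version-release" layout; the
# epoch and its colon are optional. This is not the regex from the patch.
PACKAGE_RE = re.compile(
    r"(?P<name>[^:]+)-(?:(?P<epoch>\d+):)?(?P<version>[^-:]+)-(?P<release>[^-:]+)"
)


def parse_package(package):
    """Split a package string into name, epoch, version and release."""
    match = PACKAGE_RE.fullmatch(package)
    if match is None:
        raise ValueError(f"unrecognised package string: {package!r}")
    epoch = match.group("epoch")
    return {
        "name": match.group("name"),
        # Assumption: a missing epoch is treated as 0.
        "epoch": int(epoch) if epoch is not None else 0,
        "version": match.group("version"),
        "release": match.group("release"),
    }


print(parse_package("findutils-1:4.7.0-4.fc33"))
print(parse_package("findutils-4.7.0-4.fc33"))
```

Because the name group excludes colons, a string in the old form such as `1:findutils-4.7.0-4.fc33` raises `ValueError` in this sketch instead of being parsed with the epoch glued to the name.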

Potentially related: #400

packit-as-a-service · 2020-12-16T15:43:06Z

Congratulations! One of the builds has completed. 🍾

⚠️ Please note that our current plans include removal of these comments in the near future (at least 2 weeks after including this disclaimer), if you have serious concerns regarding their removal or would like to continue receiving them please reach out to us. ⚠️

You can install the built RPMs by following these steps:

* `sudo yum install -y dnf-plugins-core` on RHEL 8
* `sudo dnf install -y dnf-plugins-core` on Fedora
* `dnf copr enable packit/abrt-retrace-server-409`
* And now you can install the packages.

Please note that the RPMs should be used only in a testing environment.

xsuchy · 2020-12-16T15:58:42Z

I think we can do that, as we run under the en locale. But beware that, e.g., in the Swedish locale "w" is not part of `\w`.

mgrabovsky · 2020-12-17T08:08:45Z

> I think we can do that, as we run under the en locale. But beware that, e.g., in the Swedish locale "w" is not part of `\w`.

Ah, I didn't think about that. But now that I'm looking into the documentation, `\w` includes “most characters that can be part of a word in any language, as well as numbers and the underscore”. So we should be fine regarding locales (although using the ASCII flag might be safer), but we're now accepting more than the original parser did.

The change might have been too hasty. I will go and fix that.
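
The difference can be checked with a short sketch, assuming the parser uses Python 3's `re` module with `str` patterns (the quoted documentation wording matches it). The sample strings are made up for illustration.

```python
import re

# The explicit range from the original parser versus the \w shorthand.
letters_only = re.compile(r"[a-zA-Z]+")
word_unicode = re.compile(r"\w+")            # Unicode-aware for str patterns
word_ascii = re.compile(r"\w+", re.ASCII)    # restricted to [a-zA-Z0-9_]

# Hypothetical inputs: plain letters, letters with underscore and digit,
# and non-ASCII letters.
samples = ["abrt", "abrt_2", "åäö"]

results = {
    s: (
        bool(letters_only.fullmatch(s)),
        bool(word_unicode.fullmatch(s)),
        bool(word_ascii.fullmatch(s)),
    )
    for s in samples
}

for sample, flags in results.items():
    print(sample, flags)
```

Even with `re.ASCII`, `\w` still accepts digits and the underscore, so it is not a drop-in replacement for `a-zA-Z`.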

xsuchy approved these changes Dec 16, 2020

xsuchy merged commit 84d67af into master Dec 16, 2020

mgrabovsky deleted the parser-regexes branch December 17, 2020 08:33

mgrabovsky mentioned this pull request Dec 17, 2020

Regex repairs and epoch stripping fix #410

Merged
