Add a simplified machine parsable output for `reuse lint` #925

nogweii · 2024-02-28T22:40:05Z

A middle ground between the full JSON output and the human-focused plain output. My idea here is to integrate `reuse lint` with tools like reviewdog, which work best with output along the lines of:

scripts/foo1.sh: missing license 'MPL-2.0'
scripts/foo2.sh: no licensing information
scripts/foo3.sh: no copyright information

One can generate such an output using the JSON version and processing it, but I think it would be beneficial to include it upstream.
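As a rough illustration of the post-processing mentioned above, here is a minimal Python sketch. The shape of the input dict (a `non_compliant` object with `missing_licenses`, `missing_licensing_info`, and `missing_copyright_info` keys) is an assumption made for this example and should be checked against the actual output of `reuse lint --json`; `to_lines` is a hypothetical helper, not part of reuse.

```python
import json


def to_lines(report: dict) -> list[str]:
    """Flatten an (assumed) lint report into 'path: message' lines."""
    non_compliant = report.get("non_compliant", {})
    lines = []
    # Assumed shape: {"MPL-2.0": ["scripts/foo1.sh", ...]}
    for license_id, paths in non_compliant.get("missing_licenses", {}).items():
        for path in paths:
            lines.append(f"{path}: missing license '{license_id}'")
    # Assumed shape: plain lists of file paths.
    for path in non_compliant.get("missing_licensing_info", []):
        lines.append(f"{path}: no licensing information")
    for path in non_compliant.get("missing_copyright_info", []):
        lines.append(f"{path}: no copyright information")
    return sorted(lines)


# Hand-written sample data for illustration, not real tool output.
sample = json.loads("""
{
  "non_compliant": {
    "missing_licenses": {"MPL-2.0": ["scripts/foo1.sh"]},
    "missing_licensing_info": ["scripts/foo2.sh"],
    "missing_copyright_info": ["scripts/foo3.sh"]
  }
}
""")
print("\n".join(to_lines(sample)))
```

Having every project carry a script like this is the duplication that an upstream option would avoid.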

KlfJoat · 2024-03-06T01:04:01Z

+1 to this. I'd also suggest allowing us to pick from a few different output formats. And those output formats should be existing output formats.

One of the output formats that's simplified and machine-parseable could be TAP format. Since TAP parsers abound, the direct output of reuse lint could be used in CI/CD pipelines.

Another format, which is neither simplified nor human-readable but is widely used, could be JUnit format. It also has a lot of parsers and could be consumed by CI/CD pipelines, including built-in reporting on sites like GitLab.

carmenbianca · 2024-04-08T09:50:16Z

Thanks for the issue. This seems quite doable to me. Off the top of my head, an implementation would need:

- A new function akin to `format_plain` and `format_json` in `lint.py`.
- A new argument that can be passed to `reuse lint`, à la `reuse lint --tap` or some such.
- `--json` and `--tap` must be mutually exclusive. argparse already supports something like that.
- Some tests.
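The mutual-exclusion point can be sketched with argparse's `add_mutually_exclusive_group`. This is a standalone illustration, not the actual reuse CLI code: the `lint-sketch` program name and the `build_parser` and `pick_formatter` helpers are hypothetical, and `--tap` is only the flag name floated in this comment.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a toy parser where --json and --tap cannot be combined."""
    parser = argparse.ArgumentParser(prog="lint-sketch")
    group = parser.add_mutually_exclusive_group()
    group.add_argument("--json", action="store_true", help="JSON output")
    group.add_argument("--tap", action="store_true", help="TAP output")
    return parser


def pick_formatter(args: argparse.Namespace) -> str:
    """Return the name of the output format selected on the command line."""
    if args.json:
        return "json"
    if args.tap:
        return "tap"
    return "plain"


print(pick_formatter(build_parser().parse_args(["--tap"])))
```

Passing both flags makes argparse print a usage error and exit, so the formatter selection never has to handle that combination.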

The TAP format looks decent to me as far as machine-readable specs go. A vague proposal:

1..n
ok - ./CHANGELOG.md
ok - ./README.md
not ok - ./pyproject.toml
  ---
  failed:
    - no-copyright
    - no-license
  ...
# global tests
ok - no missing licenses
ok - no unused licenses
ok - no bad licenses
not ok - no licenses without extension
  ---
  failed:
    - ./LICENSES/MIT
  ...
ok - no read errors

The YAML is a bit verbose, but there's no other way to document why the test failed.

Figuring out the value of n in advance may be a little tricky. I suppose it's the number of files (computed in advance) + the number of global tests (static).
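To make the vague proposal a bit more concrete, here is a minimal Python sketch that emits the layout shown above and computes n as files plus global tests. The `format_tap` function and its two dict arguments are assumptions for illustration; the real report objects in reuse look different.

```python
def format_tap(file_failures: dict[str, list[str]],
               global_failures: dict[str, list[str]]) -> str:
    """Render per-file and global checks in the TAP-like layout proposed above.

    Both arguments map a test name to the list of failure details;
    an empty list means the test passed.
    """
    # n = number of files + number of global tests
    total = len(file_failures) + len(global_failures)
    out = [f"1..{total}"]

    def emit(name: str, failed: list[str]) -> None:
        if not failed:
            out.append(f"ok - {name}")
            return
        out.append(f"not ok - {name}")
        # YAML diagnostic block explaining why the test failed.
        out.append("  ---")
        out.append("  failed:")
        out.extend(f"    - {item}" for item in failed)
        out.append("  ...")

    for path, failed in file_failures.items():
        emit(path, failed)
    out.append("# global tests")
    for name, failed in global_failures.items():
        emit(name, failed)
    return "\n".join(out)


# Hand-written sample data for illustration.
files = {"./README.md": [], "./pyproject.toml": ["no-copyright", "no-license"]}
global_tests = {
    "no missing licenses": [],
    "no licenses without extension": ["./LICENSES/MIT"],
}
print(format_tap(files, global_tests))
```

Because both inputs are fully collected before anything is printed, the plan line `1..n` can be written first without a second pass.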

carmenbianca · 2024-04-08T09:53:24Z

I also had a look at reviewdog linked in the first comment. The README is very shouty/busy so I didn't make good progress skimming through it. Would the above work for reviewdog?

nicorikken · 2024-04-08T12:41:30Z

To elaborate on the reviewdog example, here is the input format suggested for reviewdog in the video from its README:

Details of accepting input formats and an example are described in the Input Format section.

`{file}:{line number}:{column number}: {message}` is suggested.

The definitions are constructed according to the Vim errorformat.
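For illustration, here is a small Python sketch that writes and reads lines in that shape, which in Vim errorformat notation would correspond to `%f:%l:%c: %m`. The `format_line` and `parse_line` helpers and the sample message are hypothetical, and the regular expression assumes file paths that contain no colon.

```python
import re

# Matches "file:line:column: message"; assumes the path has no ':' in it.
LINE_RE = re.compile(
    r"^(?P<file>[^:]+):(?P<line>\d+):(?P<col>\d+): (?P<message>.*)$"
)


def format_line(file: str, line: int, col: int, message: str) -> str:
    """Produce one errorformat-style line for a single finding."""
    return f"{file}:{line}:{col}: {message}"


def parse_line(text: str):
    """Return (file, line, col, message), or None if the text does not match."""
    match = LINE_RE.match(text)
    if match is None:
        return None
    return (
        match["file"],
        int(match["line"]),
        int(match["col"]),
        match["message"],
    )


entry = format_line("scripts/foo2.sh", 1, 1, "no licensing information")
print(entry)
print(parse_line(entry))
```

Lint findings that are not tied to a position would need a placeholder line and column, or a shorter `file: message` pattern on the consuming side.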

Another option for different uses might be the SARIF format. This, together with JUnit XML, was already mentioned in an earlier issue: #320

nicorikken · 2024-04-08T12:45:50Z

One question is how to deal with global errors like missing licenses.

- Maybe `.: Missing MIT license in LICENSES/` would be an option, so `.` as the filename.
- Maybe `LICENSES: Missing MIT license in LICENSES/` to refer to the licenses directory.

KlfJoat · 2024-04-08T15:27:21Z

IMO, a missing license is a file-specific error. The license referenced by somefile.ext does not exist in LICENSES/. If a line number is needed, it's the line where the license is referenced.
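A minimal Python sketch of that file-specific view, assuming we already know which license each file references and on which line. The `missing_license_lines` function, its input shapes, and the sample data are all hypothetical and only illustrate the idea.

```python
def missing_license_lines(
    references: dict[str, list[tuple[str, int]]],
    available: set[str],
) -> list[str]:
    """Report each missing license against the file and line that reference it.

    references maps a file path to (license identifier, line number) pairs;
    available is the set of license identifiers present in LICENSES/.
    """
    lines = []
    for path in sorted(references):
        for license_id, lineno in references[path]:
            if license_id not in available:
                lines.append(
                    f"{path}:{lineno}: license '{license_id}' not found in LICENSES/"
                )
    return lines


# Hand-written sample data for illustration.
refs = {
    "somefile.ext": [("MIT", 2)],
    "other.py": [("GPL-3.0-or-later", 3), ("MIT", 4)],
}
print("\n".join(missing_license_lines(refs, {"GPL-3.0-or-later"})))
```

With this approach a single missing license produces one line per referencing file, so no placeholder filename such as `.` or `LICENSES` is needed.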

Adds an '-line' or '-l' option to the 'lint' command. Prints a line for each error, starting with the file to which the error belongs. This output can be a starting point for some parsers, in particular ones that implement something similar to Vim errorformat parsing.

Related to #925

Additional work needed:

- [ ] Needs tests
- [ ] Error messages might have to be improved
- [ ] Not all errors are found, as some issues aren't in the FileReports. This requires additional investigation.

Signed-off-by: Nico Rikken <[email protected]>

nicorikken · 2024-04-22T05:29:12Z

@nogweii @KlfJoat I have a working PR at #956. Can you please take a look to see if it meets your expectations?
I'm trying to run the output through reviewdog, but without success so far. Perhaps I'm doing something wrong.

carmenbianca added the enhancement New feature or request label Apr 8, 2024

carmenbianca added the good first issue Good for newcomers label Apr 8, 2024

nicorikken self-assigned this Apr 8, 2024

nicorikken mentioned this issue Apr 10, 2024

feat: lint output per line #956

Merged

3 tasks
