feature: multibyte cutb #83

roycewilliams · 2023-12-02T22:36:09Z

When input is UTF-8, cutting on character count - even when it is not aligned with byte count - would be useful. This could be added to cutb as a flag, or could be a separate utility.

This script produces some of the desired behavior:

$ cat multicutb
#!/bin/bash

if [ -z "$1" -o -z "$2" ]; then
    echo "Usage: $0 [offset] [length]"
    echo "(similar to cutb from hashcat utils)"
    exit 1
fi
offset=$1
length=$2

grep -Po "^.{$offset}\K.{$length}"

$ echo Τηεοδ29 | multicutb 1 3
ηεο

(but doesn't cover the negative-offset functionality)

The text was updated successfully, but these errors were encountered:

roycewilliams added the enhancement label Dec 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature: multibyte cutb #83

feature: multibyte cutb #83

roycewilliams commented Dec 2, 2023 •

edited

Loading

feature: multibyte cutb #83

feature: multibyte cutb #83

Comments

roycewilliams commented Dec 2, 2023 • edited Loading

roycewilliams commented Dec 2, 2023 •

edited

Loading