Tokenizer in pure C The Byte Pair Encoding (BPE) Tokenizer that translates strings <-> tokens Build make tknz Run ./tknz "Fill in here the sentences you want to count up by the tokenizer"