Build alignment groups using a greedy substring-equality algorithm on decoded token pieces. The current implementation produces incorrect decoding results in some cases. BPE-based tokenizers always split ...
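
For illustration, a greedy substring-equality grouping over decoded pieces might look roughly like the sketch below. It is not the implementation referred to above: the function name `build_alignment_groups`, the `(indices_a, indices_b)` group shape, and the error handling are all hypothetical. It also assumes that every piece decodes to a clean string on its own and that both sequences concatenate to the same text.

```python
from typing import List, Tuple

# One group pairs the indices of pieces from each side whose
# concatenated text is identical (hypothetical representation).
Group = Tuple[List[int], List[int]]


def build_alignment_groups(pieces_a: List[str], pieces_b: List[str]) -> List[Group]:
    """Greedily group decoded pieces from two tokenizations of the same text."""
    groups: List[Group] = []
    i = j = 0
    while i < len(pieces_a) and j < len(pieces_b):
        # Start a new group with one piece from each side.
        idx_a, idx_b = [i], [j]
        text_a, text_b = pieces_a[i], pieces_b[j]
        i += 1
        j += 1
        # Extend the shorter side until both accumulated strings are equal.
        while text_a != text_b:
            if len(text_a) < len(text_b):
                if i >= len(pieces_a) or not text_b.startswith(text_a):
                    raise ValueError("pieces do not align")
                text_a += pieces_a[i]
                idx_a.append(i)
                i += 1
            elif len(text_b) < len(text_a):
                if j >= len(pieces_b) or not text_a.startswith(text_b):
                    raise ValueError("pieces do not align")
                text_b += pieces_b[j]
                idx_b.append(j)
                j += 1
            else:
                # Same length but different content: the texts diverge.
                raise ValueError("pieces do not align")
        groups.append((idx_a, idx_b))
    if i != len(pieces_a) or j != len(pieces_b):
        raise ValueError("unmatched trailing pieces")
    return groups


if __name__ == "__main__":
    a = ["Hel", "lo", " wor", "ld"]
    b = ["Hello", " ", "world"]
    print(build_alignment_groups(a, b))
```

Each group closes as soon as the two accumulated strings match, so groups stay as small as possible. The sketch raises rather than guessing when the pieces cannot be reconciled, which is one way to surface decoding mismatches instead of silently producing a wrong alignment.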