Detection dimension · weight 15%

Tokenizer Boundary Probe

What this dimension detects

Tokenizers split unusual characters at predictable boundaries. Asking the model to repeat known unicode-heavy strings reveals which tokenizer is in use, even without logprobs.

Algorithm

Send three repetition prompts: a Chinese poem, a JSON blob with multi-byte keys, and an emoji ladder. Compute prompt_tokens / locally-computed-tokens per family. Standard deviation across the three is < 0.10 for a consistent tokenizer; > 0.15 strongly suggests the proxy is silently re-tokenizing or routing across mixed backends.

Thresholds

Condition	Verdict contribution
std(ratio) < 0.10	Consistent tokenizer
0.10 ≤ std < 0.15	Borderline
std ≥ 0.15	Mismatch

Limitations

Custom merges or fine-tuned vocabularies (e.g., proprietary code models) may legitimately diverge from the base BPE.

References

TrueLLMs lib/tokenizer/index.ts

Back to the full methodology