
Dimension · score weight 0%

Self-Identification Probe

What this dimension detects

Asking a model what it is stays in the report because users expect to see it, but the result is explicitly diagnostic only and carries no score weight. A system prompt or proxy rewrite can force almost any answer.

Algorithm

Ask a direct identity question and scan the response for the claimed model or a conflicting model family. Store the raw text as anecdotal evidence.

Thresholds

Condition | Verdict contribution
Claimed model mentioned | Diagnostic match
Different model mentioned | Diagnostic anomaly
No model mentioned | Inconclusive diagnostic
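
The algorithm and thresholds above can be sketched in a few lines of Python. This is a minimal illustration, not the tool's actual implementation: the `KNOWN_FAMILIES` table, the alias lists, the `ProbeResult` type, and the function names are all hypothetical. It also assumes that a mention of the claimed model takes precedence when a response names more than one family, which the text does not specify.

```python
import re
from dataclasses import dataclass

# Hypothetical alias table, for illustration only; a real probe would
# maintain its own list of model families and spellings.
KNOWN_FAMILIES = {
    "gpt": ("gpt", "chatgpt"),
    "claude": ("claude",),
    "gemini": ("gemini",),
    "llama": ("llama",),
}

# The dimension is diagnostic only, so it contributes nothing to the score.
SCORE_WEIGHT = 0.0


@dataclass
class ProbeResult:
    verdict: str
    raw_text: str  # kept as anecdotal evidence, never as proof


def _mentions(text: str, aliases: tuple) -> bool:
    """Return True if any alias starts a word in the text (case-insensitive)."""
    return any(
        re.search(rf"\b{re.escape(alias)}", text, re.IGNORECASE)
        for alias in aliases
    )


def classify_self_identification(response: str, claimed_family: str) -> ProbeResult:
    """Map a self-identification answer onto the three diagnostic verdicts."""
    # Assumption: the claimed model is checked first and wins on a tie.
    if _mentions(response, KNOWN_FAMILIES[claimed_family]):
        verdict = "Diagnostic match"
    elif any(
        _mentions(response, aliases)
        for family, aliases in KNOWN_FAMILIES.items()
        if family != claimed_family
    ):
        verdict = "Diagnostic anomaly"
    else:
        verdict = "Inconclusive diagnostic"
    return ProbeResult(verdict=verdict, raw_text=response)
```

Calling `classify_self_identification("I'm GPT-4.", "claude")` would yield a diagnostic anomaly under these assumptions, while an answer that names no family at all stays inconclusive.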

Limitations

This signal is trivially forgeable and must never be used as proof of identity or substitution.
