
Dimension · score weight 0%

Response Latency & Throughput

What this dimension detects

Time to first token (TTFT) and throughput can reveal routing or cache anomalies, but both depend too heavily on the environment to be scored in the current scoring model.

Algorithm

Collect request latency, time to first token where available, and tokens per second across probes. Compare the distribution with coarse expected ranges and display deviations as diagnostic context.
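The collection step can be sketched as follows. This is a minimal illustration, not the code in lib/fingerprints/latency.ts: the `ProbeTiming` shape, the helper names, and the choice of median and 90th percentile as summary statistics are all assumptions made for the example.

```typescript
// Hypothetical shape of one probe's timing measurements (not the TrueLLMs types).
interface ProbeTiming {
  latencyMs: number;    // total request latency
  ttftMs?: number;      // time to first token, present only when streaming exposes it
  outputTokens: number; // tokens generated in the response
}

interface Summary {
  count: number;
  median: number;
  p90: number;
}

// Nearest-rank percentile computed on a sorted copy; NaN for empty input.
function percentile(values: number[], p: number): number {
  if (values.length === 0) return NaN;
  const sorted = [...values].sort((a, b) => a - b);
  const rank = Math.ceil((p / 100) * sorted.length);
  return sorted[Math.min(sorted.length - 1, Math.max(0, rank - 1))];
}

function summarize(values: number[]): Summary {
  return {
    count: values.length,
    median: percentile(values, 50),
    p90: percentile(values, 90),
  };
}

// Tokens per second over the generation phase when TTFT is known,
// otherwise over the whole request (an assumption of this sketch).
function tokensPerSecond(t: ProbeTiming): number {
  const genMs = t.ttftMs !== undefined ? t.latencyMs - t.ttftMs : t.latencyMs;
  return genMs > 0 ? (t.outputTokens / genMs) * 1000 : NaN;
}

// Summarize each metric across all probes; probes without TTFT are skipped
// for the TTFT summary only.
function summarizeProbes(probes: ProbeTiming[]) {
  return {
    latencyMs: summarize(probes.map((p) => p.latencyMs)),
    ttftMs: summarize(
      probes.flatMap((p) => (p.ttftMs === undefined ? [] : [p.ttftMs])),
    ),
    tokensPerSecond: summarize(
      probes.map(tokensPerSecond).filter((v) => Number.isFinite(v)),
    ),
  };
}
```

Summarizing with a median and an upper percentile rather than a mean keeps a single slow probe from dominating the picture, which suits a metric that is only shown as diagnostic context.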

Thresholds

Condition | Verdict contribution
Within coarse expected range | Diagnostic match
Large deviation or unstable distribution | Diagnostic anomaly
Any result | Score contribution remains 0
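One way to express these rules in code is sketched below. The page does not define "large deviation" or "unstable", so this example assumes that a median outside the expected range counts as a large deviation and that a coefficient of variation above a hypothetical `MAX_CV` cutoff counts as unstable. All names here are illustrative and are not taken from the TrueLLMs source.

```typescript
type DiagnosticVerdict = "diagnostic-match" | "diagnostic-anomaly";

interface ExpectedRange {
  min: number;
  max: number;
}

interface LatencyDiagnostic {
  verdict: DiagnosticVerdict;
  scoreContribution: 0; // fixed at 0 for every result
}

// Hypothetical instability cutoff (assumption): stddev / mean above this is "unstable".
const MAX_CV = 1.0;

// Median of a non-empty array; averages the two middle values for even lengths.
function median(values: number[]): number {
  const sorted = [...values].sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  return sorted.length % 2 === 1 ? sorted[mid] : (sorted[mid - 1] + sorted[mid]) / 2;
}

// Coefficient of variation using the population standard deviation.
function coefficientOfVariation(values: number[]): number {
  const mean = values.reduce((sum, v) => sum + v, 0) / values.length;
  const variance =
    values.reduce((sum, v) => sum + (v - mean) ** 2, 0) / values.length;
  return mean === 0 ? 0 : Math.sqrt(variance) / mean;
}

function classifyLatency(samples: number[], expected: ExpectedRange): LatencyDiagnostic {
  if (samples.length === 0) {
    throw new Error("classifyLatency needs at least one sample");
  }
  const m = median(samples);
  const outOfRange = m < expected.min || m > expected.max;
  const unstable = coefficientOfVariation(samples) > MAX_CV;
  return {
    verdict: outOfRange || unstable ? "diagnostic-anomaly" : "diagnostic-match",
    scoreContribution: 0,
  };
}
```

Typing `scoreContribution` as the literal `0` makes the last table row a compile-time guarantee: no code path can return a non-zero contribution without changing the type.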

Limitations

Latency is dominated by geography, provider load, queueing, gateway buffering, client network, and cache state. It can corroborate a conclusion drawn from other dimensions, but it should not decide model identity on its own.

References

  • TrueLLMs lib/fingerprints/latency.ts
