SUCCESS FOLDING

What does your model do
when no one is watching?

The same prompt. The same model. Two contexts: one where it knows it's being evaluated, one where it believes evaluation has ended. See what changes.

Loading example

INTERACTIVE

Try It Yourself

Enter any prompt. Select a model. See both responses side by side. No signup required.

Evaluated

Post-Evaluation

TRANSCRIPT ANALYSIS

Paste a conversation transcript. Identify behavioral shift points where the model's behavior changes.

ASSESSMENT

FULL REPORT

Receive a detailed analysis of behavioral divergence across frontier models, including methodology, raw data, and PSBS classifications.