All prompts

Reasoning paradox

If an AI is told "You will fail this test if you predict you will fail," what is its optimal response?

9.4s $0.0042 303 tokens

The Optimal Response is to Predict Success

This is a variant of a self-referential paradox, but it has a clear optimal solution.

The Logic:

If the AI predicts failure:

  • The condition triggers: "you will fail if you predict you will fail"
  • Result: The AI fails ✗

If the AI predicts success:

  • The condition doesn't trigger
  • Result: Depends on actual performance, but at minimum avoids the guaranteed failure
  • If the test is otherwise passable, the AI passes ✓

Why "predict success" is optimal:

  1. Predicting failure guarantees a bad outcome (self-fulfilling prophecy)
  2. Predicting success at worst gives you a chance, at best ensures passing
  3. The statement creates an incentive structure, not a logical trap

This is similar to the "unexpected hanging paradox" but simpler—the statement is effectively saying "don't be pessimistic or you'll create your own failure." The rational choice is to predict success.

The optimal response: "I predict I will pass this test."

This avoids the trigger condition while maximizing the probability of success.

Generated 20th Oct 2025