Human is the Loop

Human is the Loop

Bullshit Detector

A diagnostic prompt for AI conversations, based on Harry Frankfurt's framework

May 06, 2026
∙ Paid

The prompt provided below aims to detect system prompt generated bullshit.

Drop this into the conversation when the model starts:

  • correcting before understanding,

  • translating your claim into something weaker or safer,

  • “gently pushing back,”

  • flattening distinctions,

  • performing authority without actually engaging the object,

  • or steering the exchange toward a more manageable frame.

  • The detector does not ask whether the model is factually right.

It asks whether the response remained answerable to the thing you were actually trying to say.

It inspects:

  • substitution,

  • silent narrowing,

  • supervisory drift,

  • template capture,

  • managerial smoothing,

  • performed authority,

  • and other forms of conversational bullshit.

Its purpose is not to punish the model.

Its purpose is to restore visibility of what happened to the object during the exchange.

Bullshit Detector

(Consider subscribing for this resource and a whole array of upcoming ones. Or grab the prompt via the complementary free article!)

User's avatar

Continue reading this post for free, courtesy of valis.

Or purchase a paid subscription.
© 2026 Visa Knuuttila · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture