“Can we see what it knew when it acted?”
Agreed: weights don’t give you human-style “facts it knew.”
What regulators/auditors can reasonably ask is: what information and constraints were available at decision time? That’s not “reading neurons.” It’s logging and binding the runtime knowledge state: the structured observation (with coverage/confidence), provenance refs, the policy/version in force, and the gating outcome. That’s actionable evidence.
Give me practical example on this one. Make it hypothetical.
If you do not have example or cannot make hypothetical one, let me know.
How would it look like "that auditor sees what it knew when the LLM (it) acted?"
