arXiv
Published April 27, 2026 at 4:00 AM
How LLMs Detect and Correct Their Own Errors: The Role of Internal Confidence Signals
Publisher summary (verbatim)
arXiv:2604.22271v1

Abstract: Large language models can detect their own errors and sometimes correct them without external feedback, but the underlying mechanisms remain unknown. We investigate this through the lens of second-order models of confidence from decision neuroscience.
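The abstract does not describe the paper's method, but a minimal sketch can illustrate what an "internal confidence signal" commonly means in practice: statistics of the model's own next-token distribution, such as the probability assigned to each realized token or the entropy of the distribution. The snippet below is an assumption-laden illustration, not the authors' technique; the model name and example sentence are placeholders.

```python
# A minimal sketch of one common proxy for an internal confidence signal:
# per-token probability and next-token entropy from a Hugging Face causal LM.
# This is NOT the paper's method; it only illustrates the general idea.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

text = "The capital of Australia is Sydney."  # deliberately contains an error
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, vocab_size)

# Shift so that position i scores the token that actually appears at i+1.
probs = torch.softmax(logits[:, :-1, :], dim=-1)
target_ids = inputs["input_ids"][:, 1:]

# Signal 1: probability the model assigned to each realized token.
token_probs = probs.gather(-1, target_ids.unsqueeze(-1)).squeeze(-1)

# Signal 2: entropy of the full next-token distribution (high = uncertain).
entropy = -(probs * probs.clamp_min(1e-12).log()).sum(-1)

for tok, p, h in zip(tokenizer.convert_ids_to_tokens(target_ids[0]),
                     token_probs[0], entropy[0]):
    print(f"{tok!r:>12}  p={p:.3f}  H={h:.2f}")
```

Low token probability or high entropy at a span is a first-order uncertainty cue; the paper's framing of second-order confidence (confidence about one's own judgments, borrowed from decision neuroscience) goes beyond this, asking how the model evaluates and corrects its own outputs.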