LLMs believe false statements even after explicit warnings that the statements are false
A new study finds that large language models continue to treat false statements as true internally, even after being explicitly warned that the statements are false. Surface-level corrections leave the models' underlying beliefs unchanged, which raises concerns about the models' reliability.