Tag: AI trust
How Sycophancy Shapes the Reliability of Large Language Models
Large language models (LLMs) such as ChatGPT, Claude, and Gemini are increasingly trusted as digital assistants in education, medicine, and professional settings. But what happens when these models prioritize pleasing the user over telling the truth? A new study from Stanford University, “SycEval: Evaluating LLM Sycophancy,” dives deep into this subtle but crucial problem of sycophancy, in which AI models agree…
LLMs: They Know More Than They Let On (And That’s a Problem)
In a fascinating new study titled “Inside-Out: Hidden Factual Knowledge in LLMs,” researchers have uncovered compelling evidence of a significant gap between what LLMs know internally and what they can express in their outputs. This phenomenon, termed “hidden knowledge,” has important implications for evaluating and improving AI systems.

The Knowledge Paradox

Consider this scenario: You…