Skip to content

About
Services
- Service Catalog for UN Agencies
Initiatives
InfoSec
Guides
Blog
Careers
Contact

Tag: AI chabot security

From ‘Catching Bad Words’ to ‘Understanding Bad Intent’: AI Safety’s Next Evolution

02 Feb 2026

⋅

Ng Chong

As Large Language Models (LLMs) like Claude and GPT-4 become central to our digital lives, a silent arms race is happening behind the scenes. On one side, “jailbreakers” try to trick AI into bypassing its safety filters; on the other, researchers build shields to keep the AI helpful and harmless. The recent paper “Constitutional Classifiers++:…

Tech Insights

Updates on the latest developments, trends, and news in the field of information and communication technology

Privacy Statement
Copyright
Terms of Use
Blog
Careers
Contact

United Nations University
Tokyo, JAPAN