TruthfulAI
Home
Papers
Blog
Videos
In the News
Hiring
About
Home
Papers
Blog
Videos
In the News
Hiring
About
Videos
A collection of talks, interviews, and presentations about our AI safety research.
January 2026
Weird Generalizations and Backdoors: New Ways to Corrupt LLMs
December 2025
The Hinton Lectures 2025 - AI Agents: Risks and Opportunities (Night 1)
June 2025
Owain Evans on LLM Psychology
June 2025
Emergent Misalignment
June 2025
Deluding AIs
February 2023
Truthful Language Models and AI Alignment