Team

Owain Evans

Director and Research Lead

Owain is also an affiliate at CHAI (UC Berkeley) and was previously based at the Future of Humanity Institute, University of Oxford. He earned his PhD at MIT. He also worked at Ought, where he still serves on the Board of Directors.

Jan Betley

Research Scientist

Jan worked as a software developer for over a decade before shifting to AI safety in 2023. He is an ARENA and Astra Fellowship alumnus, interested in anything related to out-of-context reasoning in LLMs.

Johannes Treutlein

Research Scientist

Johannes was previously a Member of Technical Staff on Anthropic's alignment team. He is on leave from a PhD in computer science at UC Berkeley, supervised by Stuart Russell.

Anna Sztyber-Betley

Research Scientist (part-time)

Anna has a PhD in Automatic Control and Robotics and is an assistant professor at the Faculty of Mechatronics, Warsaw University of Technology.
She collaborates with Truthful AI on AI Safety projects related to out-of-context reasoning in LLMs.

Alumni

James Chua

Research Scientist (Alumni)

James was the second employee at Truthful AI and contributed to many projects. He is now a research scientist in alignment science at Anthropic.

Current and Past Mentees

Owain Evans has mentored the following researchers on AI safety papers. If you are interested in becoming a mentee, please register for the Astra Fellowship.

Name | Year | Current Role
Jan Dubiński | 2026 | Astra scholar
Harry Mayne | 2026 | Astra scholar
Lev McKinney | 2026 | Astra scholar
Adam Karvonen | 2025 | MATS scholar
Dylan Feng | 2025 | MATS scholar
Jorio Cocola | 2025 | MATS scholar
Minh Le | 2025 | Anthropic
Alex Cloud | 2025 | Anthropic
Daniel Tan | 2025 | PhD student, UCL
Martín Soto | 2024-2025 | Research Scientist, UK AISI
Jenny (Xuchan) Bao | 2024-2025 | Anthropic
Dami Choi | 2024 | Transluce
James Chua | 2024 | Anthropic
Johannes Treutlein | 2024 | Truthful AI
Jan Betley | 2024 | Truthful AI
Felix Binder | 2024 | Meta AI
Alexander Meinke | 2023 | Research Scientist, Apollo Research
Lorenzo Pacchiardi | 2023 | Research Associate, Univ. of Cambridge
Asa Cooper Stickland | 2023 | Research Scientist, UK AI Safety Institute (AISI)
Mikita Balesni | 2023 | OpenAI (ex-Apollo Research)
Lukas Berglund | 2023 | U.S. AI Safety Institute (NIST AISI)
Meg Tong | 2023 | Anthropic
Max Kaufmann | 2023 | Anthropic (ex-UK AISI)
Alex J. Chan | 2023 | Salesforce, ex-Spotify
Tomek Korbak | 2023 | OpenAI (ex-UK AISI)
Alexa (Yue) Pan | 2023 | Redwood Research
Dane Sherburn | 2022-2023 | OpenAI
Stephanie Lin | 2021-2022 | OpenAI
Lukas Finnveden | 2021-2022 | Research Analyst, Redwood Research
Jan Hendrik Kirchner | 2022 | Researcher at Anthropic (ex-OpenAI)
Tom McGrath | 2018 | Chief Scientist & Co-founder, Goodfire, ex-GDM
Zachary Kenton | 2018 | Staff Research Scientist, Google DeepMind
Richard Ngo | 2018 | Independent; previously OpenAI Governance
William Saunders | 2017 | Researcher, Alignment Science, Anthropic, ex-OpenAI
Girish Sastry | 2017 | Independent researcher/policy, ex-OpenAI
Neal Jean | 2017 | Co-founder & CEO, Beacons
Ryan Carey | 2017 | Optiver, ex-Oxford PhD
Chris Cundy | 2017 | Research Scientist, FAR AI
Daniel Filan | 2016 | METR
John Salvatier | 2016 | Independent researcher
David Abel | 2016 | Senior Research Scientist at Google DeepMind
David Krueger | 2016 | Assistant Professor, Mila, ex-Cambridge