Team
Owain Evans
Director and Research Lead
Owain is also an affiliate at CHAI (UC Berkeley) and was previously based at the University of Oxford at the Future of Humanity Institute. He earned his PhD at MIT. He also worked at Ought, where he still serves on the Board of Directors.

Jan Betley
Research Scientist
Jan worked as a software developer for over a decade before shifting to AI safety in 2023. He is an ARENA and Astra Fellowship alumni, interested in anything related to out-of-context reasoning in LLMs.

Johannes Treutlein
Research Scientist
Johannes was previously a Member of Technical Staff on Anthropic's alignment team. He is on leave from a PhD in computer science at UC Berkeley, supervised by Stuart Russell.

Anna Sztyber-Betley
Research Scientist (part-time)
Anna has a PhD in Automatic Control and Robotics and is an assistant professor at the Faculty of Mechatronics, Warsaw University of Technology.
She collaborates with Truthful AI on AI Safety projects related to out-of-context reasoning in LLMs.

Alumni
James Chua
Research Scientist (Alumni)
James was the second employee at Truthful AI and contributed to many projects. He is now a research scientist in alignment science at Anthropic.

Current and Past Mentees
These are mentees that Owain Evans has mentored, as part of making AI Safety papers. If you are interested in working as a mentee, please register for the Astra Fellowship.
| Name | Year | Current Role |
|---|---|---|
| Jan Dubiลski | 2026 | Astra scholar |
| Harry Mayne | 2026 | Astra scholar |
| Lev McKinney | 2026 | Astra scholar |
| Adam Karvonen | 2025 | MATS scholar |
| Dylan Feng | 2025 | MATS scholar |
| Jorio Cocola | 2025 | MATS scholar |
| Minh Le | 2025 | Anthropic |
| Alex Cloud | 2025 | Anthropic |
| Daniel Tan | 2025 | PhD student, UCL |
| Martรญn Soto | 2024-2025 | Research Scientist, UK AISI |
| Jenny (Xuchan) Bao | 2024-2025 | Anthropic |
| Dami Choi | 2024 | Transluce |
| James Chua | 2024 | Anthropic |
| Johannes Treutlein | 2024 | Truthful AI |
| Jan Betley | 2024 | Truthful AI |
| Felix Binder | 2024 | Meta AI |
| Alexander Meinke | 2023 | Research Scientist, Apollo Research |
| Lorenzo Pacchiardi | 2023 | Research Associate, Univ. of Cambridge |
| Asa Cooper Stickland | 2023 | Research Scientist, UK AI Safety Institute (AISI) |
| Mikita Balesni | 2023 | OpenAI (ex-Apollo Research) |
| Lukas Berglund | 2023 | U.S. AI Safety Institute (NIST AISI) |
| Meg Tong | 2023 | Anthropic |
| Max Kaufmann | 2023 | Anthropic (ex-UK AISI) |
| Alex J. Chan | 2023 | Salesforce, ex-Spotify |
| Tomek Korbak | 2023 | OpenAI (ex-UK AISI) |
| Alexa (Yue) Pan | 2023 | Redwood Research |
| Dane Sherburn | 2022-2023 | OpenAI |
| Stephanie Lin | 2021-2022 | OpenAI |
| Lukas Finnveden | 2021-2022 | Research Analyst, Redwood Research |
| Jan Hendrik Kirchner | 2022 | Researcher at Anthropic (ex-OpenAI) |
| Tom McGrath | 2018 | Chief Scientist & Co-founder, Goodfire, ex-GDM |
| Zachary Kenton | 2018 | Staff Research Scientist, Google DeepMind |
| Richard Ngo | 2018 | Independent; previously OpenAI Governance |
| William Saunders | 2017 | Researcher, Alignment Science, Anthropic, ex-OpenAI |
| Girish Sastry | 2017 | Independent researcher/policy, ex-OpenAI |
| Neal Jean | 2017 | Co-founder & CEO, Beacons |
| Ryan Carey | 2017 | Optiver, ex-Oxford PhD |
| Chris Cundy | 2017 | Research Scientist, FAR AI |
| Daniel Filan | 2016 | METR |
| John Salvatier | 2016 | Independent researcher |
| David Abel | 2016 | Senior Research Scientist at Google DeepMind |
| David Krueger | 2016 | Assistant Professor, Mila, ex-Cambridge |