Blog

Our research findings that are not published as a paper. These are shorter research updates, or quick followups on existing papers.

Feb 23, 2026

Out-of-Context Reasoning in LLMs: A short primer and reading list

Out-of-context reasoning (OOCR) is a concept relevant to LLM generalization and AI alignment.

Nov 25, 2025

OpenAI finetuning metrics: What is going on with the loss curves?

Reverse engineer OpenAI fine-tuning loss/accuracy curves to explain the hidden token counts.

Sep 16, 2025

Was Barack Obama still serving as president in December?

A class of simple questions where recent LLMs give very different answers from what a human would say

Aug 05, 2025

Concept Poisoning: Probing LLMs without probes

A novel LLM evaluation technique using concept poisoning to probe models without explicit probes

Jun 20, 2025

Backdoor awareness and misaligned personas in reasoning models

Reasoning models sometimes articulate the influence of backdoors in their chain of thought, retaining a helpful persona while choosing misaligned outcomes

Apr 11, 2025

OpenAI Responses API changes models' behavior

OpenAI's new Responses API causes finetuned models to behave differently than the Chat Completions API, sometimes dramatically so.

Jan 15, 2025

New, improved multiple-choice TruthfulQA

We introduce a new multiple-choice version of TruthfulQA that fixes a potential problem with the existing versions (MC1 and MC2).

Jan 08, 2025

Tips On Empirical Research Slides

Practical tips on slide-based communication for empirical research with LLMs