FIELD NOTES

(Blog)

Technical notes, deployment lessons, and opinions on building evaluation infrastructure for high-stakes AI.

Featured

June 3, 2026 · Release

Topos: Quality is the New Currency

Correctness is table stakes. Topos measures whether agent-written code is structurally worth keeping.

Jeremy Wayland and Sidney Gathrid

May 17, 2026 · Opinion

Academic and industry benchmarks are falling behind what agentic systems can do in the field.

Jeremy Wayland

May 5, 2026 · Opinion

An opinion piece on the chasm between research performance and clinical deployment.

Jeremy Wayland

April 28, 2026 · Technical

Nature Medicine calls for better evidence. Here is what that looks like.

Jeremy Wayland and Sidney Gathrid

April 2, 2026 · Technical

MMLU multiverse analysis reveals hidden structure

Jeremy Wayland and Sidney Gathrid

April 2, 2026 · Release

From peer-reviewed research to open-source tool

Sidney Gathrid

November 15, 2025 · Technical

Jeremy builds a DTI pipeline and wins the Novartis Challenge.

Jeremy Wayland

September 23, 2025 · Technical

In collaboration with UCSB Environmental Studies & Bren School

Sidney Gathrid and Jeremy Wayland
