logo

Field Notes.

Technical notes, deployment lessons, and opinions on building evaluation infrastructure for high-stakes AI.

Opinion

Better Evaluations for the Agentic Era

Academic and industry benchmarks are falling behind what agentic systems can do in the field.

Jeremy Wayland

Read note
Opinion

Takeaways from the 3rd Annual Pediatric & Lifespan Data Science Conference

An opinion piece on the chasm between research performance and clinical deployment.

Jeremy Wayland

Read note
Technical

Pasteurizing Sepsis Prediction

Nature Medicine calls for better evidence. Here is what that looks like.

Jeremy Wayland and Sidney Gathrid

Read note
Technical

Evaluating LLM Benchmarks with Pulsar

MMLU multiverse analysis reveals hidden structure

Jeremy Wayland and Sidney Gathrid

Read note
Release

Pulsar: Robust Topology at Scale

From peer-reviewed research to open-source tool

Sidney Gathrid

Read note
Technical

Geometric Deep Learning for Drug-Target Interaction (Nucleate BioHack 2025)

Jeremy builds a DTI pipeline and wins the Novartis Challenge.

Jeremy Wayland

Read note
Technical

Strategies to Accelerate US Coal Power Phaseout Using Contextual Retirement Vulnerabilities

In collaboration with UCSB Environmental Studies & Bren School

Sidney Gathrid and Jeremy Wayland

Read note