§ P Publications
Research notes from the lab.
The taxonomies, methodological framings, and open research agenda that structure our evaluations. Substantive scaffolding that lives alongside the results: definitions, exclusions, decisions made in public, and the work we intend to study next.
Published
- № 01
A taxonomy of product experience capability for AI agents
Most discussions of AI "design capability" talk past each other because "design" points at three different things. This is the capability we name instead (product experience), set out as eight categories, with the exclusions stated explicitly.
v1.0Jun 2, 2026 - № 02
Evaluating agents in an existing product, not a blank canvas
Real product work means adding a feature to an app that already has its own conventions, yet almost every benchmark tests design from a blank canvas. Why the app we evaluate inside shapes what we can measure.
v1.0Jun 3, 2026