Today in AI for life science: agent benchmarks converge on a lesson we felt firsthand — the system around the model matters as much as the model — virtual-cell ‘world models’ arrive with a push for reproducibility standards, and the lab runs its first live agent-driven experiment.