Reward-free agents that learn from demonstrations, self-reflection, and deterministic guardrails. This repository combines Stanford/OSU’s Early Experience pipeline with Stanford’s Agentic Context ...