London AI School

Build · Research · Explore — with personalised AI mentor agents

Research blog

Posts

Notes on AI agents, evaluation, and project-based research formation at London AI School.

Evaluation · Research perspective

Benchmarking AI Tutors: From Answer Correctness to Pedagogical Guidance

Why solver-style benchmarks fail for education, what recent tutoring benchmarks show, and how we should evaluate AI Tutors as guidance systems rather than solution engines.