benchmark

Hindsight Is #1 on BEAM — the Benchmark That Tests Memory at 10 Million Tokens
Hindsight is #1 on BEAM — the memory benchmark that tests at 10M tokens where context stuffing is impossible. See every published result and what drives the 58% margin.

Agent Memory Benchmark: A Manifesto
Agent Memory Benchmark: A Manifesto