engineering

How Hindsight Scales
A design analysis of how Hindsight's memory operations scale with data volume: which costs grow, which stay bounded, and why.

How We Built a 4-Way Hybrid Search System That Actually Runs in Parallel
Sequential async queries were killing our retrieval latency. Here's how we built a true 4-way parallel hybrid search system with asyncio and RRF fusion — then evolved it further with connection sharing, cross-encoder reranking, and multiplicative boost scoring.