Evaluate search
Evaluate context
Evaluate compaction
apply=True only when the evaluation should apply the compaction result.
Serialize operation results
All evaluation result models provide stable JSON-compatible helpers:Build a benchmark report
Implement a benchmark adapter
External benchmark packages implementBenchmarkAdapter: