Test summary

Prompt: 25 Tools Run v34
Dataset: Multi-mcp-history
Status: 15 Completed
Test run duration: 4 minutes 55 seconds
Evaluation cost: -
Started at: Jul 3, 2025, 9:38:21 PM
Author: Madhu Shantan

Summary by evaluator

Tool Call Accuracy
Result        Pass
Mean score    0.8
Pass rate     80%

Tokens

Type                 Value
Total tokens         1177682
Input tokens         1154818
Completion tokens    22864

Cost

Type                     Value
Total cost               $3.807414
Input token cost         $3.464454
Completion token cost    $0.34296
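
The token and cost figures above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original report: it copies the reported numbers by hand, confirms that each total equals the sum of its parts, and derives the implied price per million tokens. The variable names are illustrative, and the per-million rates are an inference from the figures, since the report itself does not state any pricing.

```python
# Figures copied by hand from the Tokens and Cost tables in this report.
input_tokens = 1_154_818
completion_tokens = 22_864
total_tokens = 1_177_682

input_cost = 3.464454        # USD
completion_cost = 0.34296    # USD
total_cost = 3.807414        # USD

# Each reported total should equal the sum of its parts.
tokens_reconcile = input_tokens + completion_tokens == total_tokens
cost_reconciles = abs(input_cost + completion_cost - total_cost) < 1e-9

# Implied price per million tokens (derived here, not stated in the report).
input_rate_per_million = input_cost / input_tokens * 1_000_000
completion_rate_per_million = completion_cost / completion_tokens * 1_000_000

print("tokens reconcile:", tokens_reconcile)
print("cost reconciles:", cost_reconciles)
print("implied input rate (USD per 1M tokens):", round(input_rate_per_million, 2))
print("implied completion rate (USD per 1M tokens):", round(completion_rate_per_million, 2))
```

With the figures in this report, both checks hold, and the implied rates work out to about $3 per million input tokens and $15 per million completion tokens.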

Latency (ms)

Type            Value
min             10292.00 ms
max             104451.63 ms
p50             23759.00 ms
p90             92095.00 ms
p95             104511.00 ms
p99             104511.00 ms
mean            40459.47 ms
stdDeviation    31368.493 ms
total           15