Test summary
Prompt
25 Tools Run v34
Dataset
Multi-mcp-history
Status
15 Completed
Test run duration
4 minutes 55 seconds
Evaluation cost
-
Started At.
Jul 3, 2025, 9:38:21 PM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Pass
0.8
80%
Tokens
Type
Value
Total tokens
1177682
Input tokens
1154818
Completion tokens
22864
Cost
Type
Value
Total cost
$ 3.807414
Input token cost
$ 3.464454
Completion token cost
$ 0.34296
Latency (ms)
Type
Value
min
10292.00 ms
max
104451.63 ms
p50
23759.00 ms
p90
92095.00 ms
p95
104511.00 ms
p99
104511.00 ms
mean
40459.47 ms
stdDeviation
31368.493
total
15