Test summary
Prompt
25 Tools Run v32
Dataset
Multi-mcp-history
Status
15 Completed
Test run duration
4 minutes 20 seconds
Evaluation cost
-
Started At.
Jul 3, 2025, 5:45:58 PM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Fail
0.73
73.33%
Tokens
Type
Value
Total tokens
873316
Input tokens
851416
Completion tokens
21900
Cost
Type
Value
Total cost
$ 14.41374
Input token cost
$ 12.77124
Completion token cost
$ 1.6425
Latency (ms)
Type
Value
min
12593.00 ms
max
101764.50 ms
p50
23135.00 ms
p90
93695.00 ms
p95
101823.00 ms
p99
101823.00 ms
mean
36261.33 ms
stdDeviation
27116.4625
total
15