Test summary
Prompt
25 Tools Run v37
Dataset
Multi-mcp-history
Status
15 Completed
Test run duration
5 minutes 41 seconds
Evaluation cost
-
Started At.
Jul 4, 2025, 8:54:42 AM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Fail
0.67
66.67%
Tokens
Type
Value
Total tokens
1362500
Input tokens
1340581
Completion tokens
21919
Cost
Type
Value
Total cost
$ 4.350528
Input token cost
$ 4.021743
Completion token cost
$ 0.328785
Latency (ms)
Type
Value
min
6361.00 ms
max
119857.32 ms
p50
16943.00 ms
p90
71039.00 ms
p95
119871.00 ms
p99
119871.00 ms
mean
34265.47 ms
stdDeviation
30331.9477
total
15