Test summary
Prompt
25 Tools Run v31
Dataset
Multi-mcp-history
Status
15 Completed
Test run duration
3 minutes 21 seconds
Evaluation cost
-
Started At.
Jul 3, 2025, 4:52:03 PM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Fail
0.73
73.33%
Tokens
Type
Value
Total tokens
843415
Input tokens
791120
Completion tokens
8390
Cost
Type
Value
Total cost
$ 1.009875
Input token cost
$ 0.9889
Completion token cost
$ 0.020975
Latency (ms)
Type
Value
min
9684.00 ms
max
179561.45 ms
p50
28351.00 ms
p90
82495.00 ms
p95
179583.00 ms
p99
179583.00 ms
mean
41671.47 ms
stdDeviation
42818.0179
total
15