Test summary
Prompt
25 Tools Run v15
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
10 minutes 11 seconds
Evaluation cost
-
Started At.
Jun 4, 2025, 9:34:54 AM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Fail
0.67
66.67%
Tokens
Type
Value
Total tokens
885395
Input tokens
868967
Completion tokens
16428
Cost
Type
Value
Total cost
$ 14.266605
Input token cost
$ 13.034505
Completion token cost
$ 1.2321
Latency (ms)
Type
Value
min
8168.00 ms
max
88025.18 ms
p50
31743.00 ms
p90
79935.00 ms
p95
88063.00 ms
p99
88063.00 ms
mean
39532.67 ms
stdDeviation
21209.0754
total
15