Test summary
Prompt
48 Tools Run v28
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
2 minutes 38 seconds
Evaluation cost
-
Started At
Jul 4, 2025, 10:44:52 AM
Author
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Fail
Mean score
0.53
Pass rate
53.33%
Tokens
Type
Value
Total tokens
729025
Input tokens
675874
Completion tokens
10420
Cost
Type
Value
Total cost
$ 0.8708925
Input token cost
$ 0.8448425
Completion token cost
$ 0.02605
Latency (ms)
Type
Value
min
11208.00 ms
max
137130.79 ms
p50
28607.00 ms
p90
98943.00 ms
p95
137215.00 ms
p99
137215.00 ms
mean
43461.07 ms
stdDeviation
36990.0823 ms
total
15