Test summary
Prompt
25 Tools Run v11
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
57 seconds
Evaluation cost
-
Started At.
Jun 4, 2025, 6:31:19 AM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Fail
0.53
53.33%
Tokens
Type
Value
Total tokens
384064
Input tokens
376478
Completion tokens
7586
Cost
Type
Value
Total cost
$ 0.813644
Input token cost
$ 0.7529560000000002
Completion token cost
$ 0.06068800000000001
Latency (ms)
Type
Value
min
2699.00 ms
max
30318.97 ms
p50
5147.00 ms
p90
11391.00 ms
p95
30319.00 ms
p99
30319.00 ms
mean
7557.20 ms
stdDeviation
6541.1281
total
15