Test summary
Prompt
25 Tools Run v16
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
11 minutes 53 seconds
Evaluation cost
-
Started At.
Jun 4, 2025, 10:46:44 AM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Fail
0.73
73.33%
Tokens
Type
Value
Total tokens
1039812
Input tokens
1021601
Completion tokens
18211
Cost
Type
Value
Total cost
$ 3.337968
Input token cost
$ 3.064803
Completion token cost
$ 0.273165
Latency (ms)
Type
Value
min
5751.00 ms
max
69109.97 ms
p50
19919.00 ms
p90
62271.00 ms
p95
69119.00 ms
p99
69119.00 ms
mean
29457.20 ms
stdDeviation
19260.6533
total
15