Test summary
Prompt
25 Tools Run v38
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
7 minutes 23 seconds
Evaluation cost
-
Started At.
Jul 4, 2025, 10:55:54 AM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Fail
0.6
60%
Tokens
Type
Value
Total tokens
650017
Input tokens
602947
Completion tokens
11808
Cost
Type
Value
Total cost
$ 0.78320375
Input token cost
$ 0.7536837500000001
Completion token cost
$ 0.02952
Latency (ms)
Type
Value
min
9283.00 ms
max
417276.90 ms
p50
28847.00 ms
p90
111487.00 ms
p95
417279.00 ms
p99
417279.00 ms
mean
59494.40 ms
stdDeviation
98809.0744
total
15