Test summary
Prompt
48 Tools Run v14
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
14 minutes 28 seconds
Evaluation cost
-
Started At.
Jun 4, 2025, 8:10:31 AM
Author
M
Madhu Shantan
Summary by evaluator
Tool Call Accuracy
Result
Mean score
Pass rate
Fail
0.67
66.67%
Tokens
Type
Value
Total tokens
1258155
Input tokens
1238947
Completion tokens
19208
Cost
Type
Value
Total cost
$ 4.004961000000001
Input token cost
$ 3.716841
Completion token cost
$ 0.28812
Latency (ms)
Type
Value
min
6856.00 ms
max
99850.62 ms
p50
23551.00 ms
p90
59199.00 ms
p95
99903.00 ms
p99
99903.00 ms
mean
29218.00 ms
stdDeviation
22959.4498
total
15