Test summary

Prompt
48 Tools Run v10
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
11 minutes 49 seconds
Evaluation cost
-
Started At.
Jun 3, 2025, 2:24:11 PM
Author
MMadhu Shantan

Summary by evaluator

Tool Call Accuracy
ResultMean scorePass rate
Fail
0.6766.67%

Tokens

TypeValue
Total tokens1073298
Input tokens1054970
Completion tokens18328

Cost

TypeValue
Total cost$ 17.19915
Input token cost$ 15.82455
Completion token cost$ 1.3746

Latency (ms)

TypeValue
min7897.00 ms
max159776.44 ms
p5034975.00 ms
p90 99007.00 ms
p95 159871.00 ms
p99 159871.00 ms
mean 45469.20 ms
stdDeviation37979.2003
total15