Test summary

Prompt
25 Tools Run v15
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
10 minutes 11 seconds
Evaluation cost
-
Started At.
Jun 4, 2025, 9:34:54 AM
Author
MMadhu Shantan

Summary by evaluator

Tool Call Accuracy
ResultMean scorePass rate
Fail
0.6766.67%

Tokens

TypeValue
Total tokens885395
Input tokens868967
Completion tokens16428

Cost

TypeValue
Total cost$ 14.266605
Input token cost$ 13.034505
Completion token cost$ 1.2321

Latency (ms)

TypeValue
min8168.00 ms
max88025.18 ms
p5031743.00 ms
p90 79935.00 ms
p95 88063.00 ms
p99 88063.00 ms
mean 39532.67 ms
stdDeviation21209.0754
total15