Test summary

Prompt: 25 Tools Run v10
Dataset: mcp-toolcall-acc
Status: 15 Completed
Test run duration: 10 minutes 20 seconds
Evaluation cost: -
Started at: Jun 4, 2025, 6:14:03 AM
Author: Madhu Shantan

Summary by evaluator

Tool Call Accuracy
Result   Mean score   Pass rate
Fail     0.73         73.33%

Tokens

Type                Value
Total tokens        877522
Input tokens        862058
Completion tokens   15464

Cost

Type                    Value
Total cost              $ 2.818134
Input token cost        $ 2.586174
Completion token cost   $ 0.231960
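
The token and cost figures above can be cross-checked against each other. The sketch below assumes flat rates of $3 per million input tokens and $15 per million completion tokens. These rates are not stated in the report; they are inferred from the reported numbers, and the helper name token_cost is hypothetical.

```python
from decimal import Decimal

# Token counts as reported in the Tokens table.
input_tokens = 862058
completion_tokens = 15464

# ASSUMPTION: flat USD rates per one million tokens, inferred from the
# reported costs. The report itself does not state any pricing.
INPUT_RATE_PER_M = Decimal("3")
COMPLETION_RATE_PER_M = Decimal("15")


def token_cost(tokens: int, rate_per_million: Decimal) -> Decimal:
    """Return the cost in USD of `tokens` at a flat per-million rate."""
    return Decimal(tokens) * rate_per_million / Decimal(1_000_000)


total_tokens = input_tokens + completion_tokens
input_cost = token_cost(input_tokens, INPUT_RATE_PER_M)
completion_cost = token_cost(completion_tokens, COMPLETION_RATE_PER_M)
total_cost = input_cost + completion_cost

print("total tokens:", total_tokens)
print("input cost:", input_cost)
print("completion cost:", completion_cost)
print("total cost:", total_cost)
```

Decimal arithmetic is used here so the sums stay exact; binary floating point is the likely source of trailing digits such as 0.2319600000000001 in the raw export.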

Latency (ms)

Type           Value
min            6212.00 ms
max            48851.60 ms
p50            21215.00 ms
p90            45791.00 ms
p95            48863.00 ms
p99            48863.00 ms
mean           22139.07 ms
stdDeviation   12229.253
total          15