Test summary

Prompt
25 Tools Run v11
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
57 seconds
Evaluation cost
-
Started At.
Jun 4, 2025, 6:31:19 AM
Author
MMadhu Shantan

Summary by evaluator

Tool Call Accuracy
ResultMean scorePass rate
Fail
0.5353.33%

Tokens

TypeValue
Total tokens384064
Input tokens376478
Completion tokens7586

Cost

TypeValue
Total cost$ 0.813644
Input token cost$ 0.7529560000000002
Completion token cost$ 0.06068800000000001

Latency (ms)

TypeValue
min2699.00 ms
max30318.97 ms
p505147.00 ms
p90 11391.00 ms
p95 30319.00 ms
p99 30319.00 ms
mean 7557.20 ms
stdDeviation6541.1281
total15