Test summary

Prompt: 25 Tools Run v16
Dataset: mcp-toolcall-acc
Status: 15 Completed
Test run duration: 11 minutes 53 seconds
Evaluation cost: -
Started at: Jun 4, 2025, 10:46:44 AM
Author: Madhu Shantan

Summary by evaluator

Tool Call Accuracy
Result    Mean score    Pass rate
Fail      0.73          73.33%
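
The pass rate can be related back to the 15 completed entries with a little arithmetic. The sketch below assumes the pass rate is simply passed entries divided by total entries; the report does not state the passed count or the pass threshold, so `passed` is inferred here rather than taken from the report.

```python
# Values taken from the report.
total_entries = 15
reported_pass_rate = 73.33  # percent

# Assumption: pass rate = passed / total. Infer the nearest whole number of passes.
passed = round(total_entries * reported_pass_rate / 100)

# Recompute the rate from the inferred count to confirm it matches the report.
recomputed_pass_rate = round(passed / total_entries * 100, 2)

print(f"inferred passes: {passed} of {total_entries}")
print(f"recomputed pass rate: {recomputed_pass_rate}%")
```

Under that assumption, the remaining entries (total minus inferred passes) would be the ones that failed the Tool Call Accuracy check.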

Tokens

Type                 Value
Total tokens         1039812
Input tokens         1021601
Completion tokens    18211

Cost

Type                     Value
Total cost               $3.337968
Input token cost         $3.064803
Completion token cost    $0.273165
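
The token and cost figures can be cross-checked against each other. The following is a minimal sketch, not part of the evaluation tooling: it confirms that the input and completion values sum to the reported totals and derives the per-million-token rates implied by the numbers. The report does not name the model or its pricing, so the rates are inferred from the figures, and all variable names are illustrative.

```python
from decimal import Decimal

# Figures copied from the Tokens and Cost tables in the report.
input_tokens = 1_021_601
completion_tokens = 18_211
total_tokens = 1_039_812

input_cost = Decimal("3.064803")
completion_cost = Decimal("0.273165")
total_cost = Decimal("3.337968")

# Check that the components add up to the reported totals.
tokens_consistent = input_tokens + completion_tokens == total_tokens
cost_consistent = input_cost + completion_cost == total_cost

# Implied price in dollars per million tokens (inferred, not stated in the report).
input_rate_per_million = input_cost / input_tokens * 1_000_000
completion_rate_per_million = completion_cost / completion_tokens * 1_000_000

print("tokens consistent:", tokens_consistent)
print("cost consistent:", cost_consistent)
print("implied input rate per 1M tokens:", input_rate_per_million)
print("implied completion rate per 1M tokens:", completion_rate_per_million)
```

`Decimal` is used instead of `float` so that the six-decimal dollar amounts compare exactly. Input tokens account for nearly all of the run's token volume and most of its cost, which is what one would expect when a prompt carrying 25 tool definitions is sent with every entry.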

Latency (ms)

Type            Value
min             5751.00 ms
max             69109.97 ms
p50             19919.00 ms
p90             62271.00 ms
p95             69119.00 ms
p99             69119.00 ms
mean            29457.20 ms
stdDeviation    19260.6533
total           15