Test summary

Prompt
25 Tools Run v38
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
7 minutes 23 seconds
Evaluation cost
-
Started At.
Jul 4, 2025, 10:55:54 AM
Author
MMadhu Shantan

Summary by evaluator

Tool Call Accuracy
ResultMean scorePass rate
Fail
0.660%

Tokens

TypeValue
Total tokens650017
Input tokens602947
Completion tokens11808

Cost

TypeValue
Total cost$ 0.78320375
Input token cost$ 0.7536837500000001
Completion token cost$ 0.02952

Latency (ms)

TypeValue
min9283.00 ms
max417276.90 ms
p5028847.00 ms
p90 111487.00 ms
p95 417279.00 ms
p99 417279.00 ms
mean 59494.40 ms
stdDeviation98809.0744
total15