Test summary

Prompt
48 Tools Run v14
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
14 minutes 28 seconds
Evaluation cost
-
Started At.
Jun 4, 2025, 8:10:31 AM
Author
MMadhu Shantan

Summary by evaluator

Tool Call Accuracy
ResultMean scorePass rate
Fail
0.6766.67%

Tokens

TypeValue
Total tokens1258155
Input tokens1238947
Completion tokens19208

Cost

TypeValue
Total cost$ 4.004961000000001
Input token cost$ 3.716841
Completion token cost$ 0.28812

Latency (ms)

TypeValue
min6856.00 ms
max99850.62 ms
p5023551.00 ms
p90 59199.00 ms
p95 99903.00 ms
p99 99903.00 ms
mean 29218.00 ms
stdDeviation22959.4498
total15