Test summary

Prompt
48 Tools Run v16
Dataset
mcp-toolcall-acc
Status
15 Completed
Test run duration
45 seconds
Evaluation cost
-
Started At.
Jun 4, 2025, 11:16:26 AM
Author
MMadhu Shantan

Summary by evaluator

Tool Call Accuracy
ResultMean scorePass rate
Fail
0.4746.67%

Tokens

TypeValue
Total tokens424682
Input tokens417487
Completion tokens7195

Cost

TypeValue
Total cost$ 0.8925340000000002
Input token cost$ 0.834974
Completion token cost$ 0.05756

Latency (ms)

TypeValue
min4675.00 ms
max31840.14 ms
p507803.00 ms
p90 16767.00 ms
p95 31855.00 ms
p99 31855.00 ms
mean 10584.80 ms
stdDeviation6730.456
total15