Test summary
HTTP Endpoint
Medical Assistant
Dataset
medical_assistant_eval_dataset
Status
10 Completed
Test run duration
3 minutes 1 second
Evaluation cost
$0.1395750
Started At.
Apr 29, 2025, 7:52:15 PM
Author
U
Utsav Khandelwal
Summary by evaluator
Clarity
Output Relevance
Vertex Question Answering Helpfulness
Vertex Question Answering Relevance
Semantic Similarity
Correctness
Result
Mean score
Pass rate
Pass
1
100%
Pass
0.88
90%
Fail
0.76
70%
Pass
0.84
80%
Fail
0.65
0
Pass
-
90%
Latency (ms)
Type
Value
min
22468.00 ms
max
114580.52 ms
p50
64415.00 ms
p90
106815.00 ms
p95
114623.00 ms
p99
114623.00 ms
mean
72314.40 ms
stdDeviation
29582.9551
total
10