Skip to content
GCP-PMLE
Serving and scaling models
medium
Question 4 of 40

A mobile application needs low-latency responses to user actions. Which Vertex AI inference option is the best fit?

ABatch inference to a Model resource
BOnline inference through an Endpoint
CModel evaluation job
DKnowledge Catalog search

Educational Content — CertQnA practice questions are written against official exam objectives, covering the same domains tested on the real exam. All content is original and independent — not actual exam questions, not affiliated with any certification vendor. Learn more about our content policy

Discussion

Be the first to share your understanding of this concept

⚠️ Discussion is for concept clarification only. Do not share or request actual exam questions or answers.

Sign in to join the discussion