KServe Triton Latency

KServe Triton Latency Dashboard

Latency metrics for pre/post-process, predict, and explain steps for Triton Inference Service with KServe & total number of requests. Requires inferenceservices to use KServe >= v0.10. See KServe observability docs for more info.

See also: Triton Inference Server Grafana Dashboard

Revisions
RevisionDescriptionCreated

Get this dashboard

Import the dashboard template

or

Download JSON

Datasource
Dependencies