Text Generation Inference

Monitor TGI services on kubernetes

Text Generation Inference screenshot 1

This serves as a fundamental dashboard example for monitoring text generation inference deployed on Kubernetes, leveraging Knative-Serving.

Adapted from an official example, we added namespace and service variables.

To begin, you need to apply a service monitor to channel the metrics into Prometheus.

Revisions
RevisionDescriptionCreated

Get this dashboard

Import the dashboard template

or

Download JSON

Datasource
Dependencies