vLLM Inference Monitor (中文汉化版)

基于原版 #24755 的完整中文汉化版本。覆盖 vLLM 推理核心指标:调度器效率、KV Cache 使用率、TTFT/TPOT 延迟、Prefix Cache 命中率、请求完成原因分布等。已适配 Grafana 10.4+,开箱即用。

vLLM Inference Monitor (中文汉化版) screenshot 1
The vLLM Inference Monitor (中文汉化版) dashboard uses the prometheus data source to create a Grafana dashboard with the bargauge, gauge, heatmap, stat and timeseries panels.
Revisions
RevisionDescriptionCreated

Get this dashboard

Import the dashboard template

or

Download JSON

Datasource
Dependencies