-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Description
Bug Description
The RequestHandlerAvgIdlePercent metric isn't being collected with the Strimzi Metrics Reporter. This is happening because it is being set with the _total suffix on the default metrics allow list:
Line 131 in eb50698
| "kafka_server_kafkarequesthandlerpool_requesthandleravgidlepercent_total", |
Setting it on the metricsConfig allowList without the _total suffix works (and it gets created with the _total suffix)
However its usage on the grafana dashboard example doesn't seem to be correct, since it's not showing a percentage but an ever increasing value.
Line 2315 in c643d5d
| "expr": "sum(kafka_server_kafkarequesthandlerpool_requesthandleravgidlepercent_total{namespace=\"$kubernetes_namespace\",strimzi_io_cluster=\"$strimzi_cluster_name\",kubernetes_pod_name=~\"$strimzi_cluster_name-$kafka_broker\"}*100) by (kubernetes_pod_name)", |
I suppose this is because it's being created as a counter. Maybe another one should be created from the MeanRate as it seems to happen with the JMX config:
strimzi-kafka-operator/packaging/examples/metrics/kafka-metrics.yaml
Lines 143 to 146 in e6c39dd
| # Ex) kafka.server<type=(KafkaRequestHandlerPool), name=(RequestHandlerAvgIdlePercent)><>MeanRate | |
| - pattern: kafka.(\w+)<type=(.+), name=(.+)Percent\w*><>MeanRate | |
| name: kafka_$1_$2_$3_percent | |
| type: GAUGE |
Steps to reproduce
No response
Expected behavior
No response
Strimzi version
0.48.0
Kubernetes version
Kubernetes 1.34
Installation method
No response
Infrastructure
No response
Configuration files and logs
No response
Additional context
No response