概述目标:基于业务自定义指标进行自动伸缩,使用稳定窗口与不同上/下行策略减少抖动,提升成本与性能均衡。适用:请求队列长度、处理速率、并发数等业务驱动伸缩场景。核心与实战Prometheus Adapter指标暴露(示意):apiVersion: v1
kind: ConfigMap
metadata: { name: adapter-config, namespace: custom-metrics }
data:
config.yaml: |
rules:
- seriesQuery: 'worker_inflight_jobs{namespace!="",pod!=""}'
resources:
overrides:
namespace: { resource: "namespace" }
pod: { resource: "pod" }
name:
matches: "worker_inflight_jobs"
as: "worker_inflight_jobs"
metricsQuery: 'sum(rate(worker_inflight_jobs[1m])) by (namespace)'
HPA v2配置:apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
name: worker-hpa
namespace: prod
spec:
scaleTargetRef:
apiVersion: apps/v1
kind: Deployment
name: worker
minReplicas: 2
maxReplicas: 50
metrics:
- type: Pods
pods:
metric:
name: worker_inflight_jobs
target:
type: AverageValue
averageValue: 100
behavior:
stabilizationWindowSeconds: 300
scaleUp:
policies:
- type: Percent
value: 100
periodSeconds: 60
selectPolicy: Max
stabilizationWindowSeconds: 0
scaleDown:
policies:
- type: Percent
value: 50
periodSeconds: 60
selectPolicy: Max
stabilizationWindowSeconds: 300
示例应用与验证:kubectl apply -f hpa.yaml
kubectl describe hpa worker-hpa
指标与副本:kubectl get --raw "/apis/custom.metrics.k8s.io/v1beta1/namespaces/prod/pods/*/worker_inflight_jobs" | jq
kubectl get hpa worker-hpa -n prod -w
验证与监控抖动控制:通过`stabilizationWindowSeconds`与不同上下行策略控制频繁伸缩;观察副本变化。指标质量:确保Adapter规则正确映射与聚合;监控数据新鲜度与误触发。成本与容量:结合节点资源与队列滞后设定目标值与上限,防止过度伸缩。常见误区指标过小粒度导致抖动;应聚合至命名空间或Deployment层级。未配置稳定窗口导致在负载波动下频繁缩容扩容;需合理设置。忽视下行策略,缩容过快导致任务积压;需限制缩容速度。结语通过HPA v2自定义指标与行为策略可实现业务驱动的稳定伸缩,避免抖动并兼顾成本与性能。

发表评论 取消回复