Interface HorizontalPodAutoscalerProps

    • Method Detail

      • getMaxReplicas

        @Stability(Stable)
        @NotNull
        Number getMaxReplicas()
        The maximum number of replicas that can be scaled up to.
      • getTarget

        @Stability(Stable)
        @NotNull
        IScalable getTarget()
        The workload to scale up or down.

        Scalable workload types:

        • Deployment
        • StatefulSet
      • getMetrics

        @Stability(Stable)
        @Nullable
        default List<Metric> getMetrics()
        The metric conditions that trigger a scale up or scale down.

        Default: - If metrics are not provided, then the target resource constraints (e.g. cpu limit) will be used as scaling metrics.

      • getMinReplicas

        @Stability(Stable)
        @Nullable
        default Number getMinReplicas()
        The minimum number of replicas that can be scaled down to.

        Can be set to 0 if the alpha feature gate HPAScaleToZero is enabled and at least one Object or External metric is configured.

        Default: 1

      • getScaleDown

        @Stability(Stable)
        @Nullable
        default ScalingRules getScaleDown()
        The scaling behavior when scaling down.

        Default: - Scale down to minReplica count with a 5 minute stabilization window.

      • getScaleUp

        @Stability(Stable)
        @Nullable
        default ScalingRules getScaleUp()
        The scaling behavior when scaling up.

        Default: - Is the higher of: * Increase no more than 4 pods per 60 seconds * Double the number of pods per 60 seconds