Resource Efficiency

Overview

Resource efficiency measures how well you're using the resources you're paying for.

The fundamental question:

"Of all the CPU and memory I'm paying for, how much am I actually using?"

Why it matters:

High efficiency (70-80%) → Getting value for money
Low efficiency (<30%) → Paying for waste

Kubeadapt's approach:

Measure actual resource usage
Compare to resource requests (what you pay for)
Calculate efficiency percentages
Identify opportunities to improve

The Efficiency Problem

What is Paid For vs. What is Used

In Kubernetes, costs are based on resource requests, not actual usage.

Example deployment:

```yaml
resources:
  requests:
    cpu: "2000m"
    memory: "4Gi"
```

What you might actually use:

Actual CPU usage (P95): 400m (20% of requested)
Actual memory usage (P99): 1.5Gi (37.5% of requested)

Your efficiency:

CPU efficiency: 400m / 2000m = 20%
Memory efficiency: 1.5Gi / 4Gi = 37.5%

Translation: 80% of CPU budget and 62.5% of memory budget are wasted.
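The arithmetic above can be reproduced with a small script. This is a minimal sketch, not Kubeadapt code: `parse_quantity` and `efficiency_pct` are hypothetical helper names, and the parser only handles the common Kubernetes quantity suffixes (it skips exponent notation and the larger P/E suffixes).

```python
from decimal import Decimal

# Common Kubernetes quantity suffixes (simplified; assumption: no exponent notation).
_SUFFIXES = {
    "Ki": Decimal(2) ** 10, "Mi": Decimal(2) ** 20,
    "Gi": Decimal(2) ** 30, "Ti": Decimal(2) ** 40,
    "k": Decimal(10) ** 3, "M": Decimal(10) ** 6,
    "G": Decimal(10) ** 9, "T": Decimal(10) ** 12,
    "m": Decimal(1) / 1000,  # milli-units, e.g. millicores
}


def parse_quantity(quantity: str) -> Decimal:
    """Convert a quantity such as "2000m" or "4Gi" to base units (cores or bytes)."""
    # Check two-character suffixes first so "Mi" is not mistaken for "M".
    for suffix in sorted(_SUFFIXES, key=len, reverse=True):
        if quantity.endswith(suffix):
            return Decimal(quantity[: -len(suffix)]) * _SUFFIXES[suffix]
    return Decimal(quantity)


def efficiency_pct(usage: str, request: str) -> float:
    """Observed usage as a percentage of the requested amount."""
    return float(parse_quantity(usage) / parse_quantity(request) * 100)


cpu = efficiency_pct("400m", "2000m")      # CPU figures from the example
memory = efficiency_pct("1.5Gi", "4Gi")    # memory figures from the example
print(f"CPU efficiency: {cpu}%, wasted: {100 - cpu}%")
print(f"Memory efficiency: {memory}%, wasted: {100 - memory}%")
```

Using `Decimal` keeps the unit conversion exact, so the results match the hand calculation.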

Industry reality: Average Kubernetes CPU efficiency is just 10-13%.

How Efficiency Is Calculated

The Core Concept

Efficiency measures how well requested resources are being utilized:

Resource Efficiency = (Observed Usage / Requested Resources) × 100%

How Usage is Measured:

Kubeadapt uses percentile-based analysis to calculate observed usage:

CPU Usage: Based on the 95th percentile (P95) of actual consumption
Memory Usage: Based on the 99th percentile (P99) of actual consumption
Time-weighted: Recent data carries more weight than older data (30-day lookback with recency bias)

The percentile approach varies by environment:

Production workloads: P95 for CPU, P99 for memory (conservative)
Non-production workloads: P50 for both CPU and memory (aggressive cost optimization with high limits as safety buffer)
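To make the percentile and recency ideas concrete, here is a rough sketch of a recency-weighted percentile. The document only states that recent data carries more weight; the exponential decay and the `half_life_days` parameter below are assumptions chosen for illustration, not Kubeadapt's actual weighting scheme.

```python
def weighted_percentile(samples, percentile, half_life_days=7.0):
    """Percentile of usage samples where newer samples count more.

    samples: list of (age_days, value) pairs; age 0 is the newest sample.
    half_life_days: hypothetical decay setting; a sample this old counts
    half as much as a fresh one.
    """
    if not samples:
        raise ValueError("at least one sample is required")
    # Sort by value and attach an exponentially decaying weight to each sample.
    weighted = sorted(
        (value, 0.5 ** (age_days / half_life_days)) for age_days, value in samples
    )
    total_weight = sum(weight for _, weight in weighted)
    threshold = total_weight * percentile / 100.0
    running = 0.0
    # Return the first value at which the cumulative weight reaches the threshold.
    for value, weight in weighted:
        running += weight
        if running >= threshold:
            return value
    return weighted[-1][0]


# One old spike (29 days ago) and three recent samples, in millicores.
cpu_samples = [(29, 1000), (0, 100), (1, 100), (2, 100)]
print(weighted_percentile(cpu_samples, 95))  # production-style CPU percentile
print(weighted_percentile(cpu_samples, 50))  # non-production-style percentile
```

With this weighting, a single spike near the end of the 30-day window has little influence on the result, whereas an unweighted P95 over the same four samples would return the spike.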

Efficiency Levels Explained

CPU Efficiency

Excellent (70-85%)

Requested: 1000m
Using: 700-850m
Actions to be taken: None - well-sized with minimal waste

Good (50-70%)

Requested: 1000m
Using: 500-700m
Actions to be taken: Monitor - acceptable efficiency, minor optimization possible

Moderate (30-50%)

Requested: 1000m
Using: 300-500m
Actions to be taken: Right-size - moderate waste, reduce requests by 20-40%

Poor (15-30%)

Requested: 1000m
Using: 150-300m
Actions to be taken: Right-size urgently - significant waste, reduce requests by 40-60%

Very Poor (<15%)

Requested: 1000m
Using: <150m
Actions to be taken: Right-size immediately - severe over-provisioning, reduce by 60-80%
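The bands can be expressed as a simple lookup. The sketch below encodes the CPU bands above and the memory bands from the next section. Two details are assumptions, because the document does not specify them: a value on a shared boundary (for example exactly 70% CPU) is placed in the higher band, and values above the top band get the made-up label "Above range".

```python
# (lower bound %, upper bound %, level), ordered from best to worst.
CPU_BANDS = [
    (70, 85, "Excellent"), (50, 70, "Good"), (30, 50, "Moderate"),
    (15, 30, "Poor"), (0, 15, "Very Poor"),
]
MEMORY_BANDS = [
    (75, 90, "Excellent"), (60, 75, "Good"), (40, 60, "Moderate"),
    (20, 40, "Poor"), (0, 20, "Very Poor"),
]


def classify(efficiency_pct, bands):
    """Map an efficiency percentage to the level names used in this document."""
    if efficiency_pct < 0:
        raise ValueError("efficiency must be non-negative")
    top_upper_bound = bands[0][1]
    if efficiency_pct > top_upper_bound:
        # Assumed label: the document does not name this case.
        return "Above range"
    for lower, _upper, level in bands:
        if efficiency_pct >= lower:
            return level
    return bands[-1][2]


print(classify(20, CPU_BANDS))       # CPU figure from the earlier example
print(classify(37.5, MEMORY_BANDS))  # memory figure from the earlier example
```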

Memory Efficiency

Excellent (75-90%)

Requested: 4Gi
Using: 3-3.6Gi
Actions to be taken: None - well-sized

Good (60-75%)

Requested: 4Gi
Using: 2.4-3Gi
Actions to be taken: Monitor - acceptable efficiency

Moderate (40-60%)

Requested: 4Gi
Using: 1.6-2.4Gi
Actions to be taken: Right-size - reduce requests

Poor (20-40%)

Requested: 4Gi
Using: 0.8-1.6Gi
Actions to be taken: Right-size urgently - significant waste

Very Poor (<20%)

Requested: 4Gi
Using: <800Mi
Actions to be taken: Right-size immediately - severe over-provisioning

Cluster-Level Efficiency

Aggregate Metrics

Cluster efficiency:

Total requested CPU across all pods: 450 cores
Total actual usage: 180 cores
Cluster CPU efficiency: 180 / 450 = 40%

Total requested memory: 1.8 TB
Total actual usage: 720 GB
Cluster memory efficiency: 720 / 1800 = 40%

What this means:

Paying for 450 cores, using 180
Paying for 1.8 TB, using 720 GB
60% waste opportunity
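One point worth spelling out is that an aggregate figure like this is a ratio of totals, not an average of per-pod percentages. The sketch below illustrates the difference; `PodResources` and `cluster_cpu_efficiency` are hypothetical names used only for this example.

```python
from dataclasses import dataclass


@dataclass
class PodResources:
    cpu_request_cores: float
    cpu_usage_cores: float


def cluster_cpu_efficiency(pods):
    """Total observed usage divided by total requests, as a percentage."""
    total_requested = sum(p.cpu_request_cores for p in pods)
    total_used = sum(p.cpu_usage_cores for p in pods)
    if total_requested == 0:
        return 0.0
    return total_used / total_requested * 100


pods = [
    PodResources(cpu_request_cores=2.0, cpu_usage_cores=0.4),  # 20% efficient
    PodResources(cpu_request_cores=0.5, cpu_usage_cores=0.4),  # 80% efficient
]
aggregate = cluster_cpu_efficiency(pods)
mean_of_ratios = sum(p.cpu_usage_cores / p.cpu_request_cores for p in pods) / len(pods) * 100
print(f"ratio of totals: {aggregate:.1f}%, mean of per-pod ratios: {mean_of_ratios:.1f}%")
```

The ratio of totals weights large workloads more heavily, which is what matters for cost: the over-provisioned 2-core pod dominates the result.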

Node-Level Efficiency

Node capacity utilization:

Node: m5.2xlarge (example - AWS instance type)
Total capacity: 8 cores, 32 GB

Allocated (requests): 6 cores, 24 GB (75% of node capacity)
Actual usage: 3 cores, 15 GB (37.5% of CPU capacity, about 47% of memory capacity)

Node efficiency: 3 / 6 = 50% CPU, 15 / 24 = 62.5% memory

Note: This example uses AWS EC2 instance types for illustration. Kubeadapt supports AWS, GCP, and Azure. Node type examples will vary based on your cloud provider.

Two layers of waste:

Scheduling waste: 25% of node capacity is unallocated (2 cores, 8 GB not requested by any pod)
Over-provisioning waste: Allocated pods use only 50% of their CPU requests and 62.5% of their memory requests

Total waste:

Paying for: 8 cores
Using: 3 cores
Total efficiency: 3 / 8 = 37.5%
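The two layers multiply: total efficiency equals the allocated share of the node times the efficiency of the requests (6/8 × 3/6 = 3/8). A minimal sketch of that breakdown for the CPU figures above follows; the function and key names are hypothetical.

```python
def node_cpu_breakdown(capacity_cores, requested_cores, used_cores):
    """Split node capacity into unallocated, requested-but-idle, and used shares."""
    if capacity_cores <= 0 or requested_cores <= 0:
        raise ValueError("capacity and requests must be positive")
    allocated_share = requested_cores / capacity_cores
    request_efficiency = used_cores / requested_cores
    total_efficiency = allocated_share * request_efficiency  # equals used / capacity
    return {
        # Capacity that no pod has requested.
        "scheduling_waste_pct": (1 - allocated_share) * 100,
        # Capacity that is requested but not used.
        "overprovisioning_waste_pct": (allocated_share - total_efficiency) * 100,
        "total_efficiency_pct": total_efficiency * 100,
    }


breakdown = node_cpu_breakdown(capacity_cores=8, requested_cores=6, used_cores=3)
for name, value in breakdown.items():
    print(f"{name}: {value}%")
```

For the 8-core node, the three shares (25% unallocated, 37.5% requested but idle, 37.5% used) add up to the full node capacity.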

Namespace Efficiency

Per-team visibility:

Namespace: backend
Total requested: 80 cores, 320 GB
Total usage: 24 cores, 160 GB

CPU efficiency: 30%
Memory efficiency: 50%
Overall efficiency: 40%

Current waste: 60% of namespace resources
Potential efficiency improvement with 70% target: 30 percentage point gain

Efficiency vs. Utilization

Key Difference

Efficiency = Usage / Requests (what you pay for)
Utilization = Usage / Capacity (what's available)

Example:

Node capacity: 8 cores
Pod requests: 2 cores
Pod usage: 400m

Efficiency: 400m / 2000m = 20% (usage vs. requests)
Utilization: 400m / 8000m = 5% (usage vs. node capacity)

Why both matter:

Low efficiency:

Problem: Over-provisioning
Solution: Right-size pod requests

Low utilization:

Problem: Poor bin-packing
Solution: Add more pods or reduce node count
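The two metrics and their remedies can be sketched side by side. The 30% cut-offs below are hypothetical thresholds picked for illustration; the document does not define when utilization counts as low.

```python
def efficiency_and_utilization(usage_m, requests_m, capacity_m):
    """Return (efficiency %, utilization %) from values in millicores."""
    efficiency = usage_m / requests_m * 100   # usage vs. requests
    utilization = usage_m / capacity_m * 100  # usage vs. node capacity
    return efficiency, utilization


def diagnose(efficiency, utilization, low_efficiency=30.0, low_utilization=30.0):
    """List the problems and remedies described above (thresholds are assumptions)."""
    findings = []
    if efficiency < low_efficiency:
        findings.append("over-provisioning: right-size pod requests")
    if utilization < low_utilization:
        findings.append("poor bin-packing: add more pods or reduce node count")
    return findings


# Figures from the example: 8-core node, 2-core request, 400m usage.
efficiency, utilization = efficiency_and_utilization(400, 2000, 8000)
print(f"efficiency: {efficiency:.1f}%, utilization: {utilization:.1f}%")
for finding in diagnose(efficiency, utilization):
    print(finding)
```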

The Ideal State

Well-optimized cluster:

Node capacity: 8 cores
Pod requests: 7 cores (87.5% of capacity allocated)
Actual usage: 5 cores (71% efficiency, 62.5% utilization)

High utilization (filling nodes)
Good efficiency (requests match usage)

Poorly optimized cluster:

Node capacity: 8 cores
Pod requests: 4 cores (50% of capacity allocated)
Actual usage: 800m (20% efficiency, 10% utilization)

Low utilization (wasting nodes)
Poor efficiency (requests too high)

## Improving Efficiency Through Right-Sizing

Kubeadapt's [right-sizing](/docs/v1/concepts/rightsizing) is designed to improve both resource-level and cluster-level efficiency:

**Resource-Level Efficiency:**

When right-sizing is properly applied to individual workloads:
- Pod requests align with actual usage patterns
- CPU efficiency improves from industry average (10-13%) to target ranges (60-75%)
- Memory efficiency improves to target ranges (70-85%)

**Cluster-Level Efficiency:**

As workloads are right-sized across the cluster:
- Aggregate resource waste decreases
- Node utilization improves through better bin-packing
- Overall cluster efficiency increases, reducing infrastructure costs

**How It Works:**

1. Kubeadapt analyzes actual usage patterns (30-day lookback with recency weighting)
2. Generates right-sizing recommendations based on P95/P99 percentiles
3. Applies environment-specific strategies (production vs. non-production)
4. Monitors efficiency improvements post-implementation
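
As a rough illustration of how a percentile-based usage figure can translate into a request, the sketch below sizes a CPU request so that observed usage lands at a chosen target efficiency. This is an assumed formula for illustration only, not Kubeadapt's recommendation algorithm; `recommend_cpu_request_m` and its 70% default are hypothetical.

```python
import math


def recommend_cpu_request_m(observed_usage_m, target_efficiency_pct=70.0):
    """Smallest request (millicores) that keeps usage at or below the target efficiency."""
    if not 0 < target_efficiency_pct <= 100:
        raise ValueError("target efficiency must be in (0, 100]")
    return math.ceil(observed_usage_m * 100 / target_efficiency_pct)


current_request_m = 2000   # request from the earlier example deployment
observed_p95_m = 400       # P95 CPU usage from the same example
recommended_m = recommend_cpu_request_m(observed_p95_m)
reduction_pct = (current_request_m - recommended_m) / current_request_m * 100
print(f"recommended request: {recommended_m}m ({reduction_pct:.1f}% lower than {current_request_m}m)")
```

The headroom between usage and the new request comes entirely from the target efficiency: a lower target leaves more buffer, a higher one saves more.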