Home / BeaverDeck / Docs / Insights Guide / GPU Insights / GPU Fragmentation
GPU Fragmentation
BeaverDeck uses this check to identify a specific gpu condition that may need operator review.
| Check type | gpu-fragmentation |
|---|---|
| Insights section | GPU Insights |
| Alert severity | Warning |
When It Reports A Finding
A GPU Pod has been Pending for at least 5 minutes, total free GPU capacity across schedulable GPU nodes is sufficient, but no single node has enough free GPUs for that Pod.
Why This Is A Problem
Capacity exists but is stranded across nodes, so a multi-GPU workload cannot schedule and additional GPU nodes may be added unnecessarily.
Recommended Response
- Compare the pod's GPU request with free capacity on each schedulable GPU node.
- Consolidate or reschedule existing GPU workloads where disruption policies allow.
- Reduce the request, split the workload, or add a node shape that can satisfy the per-pod requirement.
Scope And Limitations
The check uses GPU request accounting. Other scheduler predicates such as affinity, taints, topology, CPU, memory, and storage can also block the pod.
After remediation: refresh GPU Insights and verify the underlying
resource or metric. Suppress the finding only when the condition is intentional and its risk is accepted.