BLOG07: Node Maintenance - Cordon/Drain

1. What is kubectl cordon?

Cordon = mark a node unschedulable

When a node is cordoned:

  • Kubernetes stops scheduling new pods on that node

  • Existing pods keep running

  • The node is marked SchedulingDisabled (a kind of safe mode for scheduling)

Command:

kubectl cordon <node-name>

Common use cases:

  • Before performing maintenance

  • Temporary pause on scheduling

  • Testing cluster behavior without disrupting running workloads

Visual Summary:

Action      New Pods         Existing Pods
cordon      Not scheduled    Keep running


2. What is kubectl drain?

Drain = evict all pods safely + mark node unschedulable

When a node is drained:

  1. Node becomes unschedulable (same as cordon)

  2. Kubernetes evicts all pods safely (graceful termination)

  3. Control plane reschedules pods to other nodes

Command:
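kubectl drain <node-name>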

What happens under the hood:

  • Pod Disruption Budgets (PDB) are checked

  • Graceful termination is respected

  • Replicas are recreated on other nodes

Visual:

Action      New Pods         Existing Pods
drain       Not scheduled    Evicted + rescheduled

Drain is used for:

  • Node upgrade (kubelet, OS, kernel patch)

  • Node replacement

  • Planned maintenance


3. What is the Node Release Process?

The Node Release Process is the operational workflow for safely taking a node out of service and bringing it back into service afterwards.

The process consists of 6 stages:


Stage 1: Cordon the node

Prevent new pods from being scheduled:
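kubectl cordon <node-name>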


Stage 2: Drain the node

Move running workloads away:
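kubectl drain <node-name> --ignore-daemonsets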

This safely evicts pods.


Stage 3: Perform Maintenance

Examples:

  • OS patching

  • Kubelet upgrade

  • Hardware change

  • Reboot

  • Cloud provider draining (AWS, GCP)


Stage 4: Bring node back online

After maintenance, the kubelet reconnects to the control plane.

Node status becomes: Ready
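A quick way to confirm (any kubectl context with access to the cluster works):

kubectl get node <node-name>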


Stage 5: Uncordon the node

Allow scheduling to resume:
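kubectl uncordon <node-name>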

Scheduling of new pods resumes.


Stage 6: Verify scheduling
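
One way to check, for example, is to confirm the node shows Ready without SchedulingDisabled and that new pods land on it:

kubectl get nodes
kubectl get pods -o wide -A | grep <node-name>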


Cordon vs Drain – Comparison

CORDON

  • Do not place new pods

  • Do not remove running pods

DRAIN

  • Do not place new pods

  • Evict all running pods


Additional Information: What happens if a node is deleted from cluster provider (AWS/GCP/VMware)?

  • Kubernetes marks it NotReady / Unreachable

  • After ~5 minutes (the default eviction timeout), its pods are evicted

  • kube-controller-manager (or the cloud controller manager) deletes the Node object once the instance is gone

  • Pods are recreated on healthy nodes automatically

This is called Node Self-Healing.


The time required to evict pods during a kubectl drain is not fixed — it depends on several factors. The following sections provide a detailed breakdown to help predict the timing accurately.


1. Pod Eviction Time Depends On the Pod's Termination Grace Period

Each pod has:
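  • a terminationGracePeriodSeconds value in its spec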

Default = 30 seconds

Therefore, each pod may take up to 30 seconds to shut down cleanly unless overridden.

When executing:
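kubectl drain <node-name>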

Kubernetes waits for the pod to:

  1. Run its preStop lifecycle hook (if one is defined)

  2. Receive SIGTERM

  3. Gracefully stop

  4. Be killed with SIGKILL if it exceeds the grace period

Example:

  • Pod A: grace period = 30s

  • Pod B: grace period = 10s

  • Pod C: grace period = 60s

Evictions run in parallel, so total drain time ≈ 60 seconds (the slowest pod dominates).


2. DaemonSet pods do NOT block drain

Drain automatically skips DaemonSet pods when using:
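kubectl drain <node-name> --ignore-daemonsets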

If this flag is not included, the drain refuses to proceed and exits with an error listing the DaemonSet-managed pods.


3. Pods using local storage (emptyDir) require a flag
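
kubectl drain <node-name> --delete-emptydir-data

(Older kubectl versions call this flag --delete-local-data.)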

If this flag is not provided, the drain fails immediately rather than hanging.


4. Pod Disruption Budgets (PDB) can block drain

Suppose a PDB requires that at least 1 replica remain available at all times.

Example PDB:
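A minimal sketch (the app name and labels are placeholders):

apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: my-app-pdb          # illustrative name
spec:
  minAvailable: 1           # at least one replica must stay up
  selector:
    matchLabels:
      app: my-app           # illustrative label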

If attempting to drain the last node hosting that pod, the drain will wait indefinitely, retrying the eviction until the PDB can be satisfied, or until eviction is bypassed with:
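
kubectl drain <node-name> --disable-eviction

(--disable-eviction deletes pods directly instead of going through the eviction API, bypassing PDB checks; use with caution.)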


5. Typical cluster drain timing

Drain time grows with the workload profile, from tens of seconds on simple clusters to several minutes per pod when grace periods are very long:

  • Small cluster, simple workloads

  • Medium workloads (web apps, backing services)

  • Heavy workloads (Java apps, large caches, ML processes)

  • Pods with very large grace periods

Example:
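A pod configured with a 5-minute grace period (illustrative value):

spec:
  terminationGracePeriodSeconds: 300   # 5 minutes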

Drain may take 5 minutes per pod.

This can be overridden with:
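kubectl drain <node-name> --grace-period=30

(The value 30 is illustrative; --grace-period imposes a shorter termination window on every pod being evicted.)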

But pods that need longer than the imposed period will be force-killed before they can shut down cleanly.


6. Monitoring eviction behavior

To monitor eviction behavior, execute:
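For example, watching cluster events and pod placement side by side works well:

kubectl get events --watch
kubectl get pods -o wide --watch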

The output will show:

  • Eviction request sent

  • SIGTERM delivered

  • Container shutting down

  • Pod deleted

  • New pod scheduled elsewhere


Summary

Factor                      Impact
Termination grace period    Biggest time factor (default 30s)
PDB                         May block indefinitely
DaemonSets                  Must be ignored or the drain errors out
Local volumes               Need the --delete-emptydir-data flag
Number of pods              Evictions run in parallel but still wait for slow pods

Typical drain time: 30–90 seconds for most clusters.
