🔥 Question 1
Explain the full Kubernetes control plane architecture and the request flow from kubectl to pod creation.
✅ Real Production Answer
Kubernetes control plane mainly consists of:
- kube-apiserver
- etcd
- kube-scheduler
- kube-controller-manager
- (cloud-controller-manager if cloud provider integrated)
Now the request flow:
1. When I run `kubectl apply -f deployment.yaml`, kubectl sends a REST API request to kube-apiserver.
2. kube-apiserver:
   - Authenticates (cert/token)
   - Authorizes (RBAC)
   - Validates object schema
   - Writes object state into etcd
3. The object is now stored as desired state in etcd.
4. The Deployment controller (inside controller-manager) sees the new Deployment object.
   - It creates a ReplicaSet.
5. The ReplicaSet controller sees the desired replicas and creates Pod objects.
6. Pods are now in Pending state.
7. kube-scheduler watches for unscheduled pods.
   - Applies filtering (resource availability, taints, affinity)
   - Scores nodes
   - Assigns the pod to a node
8. kubelet on that node:
   - Watches the API server
   - Pulls the image
   - Creates the container via the container runtime (containerd / CRI-O)
   - Reports status back
9. Pod becomes Running.
Production insight:
- API server is the only component talking to etcd.
- Everything else works via watch mechanism.
- If the scheduler is down → pods stay Pending.
- If the controller-manager is down → state reconciliation stops.
🔥 Question 2
What happens internally when you create a Deployment?
✅ Real Production Answer
Deployment is a higher-level abstraction.
When I create a Deployment:
- API server stores Deployment object.
- Deployment controller creates a ReplicaSet.
- ReplicaSet ensures desired replica count.
- Pods get created.
Deployment does not directly manage pods. It manages ReplicaSets.
During update:
- A new ReplicaSet is created.
- Old ReplicaSet scaled down gradually.
- Controlled by `maxSurge` and `maxUnavailable`.
Production insight:
- Rollbacks happen by scaling older ReplicaSet.
- If rollout fails, check ReplicaSet events.
- If a readiness probe fails → the rollout stalls.
🔥 Question 3
Difference between Deployment, StatefulSet, and DaemonSet, with production use cases.
✅ Deployment
- Stateless apps
- Web servers
- APIs
- Horizontally scalable
Pods are interchangeable.
Example: Frontend app behind LoadBalancer.
✅ StatefulSet
- Stable pod identity
- Ordered startup/shutdown
- Stable persistent storage
- Predictable DNS
Used for:
- Databases (MySQL, MongoDB)
- Kafka
- Elasticsearch
Each pod gets:
- pod-0, pod-1 naming
- Dedicated PVC
Production insight: Don't use StatefulSet unless you need stable identity or storage.
✅ DaemonSet
- One pod per node
- Runs on every node
Used for:
- Logging agents (Fluent Bit)
- Monitoring (Node Exporter)
- Security agents
If a new node joins → a DaemonSet pod is auto-created on it.
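As a sketch, a minimal DaemonSet manifest for a node-level logging agent might look like this (the name, namespace, and image tag are illustrative, not from the original text):

```yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: fluent-bit          # hypothetical name
  namespace: logging        # hypothetical namespace
spec:
  selector:
    matchLabels:
      app: fluent-bit
  template:
    metadata:
      labels:
        app: fluent-bit
    spec:
      tolerations:
        - operator: Exists  # tolerate all taints so the agent runs on every node
      containers:
        - name: fluent-bit
          image: fluent/fluent-bit:2.2   # example tag
```

The blanket `operator: Exists` toleration is what lets agents like this land even on tainted or control-plane nodes.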
🔥 Question 4
When should you use StatefulSet over Deployment, and why not always?
✅ Use StatefulSet when:
- Application needs stable hostname
- Persistent storage tied to instance
- Ordered scaling
- Clustered systems (Kafka, DB)
Example: Database cluster where each node has its own disk.
✅ Why not always?
- StatefulSets are slower to scale
- More complex
- Harder rolling updates
- Storage management complexity
- Cannot freely replace pods
If the app is stateless → Deployment is simpler and safer.
Interview insight: If someone says "I use StatefulSet for everything" → red flag.
🔥 Question 5
How does kube-scheduler make scheduling decisions?
✅ Real Production Answer
The scheduler works in two phases:
1️⃣ Filtering (Predicate phase)
Eliminates nodes that cannot run the pod:
- Not enough CPU/memory
- Taints not tolerated
- NodeSelector mismatch
- Affinity rules fail
- Volume binding constraints
2️⃣ Scoring phase
Ranks remaining nodes based on:
- Resource availability
- Spread
- Affinity preferences
- Topology
Best score wins.
Scheduler then binds pod to node.
Production insight:
- If a pod is stuck Pending → scheduler logs are key.
- If requests are not defined → scheduling becomes unpredictable.
- Resource requests are critical.
🔥 Question 6
What are scheduler predicates and priorities (or scheduling framework plugins)?
✅ Real Production Answer
Older Kubernetes versions used:
- Predicates → filtering phase
- Priorities → scoring phase
In modern Kubernetes this is handled through Scheduling Framework plugins, but it is conceptually the same idea.
🔹 Filtering (Predicates Equivalent)
Scheduler removes nodes that don't satisfy:
- Insufficient CPU/memory
- Taints not tolerated
- NodeSelector mismatch
- Node affinity required rules
- Volume binding constraints
- Node not Ready
If no node passes → the Pod stays Pending.
🔹 Scoring (Priorities Equivalent)
Among eligible nodes, scheduler scores based on:
- Least requested resources
- Balanced resource allocation
- Pod affinity preferences
- Topology spread constraints
Highest score wins.
🔥 Production Insight
If a pod is stuck Pending:
First check:
kubectl describe pod <pod>
Look at:
- Events section
- "0/5 nodes available"
That tells you which predicate failed.
At 12 LPA level, you must know:
Scheduling mostly depends on resource requests, not limits.
🔥 Question 7
How does kube-controller-manager work? Name key controllers.
✅ Real Production Answer
kube-controller-manager runs multiple controllers that reconcile desired state with actual state.
It constantly:
- Watches API server
- Compares desired vs current
- Takes action to fix drift
This is the reconciliation loop.
🔹 Important Controllers
- Deployment Controller
- ReplicaSet Controller
- Node Controller
- Job Controller
- Endpoint Controller
- ServiceAccount Controller
- Namespace Controller
- PersistentVolume Controller
Example:
If a pod crashes → the ReplicaSet controller detects fewer replicas → creates a new pod.
🔥 Production Insight
If controller-manager is down:
- No new pods created
- No node health checks
- No scaling actions
- Cluster state drifts
But existing running pods continue working.
🔥 Question 8
What happens if kube-controller-manager goes down?
✅ Real Production Answer
Existing workloads continue running because kubelet works independently.
But:
- No self-healing
- No scaling
- No ReplicaSet enforcement
- No Job completion
- No node failure handling
Example:
If a node dies:
- Node controller won't mark it NotReady.
- Pods won't get rescheduled.
Cluster slowly degrades.
🔥 Real Production Fix
- Control plane should run in HA mode.
- Controller-manager usually runs as static pod on master nodes.
- If one instance fails, another takes leadership.
🔥 Question 9
How does etcd store data, and why does quorum matter?
✅ Real Production Answer
etcd is a distributed key-value store.
It stores:
- All cluster state
- Pods
- Deployments
- ConfigMaps
- Secrets
- Node info
Everything in Kubernetes = object in etcd.
🔹 How It Works
- Uses Raft consensus algorithm
- Requires majority to agree before commit
- Strong consistency
If you have 3 etcd nodes:
- Minimum 2 required for quorum.
If you have 5:
- Minimum 3 required.
🔥 Why Quorum Matters
If quorum lost:
- Cluster becomes read-only
- API server cannot write
- No new objects created
- Cluster effectively dead
That's why: Never run single-node etcd in production.
🔥 Production Best Practice
- Always odd number of etcd nodes (3 or 5)
- Regular snapshots
- Separate etcd from worker load
🔥 Question 10
How do you design an HA control plane?
✅ Real Production Answer
For production:
🔹 Control Plane HA Components
- Multiple control plane nodes (minimum 3)
- etcd cluster with quorum
- Load Balancer in front of API servers
Flow:
kubectl → LoadBalancer → multiple kube-apiserver instances
Each API server:
- Talks to the etcd cluster
- Leader election used for controllers
🔹 Options
Managed (EKS, GKE, AKS):
- HA handled by cloud provider.
Self-managed:
- kubeadm with stacked etcd
- External etcd cluster
🔥 Production Insight
Common mistakes:
- Single master
- Single etcd
- No LB in front of API server
- No etcd backup
🔥 Interview Upgrade Answer
Mention:
- Use 3 control plane nodes
- Separate etcd disks (SSD)
- Enable API server audit logging
- Regular etcd snapshots
- Test restore process
If you mention restore testing → the interviewer knows you've done real work.
Now we're entering the area where most 2–3 year DevOps engineers collapse: networking.
If you master this section, your interview confidence will jump massively.
🔥 Question 11
How does pod-to-pod communication work across nodes?
✅ Real Production Answer
In Kubernetes, every pod gets:
- Its own IP
- Flat networking model
- No NAT between pods
Kubernetes follows:
Every pod can talk to every other pod directly via IP.
🔹 Same Node Communication
- Pods connected via a Linux bridge (like cni0)
- Traffic stays local
🔹 Cross-Node Communication
This is where CNI plugin matters.
Example (AWS EKS with VPC CNI):
- Pods get real VPC IPs
- ENIs attached to worker nodes
- Traffic routed via VPC
Example (Calico):
- Uses overlay networking (VXLAN/IPIP)
- Encapsulates traffic
Flow: Pod A → Node network → CNI routing → Node B → Pod B
🔥 Production Insight
If cross-node traffic fails:
Check:
- CNI plugin logs
- Node routes (ip route)
- Security groups (in cloud)
- NetworkPolicy
Networking issues are 80% of real cluster debugging.
🔥 Question 12
What is CNI, and what breaks if CNI fails?
✅ Real Production Answer
CNI = Container Network Interface.
It is the plugin responsible for:
- Assigning pod IP
- Configuring networking
- Managing routes
Without CNI:
- Pods won't get an IP
- Pods stuck in ContainerCreating
- Cross-node communication fails
🔹 Common CNIs
- AWS VPC CNI
- Calico
- Cilium
- Flannel
- Weave
🔥 Production Insight
If CNI pods crash:
- Entire cluster networking unstable
- New pods fail to start
- Services may break
Always monitor:
- CNI DaemonSet health
- IP exhaustion (very common in AWS)
IP exhaustion is a classic production issue.
🔥 Question 13
Difference between ClusterIP, NodePort, and LoadBalancer in real usage.
✅ ClusterIP (Default)
- Internal-only
- Accessible inside cluster
- Used for microservices communication
Example: Backend service accessed by frontend.
✅ NodePort
- Exposes the service on every node's IP + a static port (30000–32767)
- Mostly used for testing
- Not ideal for production
✅ LoadBalancer
- Cloud provider provisions external LB
- Exposes service publicly
- Used for production traffic
Example: Public API service.
🔥 Production Insight
Best practice: LoadBalancer → Ingress Controller → ClusterIP services
Avoid exposing every service with a LoadBalancer (cost issue).
🔥 Question 14
How does kube-proxy work (iptables vs IPVS modes)?
✅ Real Production Answer
kube-proxy manages service routing.
When you create a Service:
- kube-proxy sets up rules on nodes.
🔹 iptables Mode
- Uses Linux iptables rules
- Simple
- Slower at scale (large clusters)
🔹 IPVS Mode
- Uses Linux IP Virtual Server
- More efficient
- Better performance
- Recommended for large clusters
🔥 Production Insight
If service routing fails:
Check:
kubectl get svc
kubectl describe svc
iptables -L -n
In large clusters, IPVS performs better.
🔥 Question 15
What is a headless service, and when is it used?
✅ Real Production Answer
Headless Service = Service without ClusterIP.
Defined as:
clusterIP: None
No load balancing.
Instead:
- DNS returns all pod IPs.
🔹 Used In:
- StatefulSets
- Databases
- Direct pod-to-pod communication
- Kafka clusters
Example:
mysql-0.mysql-headless.default.svc.cluster.local
Each pod gets stable DNS.
🔥 Production Insight
If you need:
- Direct communication between cluster members
- Stable identity
- Peer discovery
Use headless service.
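A minimal headless Service sketch (the name, selector, and port are illustrative):

```yaml
apiVersion: v1
kind: Service
metadata:
  name: mysql-headless      # example name
spec:
  clusterIP: None           # this line makes the Service headless
  selector:
    app: mysql              # example label
  ports:
    - port: 3306
```

With `clusterIP: None`, a DNS lookup of the Service returns the pod IPs directly instead of a single virtual IP.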
🔥 Question 16
How does DNS resolution work inside the cluster?
✅ Real Production Answer
Kubernetes uses CoreDNS for internal DNS.
When a pod starts:
- kubelet injects DNS config into /etc/resolv.conf
- Nameserver usually points to the CoreDNS service IP
- Pod queries CoreDNS
- CoreDNS checks the Kubernetes API for service/pod records
- Returns the IP
🔹 Service DNS Format
<service-name>.<namespace>.svc.cluster.local
Example:
backend.default.svc.cluster.local
Short names work because of search domains.
🔹 For Headless Services
DNS returns:
- Multiple A records (one per pod)
🔥 Production Debugging
If DNS fails:
Check:
kubectl get pods -n kube-system
(Is CoreDNS running?)
Test inside a pod:
nslookup service-name
Common issues:
- CoreDNS crash
- NetworkPolicy blocking DNS (UDP 53)
- CNI issues
DNS failure = full microservice meltdown.
🔥 Question 17
How would you debug if one pod cannot reach another pod?
✅ Real Production Approach
I follow structured debugging:
Step 1: Basic Connectivity
From the source pod:
ping target-ip
curl target-service
If the IP works but the service name fails → DNS issue.
Step 2: Check Service
kubectl get svc
kubectl describe svc
kubectl get endpoints
If endpoints are empty → selector mismatch.
Step 3: Check NetworkPolicy
Very common mistake.
kubectl get networkpolicy
If a policy exists → verify ingress/egress rules.
Step 4: CNI & Node Level
- Check CNI pod health
- Check node routes
- Security groups (cloud)
🔥 Production Insight
80% of inter-pod issues are:
- Wrong label selector
- NetworkPolicy blocking
- Port mismatch
🔥 Question 18
How is NetworkPolicy enforced, and what are common mistakes?
✅ Real Production Answer
NetworkPolicy defines allowed traffic at pod level.
But important:
NetworkPolicy only works if CNI supports it.
Example:
- Calico supports it
- AWS VPC CNI alone doesn't enforce it (needs Calico)
🔹 Enforcement
NetworkPolicy:
- Applied at pod level
- Uses labels
- Default deny model if policy exists
If any NetworkPolicy is applied in a namespace → traffic not explicitly allowed is denied.
🔥 Common Mistakes
- Forgetting egress rules
- Forgetting DNS (UDP 53)
- Wrong pod labels
- Applying a policy but the CNI doesn't support it
Production issue example: App can't call external API because egress blocked.
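To avoid the DNS mistake above, here is a sketch of a policy that explicitly allows DNS egress for all pods in a namespace (the name is illustrative; adapt the `podSelector` to your workloads):

```yaml
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: allow-dns-egress    # example name
spec:
  podSelector: {}           # applies to all pods in the namespace
  policyTypes:
    - Egress
  egress:
    - ports:                # no "to" field → any destination, but only port 53
        - protocol: UDP
          port: 53
        - protocol: TCP
          port: 53
```

Pairing a rule like this with your app-specific egress rules prevents the classic "everything times out because DNS is blocked" outage.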
🔥 Question 19
Difference between Ingress and Gateway API?
✅ Ingress
- Older abstraction
- Layer 7 HTTP routing
- Requires Ingress Controller (Nginx, ALB, Traefik)
Supports:
- Host-based routing
- Path-based routing
- TLS termination
✅ Gateway API (Newer & More Powerful)
- More flexible
- Role-based separation
- Better traffic control
- Supports advanced routing
Gateway API separates:
- Gateway (infra)
- HTTPRoute (app routing)
🔥 Production Insight
Ingress still widely used.
Gateway API is future direction.
If you say:
"Gateway API gives better separation between infra and app teams"
Interviewer will be impressed.
🔥 Question 20
How does TLS termination work with an Ingress controller?
✅ Real Production Flow
1. User hits the HTTPS endpoint.
2. LoadBalancer forwards traffic to the Ingress Controller.
3. Ingress Controller:
   - Uses the TLS secret
   - Terminates TLS
   - Forwards HTTP to the backend service
🔹 TLS Secret
Stored as:
type: kubernetes.io/tls
Contains:
- tls.crt
- tls.key
🔹 Production Best Practice
Use cert-manager:
- Automatically issues certificates (Let's Encrypt)
- Auto-renewal
- Reduces manual errors
🔥 Real Production Failure Cases
- Expired certificate
- Secret missing
- Wrong host in Ingress rule
- Port mismatch
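A sketch of an Ingress with TLS termination (the host, secret name, and backend service are hypothetical):

```yaml
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: app-ingress
spec:
  tls:
    - hosts:
        - app.example.com        # example host
      secretName: app-tls        # Secret of type kubernetes.io/tls
  rules:
    - host: app.example.com      # must match the TLS host
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: app-svc    # example ClusterIP service
                port:
                  number: 80
```

A host mismatch between `tls.hosts` and `rules.host`, or a missing/expired `app-tls` Secret, reproduces the failure cases listed above.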
🔥 Question 21
Difference between resource requests and limits, and their real impact on scheduling?
✅ Real Production Answer
In Kubernetes:
- Requests → used for scheduling
- Limits → enforced at runtime
🔹 Requests
When a pod is scheduled, kube-scheduler checks:
- CPU request
- Memory request
Scheduler ensures the node has at least that much available.
If no node satisfies the request → the Pod stays Pending.
🔹 Limits
Enforced by the container runtime using cgroups.
- CPU limit → throttling
- Memory limit → OOMKill
🔥 Real Production Impact
If you don't define requests:
- Scheduler may overcommit a node
- Many pods land on the same node
- Node pressure increases
- Random OOMs later
If you don't define limits:
- One bad pod can consume entire node memory
- Node becomes unstable
Best practice: Always define both.
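A minimal container `resources` block illustrating both (the values are examples only, not recommendations):

```yaml
resources:
  requests:
    cpu: "250m"        # used by the scheduler for placement
    memory: "256Mi"
  limits:
    cpu: "500m"        # CPU above this is throttled
    memory: "512Mi"    # memory above this triggers OOMKill
```

The scheduler only looks at `requests`; `limits` matter later, at runtime, via cgroups.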
🔥 Question 22
What happens if limits are not defined?
✅ Real Production Answer
If limits not defined:
- CPU → unlimited usage (can starve others)
- Memory → can consume the entire node
- Node may enter MemoryPressure
- Kernel OOM killer may kill random pods
In worst case:
- Node crashes
- Multiple services affected
🔥 Production Insight
In shared clusters: Never allow workloads without limits.
Use:
- LimitRange
- ResourceQuota
To enforce guardrails at namespace level.
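As a sketch, a LimitRange that injects defaults for containers that omit requests/limits (the namespace and values are illustrative):

```yaml
apiVersion: v1
kind: LimitRange
metadata:
  name: default-limits
  namespace: team-a          # example namespace
spec:
  limits:
    - type: Container
      default:               # applied as limits when the container sets none
        cpu: "500m"
        memory: "512Mi"
      defaultRequest:        # applied as requests when the container sets none
        cpu: "100m"
        memory: "128Mi"
```

With this in place, a workload deployed without any `resources` block still gets sane guardrails instead of unlimited usage.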
🔥 Question 23
What is OOMKilled and how do you prevent it?
✅ Real Production Answer
OOMKilled happens when:
- Container exceeds memory limit
- Linux kernel kills it
Pod status shows:
Reason: OOMKilled
🔹 Root Causes
- Memory leak in app
- Too low memory limit
- Traffic spike
- Poor request/limit tuning
🔥 Debugging Approach
- Check pod describe
- Check previous logs:
kubectl logs pod-name --previous
- Compare usage vs limits (Prometheus/Grafana)
🔥 Prevention
- Set realistic memory requests & limits
- Use HPA
- Profile application memory
- Avoid equal request=limit unless needed
🔥 Question 24
How does HPA calculate scaling decisions?
✅ Real Production Answer
HPA works based on:
Current Metric / Target Metric
Example:
If CPU target = 60% Current average CPU = 90%
New replicas = (90 / 60) Γ current replicas
🔹 Requirements
- Metrics Server installed
- CPU requests defined
If requests are missing → HPA won't work properly.
🔹 Scaling Cycle
- HPA checks metrics periodically (default 15s)
- Calculates desired replicas
- Updates Deployment
- ReplicaSet creates new pods
🔥 Production Insight
Common issue: HPA scales up fast, scales down slowly (stabilization window).
You must tune:
- minReplicas
- maxReplicas
- scaleDown behavior
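A hedged `autoscaling/v2` HPA sketch showing these tuning knobs (names and values are examples):

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: app-hpa             # example name
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: app               # example Deployment
  minReplicas: 2
  maxReplicas: 10
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 60   # the "60%" target from the example above
  behavior:
    scaleDown:
      stabilizationWindowSeconds: 300   # why scale-down feels slow by default
```

The `behavior.scaleDown` block is where the slow-scale-down symptom is tuned.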
🔥 Question 25
Metrics Server vs Prometheus for HPA: what's the difference?
✅ Metrics Server
- Lightweight
- Provides CPU & memory metrics only
- Used by HPA
- Not long-term storage
✅ Prometheus
- Full monitoring system
- Stores historical metrics
- Custom metrics support
- Can integrate with HPA via adapter
🔥 Production Insight
Default HPA uses Metrics Server.
For advanced scaling (like requests per second): Use:
- Prometheus Adapter
- Custom metrics API
Example: Scale based on:
- Queue length
- HTTP requests/sec
- Kafka lag
That's more production-grade scaling.
🔥 Question 26
When HPA fails to scale, what are your debugging steps?
✅ Real Production Answer
If HPA is not scaling, I check systematically:
Step 1: Check HPA Status
kubectl get hpa
kubectl describe hpa <name>
Look for:
- Current metrics
- Target metrics
- Events
- Conditions
If it says:
failed to get CPU utilization
→ Metrics Server issue.
Step 2: Verify Metrics Server
kubectl get pods -n kube-system
Check that metrics-server is running.
Test:
kubectl top pods
If this fails → HPA won't work.
Step 3: Check Resource Requests
HPA calculates based on CPU requests.
If the CPU request is not defined:
- Scaling won't behave correctly.
Step 4: Check min/maxReplicas
Sometimes HPA is not scaling because:
- Already at maxReplicas
- Current replicas equal calculated replicas
Step 5: Stabilization Window
Scale down might not happen due to:
- Stabilization window (default 300s)
🔥 Production Insight
Most common causes:
- Missing CPU requests
- Metrics Server misconfigured
- Target utilization unrealistic
🔥 Question 27
Difference between HPA, VPA, and Cluster Autoscaler?
✅ HPA (Horizontal Pod Autoscaler)
- Scales number of pods
- Based on CPU/memory/custom metrics
Used for:
- Web apps
- APIs
✅ VPA (Vertical Pod Autoscaler)
- Adjusts CPU/memory requests & limits
- Does NOT scale pod count
- Often restarts pods to apply new values
Used for:
- Stateful workloads
- Apps needing tuning
⚠️ Important
Do NOT run HPA and VPA on the same resource for CPU → conflict risk.
✅ Cluster Autoscaler
- Scales nodes
- Adds/removes worker nodes
- Works when pods are Pending due to lack of resources
Flow: HPA scales pods → no space → Cluster Autoscaler adds nodes.
🔥 Production Insight
Scaling hierarchy:
- HPA tries first
- If node capacity is full → Cluster Autoscaler triggers
- Node joins → Pending pods get scheduled
🔥 Question 28
PodDisruptionBudget: real production use case?
✅ Real Production Answer
PDB ensures minimum pods stay available during voluntary disruptions.
Voluntary disruptions:
- Node drain
- Cluster upgrade
- Manual eviction
Example:
3 replicas running.
PDB:
minAvailable: 2
During a node upgrade: only 1 pod can be evicted at a time.
🔥 Why Important?
Without PDB: during a node drain → all pods might go down → outage.
🔥 Production Scenario
While upgrading EKS:
- Node draining respects PDB
- Ensures zero downtime
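The 3-replica example above, written out as a minimal PDB (the label is illustrative):

```yaml
apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: app-pdb             # example name
spec:
  minAvailable: 2           # at most 1 of 3 replicas evicted at a time
  selector:
    matchLabels:
      app: my-app           # must match the workload's pod labels
```

`kubectl drain` and managed upgrades honor this budget for voluntary evictions; it does not protect against involuntary failures like a node crash.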
🔥 Question 29
Taints & tolerations: when have you used them?
✅ Real Production Answer
Taints repel pods.
Tolerations allow pods to run on tainted nodes.
Real Use Cases
1. Dedicated GPU nodes
   - Taint GPU nodes
   - Only ML workloads tolerate
2. Infra nodes
   - Taint monitoring/logging nodes
   - Prevent regular apps from scheduling
3. Spot instances
   - Taint spot nodes
   - Only fault-tolerant workloads run there
🔥 Production Insight
If a pod is Pending with:
node(s) had taint that pod didn't tolerate
Add a toleration in the pod spec.
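As an illustration for the GPU case (the key, value, and node name are hypothetical):

```yaml
# Node tainted with, e.g.: kubectl taint nodes gpu-node-1 gpu=true:NoSchedule
# Matching toleration in the pod spec:
tolerations:
  - key: "gpu"
    operator: "Equal"
    value: "true"
    effect: "NoSchedule"
```

A toleration only permits scheduling on the tainted node; pair it with node affinity or a nodeSelector if the pod must also land there.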
🔥 Question 30
Node affinity vs Pod affinity vs Anti-affinity: real scenario usage?
✅ Node Affinity
Controls which nodes pod can schedule on.
Example: Schedule only on:
- SSD nodes
- GPU nodes
- Specific AZ
✅ Pod Affinity
Schedule pod close to another pod.
Example: App + cache in same zone for latency reduction.
✅ Pod Anti-Affinity
Ensure pods are NOT on the same node.
Example: 3 replicas of an API → spread across 3 nodes.
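A podAntiAffinity sketch for that API example (the label is illustrative):

```yaml
affinity:
  podAntiAffinity:
    requiredDuringSchedulingIgnoredDuringExecution:
      - labelSelector:
          matchLabels:
            app: api                          # example label on the API pods
        topologyKey: kubernetes.io/hostname   # "same node" boundary
```

With `required...` rules, a 4th replica stays Pending if only 3 nodes exist; `preferredDuringSchedulingIgnoredDuringExecution` is the softer alternative.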
🔥 Question 31
Rolling update: what parameters control its behavior?
✅ Real Production Answer
Rolling update is the default strategy in a Deployment.
Controlled by:
strategy:
  type: RollingUpdate
  rollingUpdate:
    maxUnavailable: 1
    maxSurge: 1
🔹 maxUnavailable
How many pods can be unavailable during update.
Example: Replicas = 4 maxUnavailable = 1
At least 3 pods always running.
🔹 maxSurge
How many extra pods can be created above desired replicas.
Example: Replicas = 4 maxSurge = 1
Kubernetes can temporarily run 5 pods.
🔥 Production Insight
For high-traffic apps:
- maxUnavailable = 0
- maxSurge = 1 or 25%
Ensures zero downtime.
🔥 Question 32
How do maxUnavailable and maxSurge affect a rollout?
✅ Real Production Example
Replicas = 10 maxUnavailable = 2 maxSurge = 3
During rollout:
- Up to 3 new pods created
- Up to 2 old pods taken down
So total pods can go up to 13 temporarily.
🔥 Impact
If maxUnavailable is too high → risk of downtime.
If maxSurge is too high → resource pressure.
Production Balance
Low-traffic app → aggressive rollout OK. Critical production → conservative rollout.
🔥 Question 33
How to implement Blue-Green deployment in Kubernetes?
✅ Real Production Answer
Blue-Green = two identical environments.
Approach:
1. Deploy:
   - deployment-blue
   - deployment-green
2. Service points to one of them.
3. Switch by changing:
   - Service selector OR
   - Ingress route
Flow:
Current: Service → blue
Deploy green → test green → switch service selector to green → remove blue later.
🔥 Production Insight
Benefits:
- Instant rollback (just switch back)
- Safe for big schema changes
Downside:
- Double resource usage
🔥 Question 34
How to implement Canary deployment in Kubernetes?
✅ Real Production Answer
Canary = gradual traffic shift.
Basic Method (Simple)
Deploy:
- app-v1 (stable)
- app-v2 (canary, fewer replicas)
Traffic automatically distributed by Service.
Example: 10 replicas of v1, 1 replica of v2 → roughly 10% of traffic to v2.
Advanced Method (Ingress Based)
Using:
- NGINX Ingress annotations
- Istio / service mesh
Example: route
- 90% → v1
- 10% → v2
Gradually increase.
🔥 Production Insight
True canary requires:
- Monitoring
- Automated rollback
- Metrics comparison
Without metrics → it's a blind rollout.
🔥 Question 35
How to rollback a bad deployment safely?
✅ Real Production Answer
First, check rollout status:
kubectl rollout status deployment app
If broken:
kubectl rollout undo deployment app
This restores the previous ReplicaSet.
🔥 What Actually Happens?
The Deployment scales down the new ReplicaSet and scales up the old one.
Production-Level Rollback Strategy
Better approach:
- Use readiness probes properly
- Monitor error rate
- Use automated rollback (Argo Rollouts / Flagger)
🔥 Interview Upgrade Answer
Mention:
1. Don't rely only on manual rollback
2. Monitor:
   - HTTP 5xx
   - Latency
   - CPU spike
3. Use progressive delivery tools
That signals maturity.
🔥 Question 36
How does the readiness probe affect a rollout?
✅ Real Production Answer
Readiness probe determines whether a pod is ready to receive traffic.
During rollout:
- New pod is created
- Kubernetes waits until readiness probe passes
- Only then does it send traffic
- Only then old pod is terminated (based on rollout strategy)
🔥 What If Readiness Probe Fails?
- Pod remains NotReady
- Service does NOT route traffic to it
- Rollout may get stuck
If maxUnavailable = 0 and new pods never become Ready → the rollout blocks completely.
🔥 Production Insight
Bad readiness configuration can cause:
- Stuck deployment
- Traffic imbalance
- Partial outages
Best practice: Readiness should check:
- App health
- DB connectivity (if critical)
- Dependencies ready
🔥 Question 37
Liveness vs Readiness vs Startup probe: failure impact?
✅ Liveness Probe
Checks:
Should this container be restarted?
If liveness fails:
- Container restarted
Used to detect:
- Deadlocks
- Stuck processes
✅ Readiness Probe
Checks:
Should this pod receive traffic?
If fails:
- Traffic stops
- Pod not restarted
✅ Startup Probe
Used for:
- Slow starting apps
Disables liveness until startup passes.
🔥 Production Mistake
Common error: using a liveness probe for a DB connection check.
Result: temporary DB issue → pod restarts continuously → worse outage.
Correct approach:
- DB check in readiness, not liveness.
🔥 Question 38
How to achieve zero downtime deployment?
✅ Real Production Strategy
1. Use RollingUpdate
   - maxUnavailable: 0
   - maxSurge: 1
2. Proper readiness probe
3. Multiple replicas
4. PodDisruptionBudget
5. Graceful shutdown handling in the app
🔥 Critical Element
The app must handle:
- SIGTERM signal
- Stop accepting traffic
- Finish ongoing requests
- Exit cleanly
If the app ignores SIGTERM → rolling updates cause dropped requests.
Production Add-ons
- Use preStop hook
- Increase terminationGracePeriodSeconds
🔥 Question 39
What breaks zero downtime deploys most often?
✅ Real Production Failures
- Single replica app
- No readiness probe
- DB migration blocking
- App not handling SIGTERM
- Wrong resource limits causing crash
- HPA scaling too slow
- Sticky sessions not handled
🔥 Real Example
Deploy a new version. Readiness passes. But the new version has a memory leak. After the traffic shift → OOMKilled → outage.
Lesson: Deployment success ≠ production success.
Monitoring is mandatory.
🔥 Question 40
How do you manage config changes without rebuilding the image?
✅ Real Production Answer
Use:
- ConfigMap (non-sensitive config)
- Secret (sensitive data)
Mounted as:
- Environment variables
- Files
🔹 Config Update Without Rebuild
Update the ConfigMap:
kubectl apply -f config.yaml
But important:
Pods DO NOT auto-restart.
Options:
- Manually restart deployment
- Use hash annotation in Deployment
- Use Reloader controller
- Use Helm upgrade
🔥 Production Insight
For zero downtime config update:
- Update ConfigMap
- Rolling restart deployment
Never bake config into image in production.
🔥 Question 41
PV vs PVC vs StorageClass: full lifecycle explanation
✅ Real Production Answer
🔹 PersistentVolume (PV)
- Actual storage resource
- Could be:
  - EBS
  - NFS
  - EFS
  - Ceph
  - Local disk
Cluster-level object.
🔹 PersistentVolumeClaim (PVC)
- Request for storage by a pod
- Namespace-level object
- Specifies:
  - Size
  - Access mode
  - StorageClass
🔹 StorageClass
Defines:
- Provisioner
- Parameters
- Reclaim policy
- Volume binding mode
Used for dynamic provisioning.
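A StorageClass sketch for dynamic EBS provisioning (the name and parameters are examples; this assumes the AWS EBS CSI driver is installed):

```yaml
apiVersion: storage.k8s.io/v1
kind: StorageClass
metadata:
  name: gp3-ssd                     # example name
provisioner: ebs.csi.aws.com        # AWS EBS CSI driver
parameters:
  type: gp3                         # example volume type
reclaimPolicy: Delete               # or Retain, to keep data after PVC deletion
volumeBindingMode: WaitForFirstConsumer
```

A PVC that sets `storageClassName: gp3-ssd` triggers volume creation through this class.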
🔄 Full Lifecycle (Dynamic Provisioning Example)
1. Pod creates a PVC.
2. PVC references a StorageClass.
3. The StorageClass provisioner creates the actual volume (like EBS).
4. PV is created and bound to the PVC.
5. Pod mounts the PVC.
6. Pod writes data.
7. If the PVC is deleted, the reclaim policy decides:
   - Delete
   - Retain
🔥 Production Insight
Always check:
kubectl get pvc
kubectl describe pvc
If a PVC is stuck in Pending → StorageClass issue.
🔥 Question 42
Static vs Dynamic provisioning
✅ Static Provisioning
Admin manually creates PV. PVC binds to matching PV.
Used when:
- Pre-existing storage
- Special compliance cases
Hard to scale.
✅ Dynamic Provisioning
Most common.
PVC → StorageClass → auto-create volume.
Example in AWS:
- PVC triggers EBS creation.
🔥 Production Best Practice
Always prefer dynamic provisioning unless special need.
Reduces manual mistakes.
🔥 Question 43
How does volume binding work?
✅ Real Production Answer
Binding process:
1. PVC is created.
2. Kubernetes searches for:
   - A matching PV, OR
   - Uses the StorageClass to provision a new PV.
Matching based on:
- Access mode
- Storage size
- StorageClass name
Once matched: PVC status → Bound
🔥 VolumeBindingMode
Important field in StorageClass:
volumeBindingMode: WaitForFirstConsumer
This delays volume creation until the pod is scheduled.
Why important?
For:
- Multi-AZ clusters
- Ensures volume created in same zone as pod
Without this: the volume may be created in the wrong AZ → scheduling failure.
🔥 Question 44
When a PVC stays Pending: root causes?
✅ Real Production Debug Flow
If PVC Pending:
Check:
kubectl describe pvc <name>
Common causes:
- No matching StorageClass
- Wrong StorageClass name
- Insufficient quota
- Provisioner not running
- VolumeBindingMode conflict
- Cloud permission issue (IAM)
🔥 Real Production Case
In AWS:
EBS provisioner fails because:
- Worker node IAM role missing permission
- Subnet not tagged properly
PVC remains Pending.
🔥 Question 45
Stateful app storage best practices
✅ Real Production Best Practices
- Use StatefulSet
- Use dynamic provisioning
- Use WaitForFirstConsumer
- Ensure backups enabled
- Avoid deleting PVC blindly
- Use appropriate access mode
🔥 Important
For database:
- One PVC per replica
- Never share RWO volume across pods
- Always test restore process
🔥 Production Risk
Deleting StatefulSet does NOT delete PVC by default.
Good: Prevents accidental data loss.
Bad: Leftover storage cost if not cleaned.
🔥 Question 46
RWX vs RWO: production implications?
✅ RWO (ReadWriteOnce)
- Volume can be mounted by one node at a time
- Most common (EBS in AWS)
- Safe for databases
Example: MySQL pod using EBS volume β RWO.
✅ RWX (ReadWriteMany)
- Volume can be mounted by multiple nodes simultaneously
- Requires a shared filesystem:
  - EFS
  - NFS
  - CephFS
Used for:
- Shared content
- File uploads
- ML shared datasets
🔥 Production Implications
RWO:
- Better performance
- Lower complexity
- Zone-bound
RWX:
- More flexible
- Higher latency (network FS)
- Needs careful permission handling
⚠️ Common Mistake
Trying to use EBS (RWO) with multiple replicas → fails.
Know your backend storage limitations.
🔥 Question 47
How do you design RBAC with least privilege?
✅ Real Production Answer
RBAC has:
- Role / ClusterRole
- RoleBinding / ClusterRoleBinding
- ServiceAccount
Principle: Grant only required permissions.
🔹 Example
If the app only needs to:
- Read ConfigMaps
Create a Role with:
verbs: ["get", "list"]
resources: ["configmaps"]
Bind it to a ServiceAccount.
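That example written out as complete manifests (the namespace and ServiceAccount name are hypothetical):

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: configmap-reader
  namespace: app-ns            # example namespace
rules:
  - apiGroups: [""]            # "" = core API group
    resources: ["configmaps"]
    verbs: ["get", "list"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: configmap-reader-binding
  namespace: app-ns
subjects:
  - kind: ServiceAccount
    name: app-sa               # example ServiceAccount used by the pod
    namespace: app-ns
roleRef:
  kind: Role
  name: configmap-reader
  apiGroup: rbac.authorization.k8s.io
```

The pod then sets `serviceAccountName: app-sa` and can read ConfigMaps in `app-ns` and nothing else.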
🔥 Production Best Practices
- Never use cluster-admin for apps
- Separate infra vs app roles
- Audit API server logs
- Use namespace isolation
🔥 Red Flag in Interview
If someone says:
"I give cluster-admin to simplify things."
That's a security risk.
🔥 Question 48
Difference between Role and ClusterRole?
✅ Role
- Namespace-scoped
- Limited to one namespace
Used for:
- App-specific permissions
✅ ClusterRole
- Cluster-wide
- Can:
  - Access all namespaces
  - Access non-namespaced resources (nodes, PV)
🔥 Important
A ClusterRole can still be bound within a single namespace using a RoleBinding.
Production Use Case
Monitoring tool: needs to read pods in all namespaces → ClusterRole.
App: needs access only in its namespace → Role.
🔥 Question 49
How is a ServiceAccount used by pods?
✅ Real Production Answer
Every pod runs with a ServiceAccount.
If not specified → the default ServiceAccount is used.
🔹 What It Does
- Provides identity to pod
- Used for API access
- Mounts token inside pod
Token location:
/var/run/secrets/kubernetes.io/serviceaccount/
🔥 Production Best Practice
- Create custom ServiceAccount per app
- Attach minimal RBAC
- Disable auto-mount token if not needed
🔥 In Cloud (Example: EKS)
A ServiceAccount can be linked with an IAM role (IRSA).
Pod → IAM role → AWS API, securely.
Very important for production AWS setups.
🔥 Question 50
How are secrets stored, and why is base64 not encryption?
✅ Real Production Answer
By default:
Secrets are stored in etcd base64-encoded.
Base64 ≠ encryption.
Anyone with etcd access can decode them.
🔥 Secure Production Setup
Enable:
Encryption at Rest
Using:
- KMS provider
- EncryptionConfiguration
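A minimal EncryptionConfiguration sketch for encrypting Secrets at rest (the key is a placeholder; a KMS provider is the stronger production option):

```yaml
apiVersion: apiserver.config.k8s.io/v1
kind: EncryptionConfiguration
resources:
  - resources:
      - secrets
    providers:
      - aescbc:
          keys:
            - name: key1
              secret: <base64-encoded-32-byte-key>   # placeholder, never commit real keys
      - identity: {}     # fallback so previously unencrypted data stays readable
```

This file is passed to kube-apiserver via `--encryption-provider-config`; existing Secrets are only re-encrypted after they are rewritten.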
🔥 Best Practices
1. Never commit secrets to Git
2. Use external secret managers:
   - AWS Secrets Manager
   - HashiCorp Vault
3. Use Sealed Secrets or the External Secrets Operator
🔥 Interview Upgrade Answer
If you mention:
- etcd encryption
- KMS integration
- IRSA
- Secret rotation strategy
You're signaling production maturity.