Engineering Advanced KubernetesHelmContainer Orchestration

Kubernetes Specialist

Kubernetes operations: deployments, services, ingress, Helm charts, operators, RBAC, resource limits, debugging pods, and cluster management for production.

Production Kubernetes expertise that goes beyond YAML writing. This skill covers resource sizing, pod scheduling, network policies, RBAC design, Helm chart patterns, and systematic pod debugging. It knows the common failure modes — OOMKilled, ImagePullBackOff, CrashLoopBackOff — and the exact kubectl commands to diagnose them fast.

Added Mar 20, 2026

$npx skills add johnefemer/skillfish --skill kubernetes-specialist

What This Skill Can Do

Concrete capabilities you get when you install this skill.

Design Deployment, StatefulSet, and DaemonSet configurations for production workloads

Debug pod failures with systematic kubectl diagnosis and log analysis

Create and maintain Helm charts with values templating and lifecycle hooks

Configure RBAC roles, ClusterRoles, and service accounts for least-privilege access

Tune resource requests and limits to prevent OOMKill and CPU throttling

Set up Ingress controllers, TLS termination, and network policies

Real Conversations

See how the Kubernetes Specialist skill responds to actual requests. These are representative examples of what your AI agent will produce.

You

My pod keeps crashing with CrashLoopBackOff. How do I diagnose what's wrong?

Agent

CrashLoopBackOff means the container is starting, crashing, and Kubernetes is waiting before restarting. Systematic diagnosis:

Step 1 — Get the exit code and reason

kubectl describe pod <pod-name> -n <namespace>
# Look for: Last State, Exit Code, Reason
# Exit Code 137 = OOMKilled (memory limit hit)
# Exit Code 1 = app crashed (check logs)
# Exit Code 126/127 = command not found / not executable

Step 2 — Get logs (current + previous container)

kubectl logs <pod-name> -n <namespace> --previous
# --previous gets logs from the crashed container, not the new one

Step 3 — Common causes and fixes

Exit Code Cause Fix
137 OOMKilled Increase resources.limits.memory
1 App error Check logs for stack trace
0 Process exits immediately Add command: ["sleep", "infinity"] to debug

Step 4 — Debug interactively if logs are empty

# Override the entrypoint to prevent crash
kubectl debug pod/<pod-name> -it --image=busybox --copy-to=debug-pod

Share the describe output and --previous logs and I'll pinpoint the root cause.

Get Started

How to install and use this skill in your preferred environment.

Skills are designed for AI coding agents (Claude Code, Cursor, Windsurf) and IDE-based workflows where the agent can read files, run scripts, and act on your codebase.

Models & Context

Which AI models and context windows work best with this skill.

Recommended Models

Claude Sonnet or GPT-4o recommended. Kubernetes YAML generation and debugging is reliable across most frontier models; operator development benefits from stronger models.

Context Window

SKILL.md is ~9KB. Include kubectl describe and log output directly in context for effective debugging sessions.

Pro tips for best results

1

Be specific

Include numbers — users, budget, RPS — so the skill can size the architecture.

2

Share constraints

Compliance needs, team size, and existing stack all improve the output.

3

Iterate

Start with a high-level design, then ask follow-ups for IaC, cost analysis, or security review.

4

Combine skills

Pair with companion skills below for end-to-end coverage.

Works Great With

These skills complement Kubernetes Specialist for end-to-end coverage. Install them together for better results.

$ skillfish add johnefemer/skillfish --all # install all skills at once

Ready to try Kubernetes Specialist?

Install the skill and start getting expert-level guidance in your workflow — any agent, any IDE.

$npx skills add johnefemer/skillfish --skill kubernetes-specialist
← Browse all skills