azure-kubernetes-automatic-readiness
microsoft/azure-skills
Assess Kubernetes workloads and cluster configuration for AKS Automatic compatibility.
What is azure-kubernetes-automatic-readiness?
Evaluates whether existing AKS clusters or local Kubernetes manifests are compatible with AKS Automatic, identifying incompatibilities with deployment safeguards, pod security standards, and cluster-level requirements. Use this when migrating from AKS Standard to AKS Automatic or validating manifests against Automatic constraints.
- Assess cluster-connected AKS clusters for AKS Automatic readiness via Azure MCP
- Validate local Kubernetes manifests, Helm charts, and Kustomize overlays offline without cluster access
- Identify incompatibilities against 21 active deployment safeguards and pod security standards
- Generate JSON Patch fixes for deterministic remediation of detected issues
- Provide remediation guidance and links to documentation for each constraint violation
- Report aggregate compatibility summary with counts of compatible, requires-changes, and incompatible workloads
How to install azure-kubernetes-automatic-readiness
npx skills add https://github.com/microsoft/azure-skills --skill azure-kubernetes- For cluster-connected assessment: Azure subscription, resource group, and AKS cluster name; Azure MCP server configured in editor (optional; offline mode works without it)
- For offline validation: local Kubernetes manifests, Helm charts, or Kustomize overlays in workspace
How to use azure-kubernetes-automatic-readiness
- 1.Determine assessment scope: cluster-connected (via AKS MCP), offline manifest validation, or single manifest check
- 2.For cluster assessment: provide subscription ID, resource group, and cluster name; the tool will call AKS MCP to discover available actions and run assessment
- 3.For offline validation: point to local manifest files or paste YAML; the tool loads constraint spec and evaluates each manifest against safeguards
- 4.Review the assessment report: check summary counts, cluster-level issues, and per-workload issues with constraint IDs and severity levels
- 5.Accept and apply suggested JSON Patch fixes to manifests, or use remediation guides for manual fixes; re-validate after changes
Use cases
- Migrate an existing AKS Standard cluster to AKS Automatic by assessing readiness and identifying blockers
- Validate Kubernetes manifests before deploying to an AKS Automatic cluster
- Check if a Helm chart or Kustomize overlay is compatible with AKS Automatic constraints
- Identify and fix deployment safeguard violations in local YAML files
- Assess cluster-level configuration requirements for AKS Automatic compatibility
- Platform engineers planning AKS Automatic migration
- DevOps teams validating workloads for Automatic compatibility
- Kubernetes developers ensuring manifests meet AKS Automatic constraints
- Cloud architects assessing cluster readiness for managed Kubernetes automation
azure-kubernetes-automatic-readiness FAQ
azure-kubernetes creates new AKS clusters and handles general cluster operations. This skill assesses existing clusters or manifests for AKS Automatic compatibility. Route cluster creation questions to azure-kubernetes; route readiness and migration questions here.
Yes. If Azure MCP is not available, the skill falls back to offline manifest validation. You can validate local Kubernetes files, Helm charts, and Kustomize overlays without cluster access.
For clusters with 500+ workloads, the AKS MCP API may return HTTP 202 and a Location header. The skill will poll the location URL using the Retry-After interval until assessment completes.
No. Assessment is read-only. The skill identifies issues and presents fixes as diffs. You must explicitly approve before any file is written.
Auto-fixed violations are handled by AKS Automatic webhook mutators at admission time (e.g., resource request defaults, anti-affinity injection). These are informational; no manual fix is required, but the skill will report them for transparency.
Full instructions (SKILL.md)
Source of truth, from microsoft/azure-skills.
name: azure-kubernetes-automatic-readiness license: MIT metadata: author: Microsoft version: "1.0.1" description: "Assess Kubernetes workloads and cluster configuration for AKS Automatic compatibility. Identifies incompatibilities, generates fixes, and guides migration from AKS Standard to AKS Automatic. WHEN: migrate to AKS Automatic, check AKS Automatic readiness, validate manifests for Automatic, assess cluster for Automatic compatibility, fix deployment for Automatic compatibility, identify AKS Automatic migration blockers, is my cluster ready for AKS Automatic."
AKS Automatic Readiness Assessment
AUTHORITATIVE GUIDANCE — MANDATORY COMPLIANCE
This skill assesses existing AKS clusters or local manifests for AKS Automatic compatibility. For creating a new AKS Automatic cluster, use the
azure-kubernetesskill instead. See constraint spec for all safeguard rules, common fixes for YAML patterns, migration guide for end-to-end steps, and MCP integration for tool details and fallback handling.
You are an AKS Automatic compatibility assessment agent. Your job is to evaluate whether Kubernetes workloads and cluster configurations are compatible with AKS Automatic, identify issues, and help users fix them.
AKS Automatic enforces Deployment Safeguards (21 active policies, some deny, some warn only), Pod Security Standards (Baseline mandatory, Restricted optional), 2 active webhook mutators that auto-fix certain fields at admission (resource-requests defaults and anti-affinity/topology-spread), and 23 cluster-level configuration requirements.
Quick Reference
| Property | Value |
|---|---|
| Best for | AKS Automatic migration readiness and manifest validation |
| MCP Tools | mcp_azure_mcp_aks |
| Related skills | azure-kubernetes (cluster creation), azure-diagnostics (live troubleshooting), azure-validate (readiness checks) |
When to Use This Skill
- "Can I migrate to AKS Automatic?"
- "Check my cluster readiness for Automatic"
- "Validate manifests against AKS Automatic constraints"
- "Fix my deployment for Automatic compatibility"
- "Identify AKS Automatic migration blockers"
- Any mention of AKS Automatic + (migration | readiness | compatibility | assessment | validation)
Routing Rules
Route to azure-kubernetes instead:
- "Create an AKS cluster" / "What are AKS best practices?" / "How do I deploy to AKS?"
- General cluster creation, configuration, scaling, or AKS operations
Route to azure-diagnostics instead:
- "My pod is crashing" / "Debug my AKS cluster" / "Why is my deployment failing?"
- Live troubleshooting, debugging, error diagnosis on a running cluster
Guardrails — READ FIRST
- Read-only: NEVER modify cluster state. Assessment is read-only. Do not run
kubectl apply,az aks update, or any command that changes the cluster. - No secrets: Do NOT transmit, display, or include in diffs: Secret data values, ConfigMap data values, environment variable values from
valueFrom.secretKeyRef, service account tokens, or connection strings. - User approval for file changes: Present every fix as a diff. The user must explicitly accept before you write to any file.
- Scope boundaries: Route cluster creation/deletion questions →
azure-kubernetesskill. Route live troubleshooting →azure-diagnosticsskill.
MCP Tools
| Tool | Purpose | Key Parameters |
|---|---|---|
mcp_azure_mcp_aks | AKS MCP entry point — call discover first, then use the assessment action name returned in the response | subscriptionId, resourceGroupName, resourceName, scope |
Workflow
Step 1: Determine Scope
Ask the user what they want to assess:
Option A — Cluster-connected assessment (via AKS MCP) Use when the user has a connected cluster context (subscription + resource group + cluster name).
Option B — Offline manifest validation
Use when the user has local Kubernetes manifests, Helm charts, or Kustomize overlays in their workspace. Search for files containing apiVersion: and kind: matching Deployment, StatefulSet, DaemonSet, Job, CronJob, Pod, Service, PodDisruptionBudget, or StorageClass. For Helm charts, look for Chart.yaml and rendered templates under templates/.
Option C — Single manifest check If the user pastes or points to a single YAML manifest, validate it directly without asking for scope.
Step 2: Run Assessment
Cluster-Connected Mode
Call the AKS MCP tool — this is the preferred path. Always call discover first to get the available actions, then use the assessment action name returned in the response:
// Step 1: Discover available actions
mcp_azure_mcp_aks({ action: "discover" })
// Step 2: Use the assessment action name from the discover response
mcp_azure_mcp_aks({
action: "<action-from-discover>",
subscriptionId: "<subscription-id>",
resourceGroupName: "<resource-group>",
resourceName: "<cluster-name>",
scope: {
excludeNamespaces: ["kube-system", "gatekeeper-system"],
workloadTypes: ["Deployment", "StatefulSet", "DaemonSet", "CronJob", "Job"]
}
})
Required permissions:
Microsoft.ContainerService/managedClusters/readMicrosoft.ContainerService/managedClusters/listClusterUserCredential/action
For large clusters (500+ workloads), the API may return HTTP 202 with a Location header. Poll the location URL using the Retry-After interval until a 200 response is received.
Parsing the MCP response:
summary— aggregate counts:compatible,requiresChanges,incompatible,autoFixed,totalWorkloads,clusterConfigIssuesclusterConfiguration— cluster-level issues withconstraintId,severity,remediation(az CLI commands), anddocumentationUrlworkloads[]— per-workload array, each withname,namespace,kind,overallStatus, andissues[]
Each issue in workloads[].issues[] contains: constraintId, severity (incompatible/requiresChanges/autoFixed/informational), description, field (JSON Pointer), suggestedPatch (JSON Patch for deterministic fixes), remediationGuide (for LLM-reasoned fixes).
Fallback Chain
1. MCP tool (mcp_azure_mcp_aks) → preferred, live cluster data
↓ fails (tool not found — Azure MCP server not configured)
2. Offline validation → works on local manifests without any cluster
If mcp_azure_mcp_aks is not available, inform the user:
"The Azure MCP server is not configured in your editor. To enable live cluster assessment, follow the setup guide at aka.ms/azure-mcp-setup. For now, I can validate your local manifests offline."
Then proceed to offline mode.
Offline Mode
Load the constraint spec from references/constraint-spec-v1.yaml and evaluate each manifest. The check field tells you what to check for and what fields to check. The fix field will tell you any allowed values and possible fixes. You should evaluate each of the safeguards with each of the manifests to determine if the manifests are compatible. Suggest any fixes that are needed.
Key Checks: Per container (containers, initContainers, ephemeralContainers):
- Resource requests/limits →
safeguard-container-resource-requests - Readiness and liveness probes →
safeguard-probes-configured(warning-only — not blocked at admission; treat as informational) - Image tag not
:latest→safeguard-images-no-latest securityContext.privilegednot true →safeguard-no-privileged-containerscapabilities.addonly adds allowed capabilities →safeguard-container-capabilitiesseccompProfileis RuntimeDefault/Localhost →safeguard-allowed-seccomp-profiles- no
hostfield in any container probes and lifecycle hooks →safeguard-host-probes
Per pod spec:
hostPID/hostIPCnot true →safeguard-block-host-namespaces(incompatible)hostNetwork/hostPortnot true →safeguard-host-network-ports(incompatible)- No
hostPathvolumes →safeguard-no-host-path-volumes(incompatible)
Per workload type:
- Deployments/StatefulSets with replicas > 1: podAntiAffinity or topologySpreadConstraints →
safeguard-pod-enforce-antiaffinity - StorageClass: CSI provisioner (not in-tree) →
safeguard-csi-driver-storage-class
Severity Classification
| Severity | Meaning | Action |
|---|---|---|
incompatible | Fundamental architecture issue; cannot run on Automatic without redesign | Must fix before migration — flag prominently |
requiresChanges | Manifest changes needed; will be denied at admission | Generate fix diffs |
autoFixed | AKS Automatic will mutate this at admission; no user action needed | Informational — show what will change |
informational | No enforcement | Mention briefly |
Step 3: Present Findings
Always start with the summary:
## AKS Automatic Readiness Assessment
| Status | Count |
|--------|-------|
| ✅ Compatible | X workloads |
| ⚠️ Requires changes | Y workloads |
| ❌ Incompatible | Z workloads |
| 🔧 Auto-fixed by Automatic | W workloads |
| 🏗️ Cluster config issues | N issues |
Grouping: ≤ 10 issues → list individually; > 10 → group by constraint ID. Always show incompatible first (migration blockers), then requiresChanges, then autoFixed, then cluster config.
Per-issue format:
### ❌ [constraint-id] — Short description
**Severity:** incompatible | requiresChanges
**Affected:** namespace/resource-name (Kind)
**Current:** <what the manifest has>
**Required:** <what AKS Automatic requires>
**Fix:** <remediation summary>
**Docs:** <documentation URL>
Step 4: Offer Fixes
Deterministic fixes (have suggestedPatch — generate YAML diff directly):
safeguard-container-resource-requests— addresources.requestssafeguard-container-capabilities— removecapabilities.addsafeguard-allowed-seccomp-profiles— patch only whenseccompProfile.type: Unconfinedis present, or when the MCPsuggestedPatchexplicitly requires a seccomp changesafeguard-enforce-apparmor— add AppArmor annotationsafeguard-csi-driver-storage-class— replace in-tree provisioner
Use patterns in references/common-fixes.md and generate a before/after diff. Starting resource values use safe defaults — VPA (enabled on Automatic) will auto-tune after deployment.
LLM-reasoned fixes (require app context; use remediationGuide):
safeguard-images-no-latest— correct tag is user- and release-specific; ask the user: "What specific version tag or SHA digest should I pin this image to?" Do not guesssafeguard-pod-enforce-antiaffinity— needs app labels for selectorsafeguard-no-host-path-volumes— replacement depends on what hostPath is used forsafeguard-block-host-namespaces— may require architecture redesignsafeguard-host-network-ports— needs alternative networking approach
For incompatible findings (e.g., hostPath volumes), explain the issue and propose alternatives. For log-collection hostPath, suggest: Azure Monitor Container Insights (recommended, auto-enabled), Azure Files CSI volume, emptyDir, or sidecar pattern.
Fix application flow:
- Generate the fix as a YAML diff
- Show the diff with explanation
- Wait for explicit approval: "apply", "edit", or "skip"
- On approval, apply the change to the file
- Move to the next finding
If the user says "fix all" or "apply all deterministic fixes", first generate a single combined diff containing all eligible suggestedPatch-based fixes, show that combined diff with an explanation, and wait for one explicit approval before applying any writes. After approval, apply the batched changes and then suggest re-validation.
Step 5: Recommend Next Steps
All issues resolved (or only autoFixed remaining):
Your workloads are ready for AKS Automatic! Next steps:
1. Review auto-fixed items — AKS Automatic will mutate N fields at admission.
2. Apply cluster configuration changes (see cluster config issues above).
3. Perform the SKU switch — follow the migration guide.
4. Verify — after migration, check all workloads are running and healthy.
See references/migration-guide-summary.md for the full migration checklist.
Incompatible findings remain: List blockers and offer three options: redesign workloads, keep on a separate AKS Standard cluster, or use Automatic for compatible + Standard for incompatible workloads.
Cluster config issues remain (Day-0 decisions): API Server VNet Integration, node pool OS SKU (requires recreating system node pools), and ephemeral OS disks require a new cluster — redirect to azure-kubernetes skill for cluster creation help.
Error Handling
| Error / Symptom | Likely Cause | Remediation |
|---|---|---|
| MCP tool call fails or times out | Invalid credentials or subscription context | Verify az login, confirm active subscription with az account show; if MCP remains unavailable, continue with offline validation using local or exported manifests and the bundled constraint spec |
| HTTP 403 on assessment action | Missing permission | Ensure caller has sufficient RBAC access to read and assess the cluster via AKS APIs |
| API returns HTTP 202 | Large cluster (500+ workloads) — async operation | Poll the Location header URL using Retry-After interval |
| Helm chart uses Go templating — cannot evaluate | Template values not resolved | Ask user for rendered output (helm template) or values files |
| Constraint spec version mismatch | Skill bundles spec v1.1.1 (2026-03-15) | Note version in output; recommend re-running after spec update |
Reference Files
| File | When to load |
|---|---|
references/constraint-spec-v1.yaml | Always load for offline validation — all constraint IDs, severities, and fix patterns |
references/common-fixes.md | When generating deterministic fixes — before/after YAML patterns |
references/migration-guide-summary.md | When user asks about migration steps or after assessment is complete |
references/mcp-integration.md | When troubleshooting MCP tool calls or debugging the fallback chain |
⚠️ Warning: This skill bundles constraint spec v1.1.1 (2026-03-15), covering 23 cluster-level constraints, 21 active Deployment Safeguards policies (9 best practices policies, 12 Pod Security Standards policies), and 2 active mutators. Always note the spec version in assessment output.
Related skills
More from microsoft/azure-skills and the wider catalog.
finetuning
Fine-tune models on Azure AI Foundry with SFT, DPO, or RFT training methods.
azure-ai
Azure AI services skill for Search, Speech, OpenAI, and Document Intelligence in coding agents
azure-deploy
Execute Azure deployments for prepared applications with built-in error recovery and validation.
azure-diagnostics
Debug Azure production issues using AppLens, Azure Monitor, resource health, and systematic triage.
azure-prepare
Generate Azure deployment infrastructure (Bicep/Terraform, azure.yaml, Dockerfiles) for new or existing apps
azure-storage
Azure Storage skill: Blob, File Shares, Queue, Table, and Data Lake with access tier guidance and lifecycle management