Three related fixes in the k8s deployer restart/up flow: 1. Clear stale claimRefs on Released PVs (_clear_released_pv_claim_refs): After namespace deletion, PVs survive in Released state with claimRefs pointing to deleted PVC UIDs. New PVCs can't bind until the stale claimRef is removed. Now clears them before PVC creation. 2. Wait for namespace termination (_wait_for_namespace_deletion): _ensure_namespace() now detects a terminating namespace and polls until deletion completes (up to 120s) before creating the new one. Replaces the racy 5s sleep in deployment restart. 3. Resilient PVC creation: wrap each PVC creation in error handling so one failure doesn't prevent subsequent PVCs from being attempted. All errors are collected and reported together. Closes: bar-6cb, bar-31a, bar-fec Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| helm | ||
| __init__.py | ||
| cluster_info.py | ||
| deploy_k8s.py | ||
| helpers.py | ||
| k8s_command.py | ||