From a7168bb25b6eae5d7d70259320877ae48e349f94 Mon Sep 17 00:00:00 2001 From: jester Date: Thu, 16 Apr 2026 18:57:48 +0000 Subject: [PATCH] Mark pre-restore checkpoint complete, update backup outstanding items - Apr 16 2026 --- OPEN_THREADS.md | 24 ++++++++++++++++++++---- 1 file changed, 20 insertions(+), 4 deletions(-) diff --git a/OPEN_THREADS.md b/OPEN_THREADS.md index d97d24a..17072cf 100644 --- a/OPEN_THREADS.md +++ b/OPEN_THREADS.md @@ -68,9 +68,17 @@ Completed: - local backup delete endpoint added - live backup uses `save-all flush` -> `save-off` -> archive -> `save-on` - restore stops server, waits for exit, restores manifest-declared paths, then restarts through readiness-aware path +- **pre-restore checkpoint hardening implemented (Apr 16 2026)** + - backup metadata fields added: `type` and optional `reason` + - manual backups record `type: "manual"` + - restore creates a local `checkpoint` backup with `reason: "pre_restore"` before any destructive operation + - checkpoint creation aborts the restore if it fails — live data never touched + - checkpoint creation disables pruning so safety backup is not immediately removed + - collision-safe backup IDs for same-second creation + - `POST /game/backups/restore` response now includes `restored`, `backup`, `checkpoint` + - full test coverage: checkpoint creation, abort-before-delete, manual metadata, listability, unsafe restore rejection Outstanding: -- pre-restore safety checkpoint - remote backup storage / transfer path - backup job history / progress beyond current operation state - retention policy refinement beyond initial local pruning @@ -83,12 +91,16 @@ Still outstanding: - confirm whether any remaining agent-side registration path can surface a backend before readiness probe success - see `SCRATCH/session-stabilization-fabric-findings.md` -### Agent Future Work +### Agent Future Work (post-launch) 1. Structured logging (slog) for Loki 2. Dev container `provisioningComplete` state in `/status` 3. Graceful shutdown verification 4. Process reattachment on agent restart 5. SSH server install in dev container provisioning pipeline +6. Long-running job model (job IDs, progress phases, cancel/retry) +7. Typed platform-action wrappers over raw console commands +8. Persistent operation recovery after agent restart +9. `RestartServer()` readiness probe bypass — fix or document --- @@ -192,9 +204,11 @@ Outstanding: Completed: - first local Minecraft backup / restore flow wired end-to-end through agent, API, and portal - manual local backup delete wired end-to-end through agent, API, and portal +- pre-restore checkpoint hardening complete in agent Outstanding: -- harden world backup / restore with pre-restore checkpoint, remote storage, and live validation +- remote storage for game server backups +- real-world backup/restore validation on live Minecraft server - game server subdomain / player connection method verification --- @@ -222,7 +236,8 @@ Optional / Nonessential: ## Pre-Launch Checklist Outstanding before launch: -- harden game server world backup / restore (checkpoint + remote storage + validation) +- remote storage for game server backups +- real-world backup/restore validation on live Minecraft server - game server subdomain verification - email notifications - upload testing @@ -281,3 +296,4 @@ Future work: - usage limits / quota enforcement - user onboarding flow - dashboard spotlight server IA refresh +- pre-restore checkpoint hardening (agent-side, Apr 16 2026)