Release Cutover Window Calculator
Plan a release cutover window from phase minutes, contingency, rollback reserve, safety floor, gate margin, runway scenarios, and charts.| Phase | Duration | Elapsed | Window time | Operator note | Copy |
|---|---|---|---|---|---|
| {{ row.phase }} | {{ row.duration }} | {{ row.elapsed }} | {{ row.windowTime }} | {{ row.note }} |
| Gate | Window time | Remaining | Status | Action | Copy |
|---|---|---|---|---|---|
| {{ row.gate }} | {{ row.windowTime }} | {{ row.remaining }} | {{ row.status }} | {{ row.action }} |
| Scenario | Cutover finish | Rollback margin | Required window | Status | Change applied | Copy |
|---|---|---|---|---|---|---|
| {{ row.scenario }} | {{ row.finish }} | {{ row.margin }} | {{ row.required }} | {{ row.status }} | {{ row.change }} |
Introduction
Production cutovers are usually judged by clock time, not by how long the deploy command runs. The risky minutes are often spent draining queues, waiting for a migration, shifting traffic, watching telemetry, getting approval, and still protecting enough time to roll back before the maintenance window closes.
A release cutover window is the approved span for moving a system from one production state to another. In a simple application release, the work may be deploy plus smoke tests. In a database, traffic, or infrastructure cutover, the same window can include pre-checks, data migration, feature-flag changes, load balancer changes, DNS or service-discovery moves, stakeholder messages, and a tested rollback path.
Planning gets unreliable when the vocabulary blurs. The maintenance window is the full approved period. Active cutover work is the sequence that must finish before the team can make a keep-or-revert decision. Rollback reserve is protected recovery time after that decision point. A hard stop is the end of the approved window, and some teams also keep a safety floor for closure, handoff, or compliance notes.
| Timing term | Planning question |
|---|---|
| Maintenance window | How long is the approved change period from start to hard stop? |
| Active cutover work | Which steps must run inside the window before the team can decide to continue? |
| Validation | Which smoke tests, telemetry checks, canary observations, or sign-offs prove the change is healthy enough to keep? |
| Rollback reserve | How many minutes must remain to revert, shift traffic back, restore state, and confirm recovery? |
| Rollback gate | What is the latest safe decision point before the reserve is consumed? |
Cutover plans usually fail on hidden elapsed time. A best-case estimate may leave out a slow database migration, a manual approval hold, a delayed stakeholder update, a canary observation period, or the time needed to prove rollback has restored service. A plan can also fit the clock while depending on an unsafe technical assumption, such as a schema change that older code cannot read after traffic moves back.
Staged release patterns such as canaries, blue/green deployment, one-box deployment, and traffic splitting reduce blast radius, but they do not remove the need for a time budget. Each staged step needs a decision signal, and every signal consumes some mix of clock time, traffic volume, telemetry confidence, and operator attention.
A cutover timing plan is planning evidence, not release approval. The real change still needs tested rollback steps, monitoring signals, owner accountability, and a clear rule for stopping, continuing, or reverting.
How to Use This Tool:
Use realistic elapsed minutes, not best-case command runtimes. Start with the approved maintenance window, then protect the rollback reserve you would actually need if validation fails.
- Enter a short release name so copied tables, documents, and JSON stay tied to the right change.
- Set a window start time when wall-clock labels help the release call. Leave it blank when relative
T+timing is clearer. - Enter the maintenance window, rollback reserve, and optional hard-stop safety floor before tuning individual phases.
- Add pre-checks, deploy work, database migration, traffic shift, and validation in minutes. Use
0only when that phase truly does not apply. - Set contingency as a percentage of active cutover work. Treat it as measured variance and coordination delay, not hidden scope.
- Open Advanced when approval holds or stakeholder update holds consume dedicated time inside the window.
- Check
Rollback margin, warnings, Cutover Phase Ledger, Rollback Gate Ledger, Runway Scenarios, Phase Allocation Chart, and Cutover Runway Curve before sharing the plan.
If the result shows rollback gate risk, shorten a real phase, move work outside the window, reduce contingency only with evidence, or request a wider window. Reducing rollback reserve should be the last option because it weakens recovery.
Interpreting Results:
Rollback margin is the controlling output. Positive margin means the modeled cutover finishes before the latest rollback decision. Zero margin means the plan lands on the boundary. Negative margin means active work and contingency consume minutes that were meant to remain protected for rollback or closure.
A rollback gate clear status checks timing only. It does not prove the deployment is reversible, monitored, or ready. Confirm database compatibility, feature-flag behavior, traffic controls, smoke tests, and owner sign-off before treating a clear gate as safe.
| Output | What to look for | Decision value |
|---|---|---|
| Cutover Phase Ledger | Phase durations, elapsed timing, window labels, and operator notes. | Checks whether the modeled sequence matches the runbook. |
| Rollback Gate Ledger | Planned finish, latest rollback decision, hard stop, reserve, and required window. | Shows whether protected rollback time remains intact. |
| Runway Scenarios | Current plan, lower contingency, shorter longest phase, extra validation, and larger rollback reserve. | Tests whether small changes clear or break the gate. |
| Phase Allocation Chart | Relative size of each work phase and reserve component. | Identifies the part of the plan controlling the schedule. |
| Cutover Runway Curve | Rollback margin across different contingency percentages. | Shows how much buffer can be added before the gate turns risky. |
| Warnings | Zero window, reserve larger than the window, no validation, no rollback reserve, or crossed gate. | Flags assumptions that should be corrected or documented before approval. |
Technical Details:
Cutover timing is an elapsed-minute budget. Active work covers the steps that must finish before the team can make a final go/no-go call: pre-checks, deployment, database migration, traffic shift, validation, approval holds, and stakeholder update holds. Contingency is calculated as a percentage of that active work, so a larger release plan receives a larger buffer.
The latest rollback decision sits before the hard stop. Rollback reserve and the hard-stop safety floor are subtracted from the maintenance window first, and only the remaining time is available for active cutover work plus contingency. A plan that finishes after that boundary is risky even when the calendar still shows time left.
Formula Core:
Symbols and Units:
| Symbol | Meaning | Unit |
|---|---|---|
T_work |
Pre-checks, deploy work, database migration, traffic shift, validation, approval hold, and stakeholder update hold. | minutes |
T_buffer |
Contingency allowance calculated from active work. | minutes |
T_finish |
Planned cutover finish after active work and contingency. | minutes from window start |
T_rollback gate |
Latest point for a commit-or-rollback decision while preserving rollback reserve and safety floor. | minutes from window start |
T_required window |
Minimum window needed for active work, contingency, rollback reserve, and safety floor. | minutes |
With the default values, active work is 20 + 35 + 18 + 10 + 25 = 108 min. A 25% contingency adds 27 min, so the planned cutover finish is 135 min. In a 180 min maintenance window with a 30 min rollback reserve and no safety floor, the rollback gate is 150 min, leaving 15 min of rollback margin.
Model Rules:
| Rule | Boundary | Meaning |
|---|---|---|
| Gate clear | finish <= rollback gate |
The modeled cutover finishes before protected rollback reserve begins. |
| Gate risk | finish > rollback gate |
The modeled cutover crosses into rollback or closure time. |
| Reserve exceeds window | rollback reserve + safety floor > window |
The latest rollback decision falls before the window starts. |
| Contingency range | 0% to 300% |
The buffer is a percentage of active work, not of the whole window. |
| Time labels | Clock time when a start time exists, otherwise T+ elapsed time. |
The math is identical; only the displayed label changes. |
Minute display is rounded for readability: values under one hour display in minutes with one decimal, while values of one hour or more display in hours with two decimals. The gate comparison still uses numeric minutes.
Safe deployment guidance emphasizes rollback design, staged exposure, evaluation signals, and explicit rollback testing. The timing model makes the go/no-go boundary visible, but it does not test whether a schema change, protocol change, or traffic shift can roll backward without customer impact.
Limitations and Privacy Notes:
- The model assumes phases run in the entered order and does not automatically overlap work.
- It does not verify rollback safety, data compatibility, monitoring coverage, customer impact, or approval policy.
- Calendar labels depend on the entered start time and browser date handling; elapsed-minute math is the controlling output.
- The timing calculation runs in the browser; no server-side cutover analysis is used for the entered plan.
- Avoid entering sensitive incident names, customer data, secrets, or unreleased product details in the release name, copied tables, downloaded files, screenshots, or shared URLs.
Worked Examples:
Default Plan Keeps 15 Minutes of Runway
With a 180 min maintenance window, 20 min of pre-checks, 35 min of deploy work, 18 min of database migration, 10 min of traffic shift, 25 min of validation, 25% contingency, and a 30 min rollback reserve, active work totals 108 min. Contingency adds 27 min, so planned cutover finish lands at 135 min. The summary shows 15.0 min runway, and Rollback margin is positive.
Extra Validation Breaks the Same Window
Using the same plan, adding 15 min to validation raises active work to 123 min. With 25% contingency, planned cutover finish reaches 153.75 min. The rollback gate remains at 150 min, so Rollback margin is negative and the Runway Scenarios table moves the case to gate risk.
A Larger Rollback Reserve Uses the Cushion
If the default plan increases rollback reserve from 30 min to 45 min, the latest rollback decision moves from 150 min to 135 min. The cutover still fits because finish equals the rollback gate, but Rollback margin is +0.0 min. That is fragile because one rerun, delayed approval, or longer smoke test crosses the gate.
Reserve Larger Than the Window Is a Planning Error
A 40 min maintenance window with 35 min rollback reserve and a 10 min hard-stop safety floor leaves the rollback gate at -5 min. The warning says rollback reserve plus hard-stop safety floor is larger than the maintenance window, so the plan needs a wider window or a tested rollback approach that genuinely requires less time.
FAQ:
What does rollback runway mean?
Rollback runway is the positive time between planned cutover finish and the latest rollback decision. It is the cushion before protected rollback reserve begins.
Should contingency include rollback time?
No. Contingency is added to active cutover work before the rollback gate comparison. Put recovery time in rollback reserve so the decision boundary stays visible.
Why can a clear gate still be unsafe?
The gate only checks time. A release can fit the window but still be unsafe if the rollback path is untested, if data written by the new version cannot be read by the old version, or if monitoring cannot detect failure quickly enough.
What should I do when the result says rollback gate risk?
Use the phase ledger and runway scenarios to identify the pressure point. Then remove or move work, ask for a wider window, or change the release approach before reducing recovery reserve.
Why does the warning say the reserve is larger than the window?
That warning appears when rollback reserve plus hard-stop safety floor exceeds the maintenance window. Increase the window, reduce the reserve only if rollback testing supports it, or move closure work outside the window.
Why does the window time show T+ instead of a clock time?
When the start time is blank, the ledgers use relative elapsed labels such as T+45.0 min. Enter a start time to show wall-clock labels.
Glossary:
- Cutover
- The period when production traffic, data, or system ownership moves from one state to another.
- Rollback gate
- The latest point at which the team can decide to roll back while still preserving rollback reserve and hard-stop safety time.
- Rollback reserve
- Protected time for reverting, shifting traffic back, restoring state, and confirming recovery before the window ends.
- Contingency
- A percentage buffer added to active cutover work for expected variance, retests, coordination delay, or minor recovery steps.
- Hard stop
- The end of the approved maintenance window or change period.
- Required window
- The modeled duration needed to fit active work, contingency, rollback reserve, and safety floor.
References:
- Ensuring rollback safety during deployments, Amazon Builders' Library.
- Automating safe hands-off deployments, Amazon Builders' Library.
- OPS06-BP03 Employ safe deployment strategies, AWS Well-Architected Framework.
- Canarying Releases, Google Site Reliability Engineering Workbook.