{{ summaryHeading }}
{{ summaryPrimary }}
{{ summaryLine }}
{{ badge.label }}
INVOKE MEMORY PC BILL {{ stageUsageLabel }}
AWS Lambda cost inputs
Start from a common Lambda pattern, then tune requests, duration, memory, and provisioned concurrency.
Changing architecture updates the editable rate card unless Custom pricing is selected.
Use US East for AWS public examples, or pick a premium/custom profile before editing exact regional rates.
Set 1.00 for US East parity; raise it when your target region carries a published premium.
x
Use billed invocations from CloudWatch, Cost Explorer, or your forecast.
Enter milliseconds per invoke after runtime and extension behavior are included.
ms
AWS Lambda supports 128 MB to 10,240 MB in 1 MB increments.
MB
Use On-demand for ordinary pay-per-use functions; use Provisioned window for launch events or latency-sensitive aliases.
This drives GB-second capacity charges for the whole enabled window.
Use 744 for always-on, or the exact event hours for a launch window.
hours
{{ provisioned_request_share_percent }}%
Use traffic logs or event forecasts to split provisioned-window and ordinary on-demand requests.
This is a planning cap before taxes, credits, CloudWatch logs, data transfer, API Gateway, and support charges.
$ / mo
Raise this when functions use /tmp above the included 512 MB.
MB
Adds Lambda response streaming charges for bytes beyond the first 6 MB per streamed request.
Adds request-charge units for async events larger than the included 256 KB request payload window.
Free-tier allowance is applied to request charges, on-demand duration, and response streaming bytes when modeled. Provisioned capacity and provisioned duration remain fully billable.
requests
GB-s
$ / 1M requests
$ / GB-s
$ / GB-s
$ / GB-s
$ / GB-s
{{ savings_plan_discount_percent }}%
Line itemUsage basisBillable unitsCostPricing noteCopy
{{ row.item }} {{ row.basis }} {{ row.units }} {{ row.cost }} {{ row.note }}
MetricValueWhy it mattersCopy
{{ row.metric }} {{ row.value }} {{ row.note }}
LeverModeled impactMonthly deltaOperator noteCopy
{{ row.lever }} {{ row.impact }} {{ row.delta }} {{ row.note }}
Customize
Advanced
:

AWS Lambda cost is driven by small meters that multiply quickly: requests, billed duration, configured memory, architecture, Provisioned Concurrency, additional ephemeral storage, response streaming, and account-level free-tier allowances. A function can look inexpensive in a single test and still become material when it runs millions of times per month or holds warm capacity for a launch window.

The main compute meter is GB-seconds. Each invocation contributes duration in seconds multiplied by configured memory in gigabytes. Higher memory raises the per-millisecond cost unit, but it also gives the function more CPU allocation and can reduce duration for CPU-bound workloads. That means the cheapest memory setting is not always the smallest memory setting.

Requests invokes GB-seconds duration x memory Options PC, /tmp, streams Bill minus free tier Requests create usage units. Duration, memory, and optional capacity meters turn usage into monthly cost.

Provisioned Concurrency changes the shape of the bill. It can reduce cold-start risk for latency-sensitive traffic, but it adds a capacity charge for configured concurrency during the enabled window. Requests that run during that window also use provisioned-duration rates instead of the ordinary on-demand duration path in this model.

Free-tier treatment needs care because it is an account-level allowance, not a per-function promise. One million monthly requests and 400,000 GB-seconds can be shared across functions and architectures, and other workloads may consume the allowance before this function does. Editable free-tier fields make it possible to model the remaining allowance rather than assuming a fresh account.

A Lambda estimate is strongest when its inputs come from billed duration, memory settings, request counts, and current AWS regional pricing. It is weaker when it ignores nearby services such as API Gateway, event sources, logs, data transfer, storage, retries, and downstream systems.

How to Use This Tool:

Start with a workload shape, then make the rate card match the region and account terms you want to model.

  1. Choose a workload preset, architecture, and region profile. Use custom pricing when you need exact regional rates, private terms, or a nonstandard assumption.
  2. Enter monthly invocations, average billed duration, and memory allocation. Use billed duration rather than handler-only timing when extensions, initialization, or runtime behavior affect the meter.
  3. Choose on-demand, Provisioned Concurrency, or compare mode. When provisioned modeling is enabled, enter configured concurrency, enabled hours, and the share of requests during that window.
  4. Set a monthly budget cap if you want the summary and tuning brief to flag the modeled run rate against a planning limit.
  5. Open advanced settings for additional ephemeral storage, response streaming bytes, asynchronous event payload request units, free-tier allowances, editable rate fields, and a Compute Savings Plans discount.
  6. Review Cost Ledger, Usage Units, Tuning Brief, Lambda Cost Mix, Invocation Cost Curve, and JSON to understand both the total and the drivers.

If a field is outside the accepted range, fix validation before reading the ledger. Memory must stay from 128 MB through 10,240 MB, provisioned hours must stay from 0 through 744, ephemeral storage must stay from 512 MB through 10,240 MB, and percentage shares must remain from 0% through 100%.

Interpreting Results:

Total monthly cost combines request charges, on-demand duration, provisioned duration, provisioned capacity, additional ephemeral storage, modeled response streaming overage, and any entered compute discount. The result excludes taxes, credits not entered in the rate fields, CloudWatch Logs, API Gateway, data transfer, queues, databases, and other surrounding AWS services.

Cost per million invokes is useful for scaling conversations because it normalizes the current workload into a traffic unit. It can still change as free tier is exhausted, provisioned capacity is added, response sizes change, or duration improves with memory tuning.

AWS Lambda cost result interpretation
Output Meaning Check before budgeting
Cost Ledger Line-item cost by requests, duration, capacity, storage, streaming, discount, and total. Confirm the editable rates match the target region and architecture.
Usage Units Raw monthly invocations, request units, GB-seconds, free-tier use, and extra meters. Use it to catch hidden request-unit growth from async payloads or large streamed responses.
Tuning Brief What-if deltas for shorter duration, memory right-sizing, architecture choice, provisioned capacity, storage, and free tier. Benchmark before lowering memory because smaller memory can make some functions slower.
Invocation Cost Curve Monthly cost and cost per million across traffic levels up to about twice the current volume. Free-tier cliffs and provisioned capacity can make the curve non-linear.

A low Lambda number can still hide a high application cost if the function causes log ingestion, API requests, database calls, queue messages, NAT gateway traffic, or retries. Use the result as the Lambda slice of a wider architecture estimate.

Technical Details:

Lambda duration billing combines time and memory into GB-seconds. The model splits invocations into on-demand and provisioned-window requests when Provisioned Concurrency is enabled. On-demand GB-seconds can consume the entered free compute allowance. Provisioned-duration GB-seconds and provisioned-capacity GB-seconds are modeled as separate paid meters.

Request charging starts from invocation count, then optionally adjusts asynchronous event sources into request units. In this model, the first 256 KB of an asynchronous event counts as one request unit, and each additional 64 KB chunk up to 1 MB adds another unit. Response streaming is modeled from bytes beyond the first 6 MB per streamed request, after the entered monthly streaming allowance.

Formula Core:

The equations below show the main monthly meters. Rates are editable, so the same usage can produce different totals by architecture, region profile, or custom rate card.

M = memory MB1024 GBsonDemand = NonDemand×duration seconds×M GBspcDuration = Npc×duration seconds×M GBspcCapacity = provisioned concurrency×hours×3600×M GBstmp = N×duration seconds×max(0,ephemeral MB-512)1024

Request cost equals billable request units divided by one million, multiplied by the request rate. On-demand duration cost uses billable on-demand GB-seconds after the entered free compute allowance. Provisioned duration, provisioned capacity, and additional ephemeral storage use their own GB-second rates. The entered compute discount applies to duration and provisioned-capacity charges, not to requests, response streaming, or extra ephemeral storage.

Ctotal = Crequests + Ccompute - Cdiscount + Ctmp + Cstreaming
AWS Lambda modeled meters and exclusions
Meter Modeled rule Important boundary
Requests Monthly request units after optional async payload expansion and free request allowance. Retries, test invokes, and event-source behavior can raise the count.
On-demand duration On-demand invocations multiplied by duration seconds and memory GB. Free compute allowance is applied only to this meter in the model.
Provisioned Concurrency Provisioned-window invocations use provisioned duration, and configured concurrency creates capacity GB-seconds. AWS rounds short configured periods in billing; enter hours conservatively.
Additional ephemeral storage Only storage above 512 MB is multiplied by invocation duration. Large media, model, and ETL workloads can make this visible.
Excluded services Only Lambda meters shown in the ledger are included. Logs, API Gateway, data transfer, queues, databases, SnapStart-specific pricing, and managed instances are outside the estimate.

For a default API backend at 25 million monthly requests, 180 ms duration, and 512 MB memory, raw compute is 2.25 million GB-seconds before free-tier compute. Changing memory to 1,024 MB doubles the memory multiplier, but the final cost can still fall if the higher CPU allocation cuts billed duration by more than half.

Limitations and Accuracy Notes:

Public cloud rates change and vary by region, architecture, date, tier, commitment, and account agreement. The built-in rates are planning defaults, and every rate can be edited. Verify the final estimate against the current AWS pricing page, your billing export, Cost Explorer, and any private pricing agreement before using it in a forecast.

The calculation runs from values entered in the browser and does not query your AWS account. It cannot discover real traffic, consumed free tier, regional usage tiers, function settings, or surrounding service costs unless you enter those assumptions yourself.

Worked Examples:

API backend on demand

At 25 million monthly requests, 180 ms billed duration, and 512 MB memory, the main meter is 25,000,000 x 0.18 x 0.5 = 2,250,000 GB-seconds. After the entered free-tier allowances, the ledger separates billable requests from billable compute.

Provisioned launch window

A launch event with 100 provisioned environments for 8 hours at 1.5 GB memory creates 100 x 8 x 3,600 x 1.5 = 4,320,000 provisioned-capacity GB-seconds before any requests run. That capacity line can dominate a short launch even when invocation duration is small.

Large streamed responses

If response streaming is enabled and average streamed response is 8 MB, only 2 MB per streamed request is modeled as overage before the entered monthly streaming allowance. Keeping responses at or below 6 MB avoids that modeled overage.

FAQ:

Should I use average duration or p95 duration?

Use the duration statistic that matches the planning question. Average billed duration works for monthly cost reconciliation. A conservative p50-to-p90 blend may be better for a prelaunch budget if traffic shape is uncertain.

Why can lower memory increase cost?

Lower memory reduces GB-seconds per millisecond, but it also reduces available CPU. If duration rises enough, total GB-seconds can stay flat or increase.

Does this include CloudWatch Logs?

No. Logs, API Gateway, queues, storage, data transfer, databases, and other services are outside the Lambda ledger and should be estimated separately.

Are my AWS account details used?

No. The tool uses the values on the page. It does not connect to your AWS account, fetch usage, or verify private pricing.