Amazon Ads Infrastructure: Millions Saved Through Rightsizing & Demand Auto Scaling
Amazon Advertising (via Chris Edwards, SDE)
Challenge
Large-scale ad-serving infrastructure supporting NFL Sundays, NBA playoffs, WNBA, and peak Amazon sale events like Prime Day was provisioned for worst-case load and left running around the clock. The compute fleet was significantly over-provisioned during off-peak windows — overnight, weekends, between campaign bursts — resulting in millions of dollars in annual waste on idle capacity.
Solution
As a Systems Development Engineer at Amazon Ads, performed a comprehensive utilization audit across the compute fleet. Identified over-provisioned instances relative to actual CPU and memory usage across normal windows. Rightsized instances to match real workload demands, then designed and implemented demand-based auto scaling policies tied to ad-serving traffic patterns, campaign queues, and event-driven spikes. Introduced scheduled scaling for predictable cycles and target tracking policies for unpredictable load during major live events — so infrastructure scaled up for the Super Bowl and scaled back down after.
Results
- 30% reduction in annual AWS spend
- Compute costs aligned to actual demand instead of peak capacity
- Auto scaling handles campaign launch traffic spikes automatically
- Scheduled scaling eliminates overnight and weekend waste
- Zero degradation to ad-serving latency or analytics throughput