AWS Introduces Automated Prompt Optimization in Bedrock to Boost AI Performance and Cut Costs
New Tool Automates Prompt Refinement Across Multiple LLMs
Amazon Web Services has launched the Advanced Prompt Optimization feature for its Bedrock platform, a managed service for building generative AI applications. Released on Thursday, the tool is accessible via the Bedrock console and automatically refines prompts to improve accuracy, consistency, and efficiency across various large language models, according to an AWS blog post.

The process begins by evaluating existing prompts against user-provided datasets and metrics. It then rewrites the prompts for up to five inference models, benchmarks the optimized versions against the originals, and helps developers identify the best-performing configurations for specific workloads. This automation reduces manual trial and error, enabling more systematic optimization of quality, latency, and cost.
Availability and Pricing
The tool is generally available in multiple AWS regions, including US East, US West, Mumbai, Seoul, Singapore, Sydney, Tokyo, Canada (Central), Frankfurt, Ireland, London, Zurich, and São Paulo. Enterprise customers are billed based on the Bedrock model inference tokens consumed during optimization, using the same per-token pricing as standard Bedrock workloads.
Analysts note that automated prompt refinement addresses key operational challenges, particularly the economics of scaling generative AI in production. "Enterprise demand for such tools is driven by cost pressure and operational complexity," said Gaurav Dewan, Research Director at Avasant. "Inference spending is quickly becoming a board-level concern as enterprises move from experimentation to production."

Key Benefits: Cost, Latency, and Multi-Model Strategies
Even modest improvements in prompt efficiency can significantly impact operating costs at scale. The tool also helps reduce latency—a critical metric for customer-facing AI services where slower responses can hinder adoption. "Prompt optimization enables systematic balancing of quality, latency, and cost," Dewan added.
Sanchit Vir Gogia, Chief Analyst at Greyhound Research, highlighted the growing adoption of multi-model AI strategies as a driver for automated optimization. Enterprises increasingly shift workloads between models based on cost, performance, and governance requirements. "Prompt optimization ensures applications can move between models without behavioral inconsistencies or performance degradation," Gogia explained.
By using automated prompt refinement through Advanced Prompt Optimization, organizations can achieve more reliable and efficient AI deployments, ultimately enhancing both operational and customer-facing outcomes.