A SaaS company runs its web application on Amazon EC2 instances deployed across multiple Availability Zones. The instances are part of an Auto Scaling group and are fronted by an Application Load Balancer. Performance testing shows that the application operates optimally when the average CPU utilization across instances remains close to 40%. The company wants to ensure that the Auto Scaling group adjusts automatically to maintain this target performance level. What should a solutions architect do to meet this requirement?