Monitoring tool sets and services (for example, CloudWatch)
Service level agreements (SLAs) and key performance indicators (KPIs)
Translating business requirements to measurable metrics
Testing potential remediation solutions and making recommendations
Proposing opportunities for the adoption of new technologies and managed services
Assessing solutions and applying rightsizing based on requirements
Identifying and examining performance bottlenecks
AWS Global Infrastructure
Scaling methodologies (for example, load balancing, auto scaling)
High availability and resiliency
Disaster recovery methods and tools
Service quotas and limits
Understanding application growth and usage trends
Evaluating existing architecture to determine areas that are not sufficiently reliable
Remediating single points of failure