Cloud Well-Architected Cheat Sheet
The well-architected documentation from each of the major clouds is a fantastic resource for building and running cloud applications and platforms. It deserves a cheat sheet.
Prompt to Copilot:
Turn the #selection into a Markdown table, with “##” as the first column and “###” headings as the following columns. Do it for each cloud and then concatenate them.
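For readers who would rather script the same transformation than prompt for it, here is a minimal sketch in Python. It assumes the input is Markdown in which each "##" heading is a pillar and each "###" heading beneath it is a principle. The function name `headings_to_table` and the column labels "Pillar" and "Principle N" are hypothetical choices made for illustration; they do not come from any of the cloud frameworks, and this is not a description of how Copilot handles the prompt.

```python
def headings_to_table(markdown: str) -> str:
    """Build a Markdown table with one row per '##' heading and
    one cell per '###' heading found underneath it."""
    rows = []  # each row: [pillar, principle, principle, ...]
    for raw in markdown.splitlines():
        line = raw.strip()
        if line.startswith("### "):
            if rows:  # skip '###' headings that precede any '##'
                rows[-1].append(line[4:].strip())
        elif line.startswith("## "):
            rows.append([line[3:].strip()])
    if not rows:
        return ""

    # Pad shorter rows so every row has the same number of cells.
    width = max(len(row) for row in rows)
    header = ["Pillar"] + [f"Principle {i}" for i in range(1, width)]
    lines = [
        "| " + " | ".join(header) + " |",
        "| " + " | ".join(["---"] * width) + " |",
    ]
    for row in rows:
        padded = row + [""] * (width - len(row))
        lines.append("| " + " | ".join(padded) + " |")
    return "\n".join(lines)


# Illustrative input only: a small excerpt in the assumed heading layout.
sample = """## Reliability
### Test recovery procedures
### Stop guessing capacity
## Security
### Maintain traceability
"""

print(headings_to_table(sample))
```

Running the function once per cloud and joining the results with blank lines would approximate the "concatenate them" step of the prompt.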
AWS Well-Architected Framework
Operational Excellence
- Perform operations as code
- Make frequent, small, reversible changes
- Refine operations procedures frequently
- Anticipate failure
- Learn from all operational failures
- Use managed services
- Implement observability for actionable insights
Reliability
- Automatically recover from failure
- Test recovery procedures
- Scale horizontally to increase aggregate workload availability
- Stop guessing capacity
- Manage change through automation
Security
- Implement a strong identity foundation
- Maintain traceability
- Apply security at all layers
- Automate security best practices
- Protect data in transit and at rest
- Keep people away from data
- Prepare for security events
Microsoft Azure Well-Architected Framework
Operational Excellence
- Embrace DevOps culture
- Establish development standards
- Evolve operations with observability
- Deploy with confidence
- Automate for efficiency
- Adopt safe deployment practices
Reliability
- Design for business requirements
- Design for resilience
- Design for recovery
- Design for operations
- Keep it simple
Security
- Plan your security readiness
- Design to protect confidentiality
- Design to protect integrity
- Design to protect availability
- Sustain and evolve your security posture
Google Cloud Architecture Framework
Operational Excellence
- Automate your deployments
- Set up monitoring, alerting, and logging
- Establish cloud support and escalation processes
- Manage capacity and quota
- Plan for peak traffic and launch events
- Create a culture of automation
Reliability
- Understand the core reliability principles
- Define your reliability goals
- Define SLOs
- Adopt SLOs
- Build observability into your infrastructure and application
- Design for scale and high availability
- Create reliable operational processes and tools
- Build efficient alerts
- Build a collaborative incident management process
Security
- Review shared responsibility and shared fate on Google Cloud
- Understand security principles
- Manage risks with controls
- Manage your assets
- Manage identity and access
- Implement compute and container security
- Secure your network
- Implement data security
- Deploy applications securely
- Manage compliance obligations
- Implement data residency and sovereignty requirements
- Implement privacy requirements
- Implement logging and detective controls