Senior Cloud Engineer - AWS
We are looking for a Senior Cloud Engineer - AWS to join our cloud operations team. This role is intended for a highly experienced, hands-on cloud engineer who is comfortable taking ownership of complex AWS environments across multiple customers and architectural styles.
You will work with a wide range of AWS services and architectural patterns, focusing primarily on cloud-native, containerized, and serverless workloads. You will be expected to operate, troubleshoot, and continuously improve these environments in production, with an emphasis on managed services, automation, and reliability rather than traditional server administration. The role requires the ability to quickly understand unfamiliar or inherited customer setups and take ownership of their evolution.
As a senior engineer, you will act as an escalation point for complex cloud issues, participate in backup on-call rotations, and help define and uphold cloud standards and best practices. You will also mentor junior engineers and engage in technical and architectural discussions with customers, helping them make informed, pragmatic decisions aligned with AWS best practices.
What will be your key responsibilities
Operate, maintain, and continuously improve AWS-based cloud infrastructures across multiple customer environments
Act as a technical escalation point for complex issues beyond L1/L2 support
Participate in backup on-call rotations, providing expert assistance when standard procedures are insufficient
Review existing cloud architectures and proactively identify risks related to:
Availability and resilience
Security and access control
Performance and scalability
Cost efficiency
Design and implement infrastructure changes using Infrastructure as Code
Define, document, and maintain technical standards, operational procedures, and best practices
Mentor and guide junior cloud engineers through technical reviews, architectural discussions, and day-to-day problem solving
Collaborate directly with customers to discuss technical and architectural topics, explain trade-offs, and propose pragmatic, AWS-aligned solutions
What experience should you have
AWS (Senior / L3 level)
5+ years of hands-on AWS experience in production environments, ideally across multiple customers or AWS accounts
Proven ability to design, operate, and troubleshoot complex AWS environments, with a strong focus on cloud-native and managed services
Container-based workloads
ECS and/or EKS, including service configuration, scaling, and troubleshooting
AWS Fargate for running containerized workloads without server management
Serverless and event-driven architectures
AWS Lambda (deployment, configuration, monitoring, performance tuning)
API Gateway (REST/HTTP APIs, integrations, authorization)
Event-driven services such as EventBridge, SQS, SNS
Understanding of serverless-specific operational concerns (timeouts, concurrency, retries, error handling)
Core AWS platform services
IAM (advanced role and policy design, cross-account access, least-privilege enforcement)
VPC networking concepts (CIDR planning, routing, NAT gateways, VPC endpoints, peering and Transit Gateway basics)
Storage and data services (S3 lifecycle management, RDS, backups, encryption at rest and in transit)
Security fundamentals (KMS, security groups vs NACLs, shared responsibility model)
Operational ownership and reliability mindset
Monitoring, logging, and alerting using CloudWatch
Incident troubleshooting and root cause analysis
Designing for high availability, fault tolerance, and disaster recovery
Cost awareness and optimization across AWS services, including serverless and container workloads
Ability to take ownership of unfamiliar or inherited customer environments, even when architectures are complex, inconsistent, or poorly documented
Comfortable acting as a senior escalation point for complex AWS-related incidents and architectural challenges
Networking
Practical understanding of networking fundamentals (TCP/IP, DNS, HTTP/S)
Hands-on experience with AWS networking, including VPC design, routing, security groups, NACLs, and connectivity troubleshooting
Infrastructure as Code
Advanced Terraform experience, including:
Designing and maintaining reusable modules
Managing remote state and locking
Reviewing and improving existing Terraform codebases
Applying IaC standards and best practices within a team
CI/CD & version control
Hands-on experience with CI/CD pipelines (preferably GitLab CI)
Familiarity with pipeline design, environment handling, infrastructure and application deployment automation, and Git-based workflows
Scripting & automation
Ability to write and maintain automation scripts using Bash and Python, including AWS SDK usage (boto3)
Strong focus on reducing manual work and improving operational efficiency
Team & customer collaboration
Experience mentoring junior engineers through technical guidance and reviews
Strong communication skills
Comfortable discussing technical and architectural topics with customers and internal stakeholders
Able to explain complex cloud concepts in a clear and pragmatic manner
Nice to have
AWS Professional-level certifications (Solutions Architect or DevOps Engineer)
Experience with multi-account AWS setups and landing zone concepts
Exposure to security- or compliance-driven environments
Prior consulting or customer-facing experience
Why this role matters
As a Senior Cloud Engineer - AWS, you play a key role in ensuring the stability, security, and scalability of our customers’ cloud environments. Your expertise will directly influence architectural quality, operational excellence, and the technical growth of engineers within the team.
