Skip to content

Avo Assure

The Ultimate Enterprise Grade No-Code Solution for End-to-End Test Automation.

Test Data Management

Avo TDM delivers data-compliant synthetic test data on demand.

Integrations
Avo Community
MegaMenuImage-1
Product documentation

Complete Avo Assure Product documentation

Avo Academy

Learn best practices with our Courses and Trainings

Content library

Individual resources like eBooks, Product Sheets etc.

Webinars & Podcasts

Insightful webinars, podcasts and expert discussions

Newsroom

Latest updates, stories, and insights on the Avo Assure

Events

Exclusive events highlighting the latest in Avo Assure.

AdobeStock_291160882 1
About us
Partners
Contact us
unsplash_VWcPlbHglYc

Site Reliability Engineer

Engineering Manager / Director

Job Description

Location: Your Location or Remote

Employment Type: Full-time

Experience:  6+ years

About Us

Avo Automation is an industry-leading test automation platform dedicated to revolutionizing how businesses approach software quality.

Emerging from stealth mode in late 2020, we have focused on transforming the way organizations think about software quality.

Our AI-driven, no-code platform delivers continuous quality assurance across key business processes and supports over 200 technologies.

With efficient test data management and machine learning capabilities, our platform enables teams to enhance quality, conduct efficient data validation, and reduce time to market.

About the Role

We are looking for a talented Cloud/DevOps Engineer to strengthen our infrastructure reliability, automation, and delivery processes. In this role, you will manage monitoring, observability, and alerting systems using tools such as Prometheus, Grafana, CloudWatch, Datadog, ELK Stack, Splunk, and New Relic to ensure system stability and proactive issue resolution.

You will drive cloud automation initiatives through Infrastructure as Code (IaC) tools like Terraform, CloudFormation, and AWS CDK, enabling scalable and repeatable deployments. The position also involves configuring and administering CI/CD pipelines using Jenkins, GitHub Actions, AWS CodePipeline, and ArgoCD to streamline software releases and enhance deployment efficiency.

Working closely with development and engineering teams, you will improve reliability, optimize workflows, and promote DevOps best practices. Additionally, you will monitor and optimize system and network performance—including CPU, memory, and kernel configurations—while identifying and resolving performance bottlenecks in modern cloud environments.

Key Responsibilities

Reliability & Performance

  • Design, build, and maintain highly available fault-tolerant cloud infrastructure using EC2, EKS, Lambda, RDS, and related cloud services.

Automation & Infrastructure as Code

  • Develop, deploy, and manage infrastructure using tools like Terraform, CloudFormation, or CDK ensuring repeatability and scalability.
  • Implement configuration management and automation with tools like Ansible, Puppet, Chef.

Monitoring & Incident Response

  • Monitor, analyze, and optimize system performance with tools like Prometheus,Grafana, CloudWatch, Datadog, ELK Stack, and New Relic.
  • Participate in on-call rotations, handling incident response, SLIs/SLOs/SLAs, androot-cause analysis.

CI/CD & Deployment

  • Maintain and optimize CI/CD pipelines (Jenkins, GitHub Actions, AWS CodePipeline, ArgoCD) for continuous, reliable delivery.

Security & Compliance

  • Implement and enforce security best practices including IAM, KMS, VPC security, compliance (SOC2, ISO 27001, HIPAA).
  • Manage network architectures—VPCs, subnets, firewalls, and application security.

Requirements

  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 6+ years of experience in SRE, DevOps, or Cloud Infrastructure roles with hands-on expertise.
  • Strong proficiency in one cloud - AWS, Azure or Oracle Cloud environments.
  • Proficiency in scripting and programming languages (Python, Bash/Shell, Go).
  • Hands-on experience with infrastructure automation and configuration management tools (Terraform, CloudFormation, Ansible, Puppet).
  • Experience in designing CI/CD pipelines and implementing automated build/deployment workflows.
  • Experience with monitoring and logging frameworks (Prometheus, Grafana, ELK Stack, Datadog, Splunk).
  • Excellent communication and collaboration skills; able to work effectively within crossfunctional teams.
  • Strong problem-solving abilities and experience mentoring team members.