Site Reliability Engineer - SRE
Overview

We are seeking a Solutions Architect with strong SRE expertise to guide engineering teams, drive reliability practices, and lead automation and cloud operational improvements. The ideal candidate has deep hands-on experience across modern DevOps/SRE tools, cloud platforms, observability, and large-scale distributed systems.

Key Responsibilities
• Serve as the SRE advocate, guiding teams on reliability, automation, and operational excellence.
• Apply core SRE principles: SLIs/SLOs, error budgets, critical user journeys (CUJs), and NFR-based reliability standards.
• Identify and automate toil across the SDLC and operations.
• Lead SRE dashboards, error budget tracking, RCA investigations, and anomaly detection.
• Design and implement end-to-end CI/CD workflows (Git, GitHub Actions, GitHub Workflows; Jenkins is a plus).
• Architect scalable, reliable cloud and application solutions from design through deployment.
• Partner with development teams to ensure smooth, monitored, and efficient release cycles.
• Implement monitoring and observability practices using Dynatrace, Splunk, Elastic Stack, and SolarWinds.
• Drive cloud architecture and automation using AWS, Terraform, Ansible Tower, scripting (Python), and IaC principles.
• Support container orchestration, AIOps initiatives, performance engineering, and operational excellence.

Required Skills
• Strong background in SRE, DevOps, and Cloud Operations.
• Hands-on experience with .NET, SQL, React, scripting, and infrastructure automation.
• Expertise with AWS Cloud, Dynatrace, Splunk, Elastic Stack, and SolarWinds DPA.
• Experience at organizations such as Tech Mahindra, Cognizant, Altimetrik, or Accenture is a plus.
• Excellent communication, leadership, and problem-solving abilities.

Job Type: Contract
Pay: $53.66 - $94.62 per hour
Work Location: Remote