Site Reliability Engineer - Security Platform Services
Splunk
- Support a culture of learning and growth by collaborating with and mentoring engineers. Help the team grow their technical expertise.
- Collaborate with your team to own major features and components from conception to delivery, ensuring that technical solutions meet business objectives, security requirements, and user needs.
- Demonstrate your understanding of CI/CD principles, Site Reliability Engineering (SRE) and DevOps to ensure that our cloud and enterprise security solutions enable our customers to compose, run and monitor playbook execution.
- Participate in the on-call rotation to ensure the security, stability, and availability of the SOAR production system and be ready to diagnose, tackle, and resolve production issues with urgency and efficiency, especially related to Site Reliability, CI/CD, pipelines, test automation and security.
- Deliver high-quality solutions and ensure alignment with product objectives. Be an advocate for SRE, CI/CD, test automation and scalability in all aspects of the product.
- Identify new technologies and techniques that improve CI/CD, test automation, performance, and scalability of the platform. Actively resolve potential issues before they impact customers or the platform.
- Collaborate to set the technical vision, expectations, and standards for the team.
- Minimum 5+ years with a Bachelor's Degree or equivalent experience or higher in Computer Science or a related field.
- Enterprise experience in software engineering with an emphasis on cloud security, security, infrastructure, or networking.
- Experience with at least one language (e.g., Python, Go, Java, C++, or similar) with deep experience in cloud-native development (microservices, containers etc).
- Experience with automation tools like Terraform, Puppet, Ansible, or CloudFormation for infrastructure-as-code and CI/CD pipelines
- Extensive experience as a Site Reliability Engineer, including monitoring and alerting, and performance benchmarking.
- Extensive experience with CI/CD, test automation and site reliability using GitLab and git-based tooling.
- Experience in leading incident response efforts and forensic analysis in a cloud security context