Security Reliability Engineer
ADM
Job Description
Position Summary:
A Security Reliability Engineer blends software engineering with systems administration to ensure the scalability, performance, and reliability of large-scale enterprise and cloud-based applications, services, and infrastructure. The role is proactive, automation-driven, and integrated into the software development lifecycle to engineer optimization improvements in Security services. Under minimal supervision, the engineer uses and establishes procedures and standards to consistently build, deploy, support, and integrate secure solutions, such as security tools and business systems.
Must display passion for and interest in Information Security, and draw recommendations from real-world experience.

Job Responsibilities:
System Reliability & Uptime: Design and maintain fault-tolerant, highly available systems with automated failover and disaster recovery strategies.
Security Integration: Embed security controls into CI/CD pipelines, infrastructure-as-code, runtime environments, and deployment workflows; consult on identity and access; and support compliance efforts.
Incident Response & Forensics: Lead or support incident response, root cause analysis, and postmortem documentation for security services (not security incidents).
Monitoring & Observability: Implement and tune monitoring tools (e.g., Prometheus, SIEM, Datadog) to detect anomalies, anticipate issues, and resolve them, improving the reliability of services.
Infrastructure Hardening: Recommend and enhance secure configurations for cloud and on-prem systems, including patch management and vulnerability remediation.
Collaboration: Work closely with Architecture, Engineering, DevOps, AppSec, and GRC teams to align reliability and security engineering.
Performance Optimization: Analyze system performance, recommend enhancements, and implement improvements while working with service teams to reduce latency and increase throughput.
Work with GICS architecture, engineering, and operations teams to identify performance optimization opportunities.
Consult with GICS to identify the root cause of incidents and problem tickets, create action plans, and take responsibility for tracking and implementing change plans that reduce and eliminate repeat incidents.

Job Requirements:
Systems Engineering & Automation: Proficiency in operating systems, networking, and cloud infrastructure, along with automation tools like Python, Bash, or Terraform.
Incident Management & Troubleshooting: Ability to diagnose and resolve system outages quickly, using monitoring tools and root cause analysis.
CI/CD Pipeline Development: Experience with continuous integration and deployment to ensure secure and reliable software releases.
Distributed Computing: Understanding of scalable architectures and cloud-native security principles.
Responsible for automating security controls, data, and processes to provide improved metrics and efficient operational support.
Collaborate with cross-functional technical teams to deliver creative solutions to complex global technological challenges and business requirements.
Must have experience with virtualization (cloud or non-cloud).
Able to automate or script daily tasks in Python, Bash, or an equivalent.
Experience with web-based applications or web services.
Experience with cloud foundation services related to computing, networking, storage, content delivery, administration and security, deployment and management, and automation technologies.
Experience with Terraform, Bicep, JSON, and/or other infrastructure-as-code technologies.
Experience with microservices programming (AWS Lambda, Docker, Function Apps, etc.).
DevOps competency in building and deploying infrastructure with cloud deployment, build, and test automation technologies such as Ansible, Chef, Puppet, Docker, and Jenkins.
Functional understanding of complex enterprise environments and current technology areas like cloud and mobility.
Excellent analytical and troubleshooting skills.
Ability to weigh security risks against business benefits and form a strong recommendation for cross-functional teams.
Ability to communicate and collaborate effectively with other team members in a geographically and culturally diverse workforce.
Expected to complete projects within specified deadlines.
Expected to work occasional nights, weekends, holidays, and overtime.
Expected to perform on-call duties.
Potential for occasional travel.
Solid sense of integrity and ethics.

Desired Skills:
Current security and reliability engineering certifications.
Problem Solving: Ability to troubleshoot complex systems under pressure.
Communication: Translate technical risks into business impact for stakeholders.
Collaboration: Work across teams to embed security into engineering culture.
Bachelor's degree or equivalent work experience.
Project management experience.
Experience with lifecycle and licensing within cloud environments.