Site Reliability Engineer

Site Reliability Engineer




Career Track


Site Reliability Engineer

Job Description: Designs and operates the routing, reliability and supportability of company's site. Develops and automates solutions that reduce operations tasks and increase time spent at engineering site. Builds patterns and systems to support the engineering team, enabling them to iterate at top speed in an open automated platform. Processes and retires technical debt and recovery of supportable solutions and automated recovery.

Responsibilities: Deploys and supports applications on company's cloud environment. Rotates between working on dedicated projects for improvement. Handles requests to keep the company's site functioning efficiently. Makes changes to application stack, or performs deploys on critical infrastructure. Improves the state of monitoring, alerting, instrumenting and reporting. Builds and enhances tools that allow to automate configuration, helps integrate continuous integration and deployment processes and makes them easier and efficient.

Skills: Strong work ethics, ability to multi-tasking and work independently or in a collaborative environment. Excellent written and verbal communication skills with ability to communicate well with both technical and non-technical staff. Self-motivated with strong organizational skills. Strong troubleshooting skills and systems thinking ability.

Experience: Experience in software development or in operations engineering at a highly available environment with scale. Prior experience with scripting (Python, JavaScript, Bash) in an industry setting; HTML/CSS/JavaScript. Experience in implementing and supporting user-facing, large-scale, secure tech stacks, AWS, Salt/Chef/Puppet/Ansible configuration management, Docker, automation and improving release and deployment processes.

Education: Bachelor’s Degree in Computer and Information Science.