Devops Engineer: Responsibilities
Devops Engineer: Responsibilities
Site Reliability Engineering (SRE) is an approach to running and evolving production systems,
using methods, concepts and practices from the discipline of Software Engineering and Systems
Engineering.
The focus is to achieve reliable and fault-tolerant systems, through heavy use of automation and
smart monitoring, while ensuring performance and evolving capacity.
You will be responsible for running and evolving the flagship SaaS platform that delivers complex
financial service solutions - customer acquisition workflows, data enrichment, strategy - to our
Enterprise clients.
Responsibilities
• Maintain SaaS platform services by monitoring capacity, availability, latency and overall systems health
• Own incident management process and response
• Optimize existing systems and infrastructure through automation and engineering practices
• Engage Development organization during inception and design phase to ensure that new
services/systems meet the non-functional acceptance criteria
• Evolve systems capacity and scalability
• Continuously improve monitoring strategy focusing on incident prevention/early detection
Must have:
• Linux administration and job automation using bash script
• Experience in AWS or GCP, networking, autoscaling, high availability and cost optimization
• Knowledge in configuring application, database and cache servers for best performance
• Experience with monitoring and profiling tools, for example Icinga
• Experience with Docker
Nice to have:
• Experience with one or more of the following: Ruby, Python, Perl, Lua
• Kubernetes, OpenShift
• Ansible, Puppet, Chef, Terraform