Project Description
Our client is a Danish jewellery brand, one of the most famous in the world. We are building a team to help improve the reliability of the client’s systems and develop a strong, long-term partnership.
As a Site Reliability Engineer, you’ll work with product teams to raise the maturity of engineering processes and practices for one of the biggest jewellery e-commerce projects in Europe. You’ll work closely with Dev teams on maturity assessments and improvements, and implement observability for systems, with a focus on IBM Sterling OMS.
And then there's Zoolatech! Just imagine a workplace and a team environment that you never want to leave once you have found it. Sound enticing? Apply for this position today, and we can get you there.
Responsibilities
- Setting up metric/log based monitoring and alerting.
- Defining SLOs and measuring SLIs and error budgets for production applications/services.
- Improving operational KPIs like MTTD/MTTR, service availability & reliability.
- Verifying system performance and scalability by participating in performance, load, and stress testing.
- Working with engineering teams to refine deployment and release processes.
- Monitoring and stress testing systems to collect metrics for tuning and capacity planning.
- Automating the detection and resolution of recurring issues (problem management).
- Ensuring the safety, predictability, repeatability, and suitability of all build and deploy processes.
Skills Required
- Knowledge of omni-channel and Order Management Systems (IBM Sterling OMS).
- Understanding of event streaming (Kafka) and REST APIs.
- Experience with Java-based systems.
- Experience with CI/CD (Azure DevOps Pipelines preferred) on public cloud platforms.
- Experience with APM and on-call pager tools (New Relic, Opsgenie preferred).
- Comfortable scripting and debugging distributed applications.
- Fluent in scalability and root cause analysis exercises (blameless RCA, postmortems).
Would be a plus:
- Experience with incident command/management (ServiceNow), ITSM and ITIL processes.
- Proactiveness and persistence in driving the team’s tasks to completion with stakeholders inside the company as well as with third-party vendors.
- Extreme ownership and knowledge sharing within the organisation.