About the position
Exciting Opportunity: Site Reliability Engineer (Advanced) – Industrial IoT & Edge Ecosystem
Are you a skilled Site Reliability Engineer passionate about cloud infrastructure, containerization, and automation? Our client in the motor industry is looking for an advanced SRE to join its international DevOps teams working on cutting-edge Industrial IoT and Edge solutions for global production systems.
About the Role:
You will be part of an interdisciplinary team delivering platform solutions for smart factory wearables and production-critical cloud connections. Your work will directly impact industrial IoT self-service, enabling innovative, reliable, and scalable systems across the globe.
Requirements
What You’ll Do:
- Design, implement, and maintain scalable cloud infrastructure (Azure preferred).
- Manage and optimize Kubernetes clusters and containerized environments.
- Set up monitoring and alerts, and troubleshoot system performance issues.
- Participate in incident response and contribute to root cause analysis.
- Collaborate with development teams to improve application reliability and performance.
- Develop and maintain automation scripts and Infrastructure as Code (IaC) practices.
- Support security, compliance, and documentation initiatives.
- Provide on-call support for the edge platform in a DevOps environment.
Essential Skills:
- Docker & Container Orchestration (Docker Compose)
- Python
- Linux (Ubuntu preferred)
Advantageous Skills:
- Bash, Go, C#, Jinja2, PyTest
- Networking, GitHub Workflows, Azure Cloud & VMs
- Kubernetes, AKS, Helm, Kustomize
- Kusto Query Language (KQL)
Experience & Qualifications:
- 3+ years of hands-on experience with Docker, Python, and Linux
- Proven experience in container orchestration platforms
- Strong problem-solving and collaboration skills in Agile environments
Desired Skills:
- Docker
- Python
- Linux
- Container Orchestration
- AWM
Desired Qualification Level:
About The Employer: