(Summary generated by AI based on the full job description)
The project focuses on monitoring and maintaining reliability of an infrastructure platform supporting AI services, Java APIs, and frontend applications. Key technologies include Kubernetes (AKS), Terraform, Azure (ACR, Key Vault, Virtual Networks), Prometheus, Grafana, GitHub Actions, ArgoCD. Main responsibilities cover defining and maintaining SLO/SLI, incident response, automation, Kubernetes infrastructure management, and development of observability and CI/CD tools. The project emphasizes toil reduction and production environment stability.

Optional



By clicking "Aplikuj" you confirm that you've read and accepted our Terms and Conditions.
This is how the employer processes your data
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Need more information?