SRE Engineer
Description
Nitka Technologies develops software for customers in the US and Europe, employing approximately 300 professionals across Eastern Europe, North and South America, Armenia, Georgia, and Kazakhstan. The company is currently supporting a U.S. bank holding company that provides retail banking, commercial banking, wealth management, and trust services.
Responsibilities
- Work with service teams to define practical SLIs and SLOs;
- Support SLO-based alerting and error-budget tracking;
- Analyze incidents, recurring failures, and reliability risks;
- Reduce alert noise and improve alert quality;
- Build automation for repetitive and low-risk operational tasks;
- Help improve deployment safety and production readiness;
- Participate in reliability improvement planning;
- Work with Azure, AKS, CI/CD, and infrastructure automation;
- Implement and improve monitoring, alerting, dashboards, and runbooks.
Requirements
- Strong production support or SRE experience;
- Hands-on experience with Prometheus, Grafana, CloudWatch, Splunk, Datadog, or similar tools;
- Experience with incident response, runbooks, and postmortems;
- Experience with Azure cloud infrastructure;
- Experience with Terraform or another Infrastructure as Code tool;
- Experience with CI/CD pipelines;
- Scripting experience with Python, Bash, or Go;
- Understanding of SLI/SLO concepts and reliability metrics;
- Hands-on experience with OpenTelemetry;
- Intermediate or higher level of spoken and written English.
Conditions
- Attractive USD compensation;
- Paid vacation and holidays.