Senior DevOps Engineer

Ukraine · Full-time · Senior

About The Company

MWDN is a global IT outstaffing company with 23+ years of experience that connects exceptional tech talent with leading companies across Israel, the USA, Great Britain, and Western Europe. We offer opportunities to work on international products in a stable and professional environment.

Why does MWDN rock?

Here’s what you can expect when you join MWDN:

  • Security: We carefully vet our clients to minimize risks and ensure reliability and timely payments, with no fraud or unpleasant surprises.
  • Career support: If a project isn’t the right fit, we support you and actively help find new opportunities that match your skills and career goals.
  • Legal assistance: We provide guidance on legal matters, including opening and managing your independent contractor or sole proprietorship status, taxes, and related processes.
  • Professional development: We offer English courses and professional growth opportunities, as well as team-building events.

Why choose us? MWDN is ranked among the top 5 IT employers in our region according to DOU. We take pride in our transparency and strong commitment to our team. Curious to learn more? See what our employees say about working with us on DOU.

What is your new project?

Domain: AI Infrastructure / Real-Time Data Processing

Client Location: Israel

Company size: 10 - 51

The client is a fast-growing product company building next-generation infrastructure for production AI systems. The team is focused on high-performance, low-latency distributed technologies that operate directly within real-time data flows and AI workloads. The product addresses complex engineering challenges related to runtime reliability, distributed processing, networking, observability, and large-scale system performance.

What makes this project exciting?

The company builds deterministic, low-latency real-time data infrastructure for production-grade AI systems. Its platform enables scalable and reliable data pipelines designed to power mission-critical AI workloads. The team is now building a cloud-native SaaS platform that manages and orchestrates software components running inside customer environments.

You will be responsible for developing and maintaining cloud infrastructure, as well as automating CI/CD processes, observability, and platform security. You will also help scale the system and improve the reliability and performance of services while working closely with engineering teams.

What makes you a great fit

  • 5+ years of experience in DevOps, SRE, Platform Engineering, or Infrastructure Engineering roles.
  • Strong Linux system administration skills.
  • Hands-on experience operating production workloads in AWS.
  • Experience with Kubernetes in production environments.
  • Strong experience building and maintaining CI/CD pipelines (GitHub Actions preferred).
  • Experience writing automation and operational tooling using Python and Bash.
  • Experience with networking, TLS, DNS, routing, and secure connectivity.
  • Experience troubleshooting distributed systems and production incidents.
  • Familiarity with observability concepts including metrics, logs, tracing, and alerting.
  • Strong ownership mindset and ability to work independently.
  • Experience with Infrastructure as Code (Terraform preferred).

Nice to Have

  • Experience with Prometheus, Grafana, OpenTelemetry, Tempo, Mimir, Loki, or similar observability platforms.
  • Experience with PKI, certificate management, and mTLS environments.
  • Experience operating PostgreSQL, Kafka, Redis, or other distributed infrastructure components.
  • Experience supporting SaaS platforms and customer-deployed software.
  • Familiarity with security frameworks and compliance initiatives such as SOC2.
  • Experience working in startup or high-growth engineering environments.

Your day-to-day in this position

Cloud Infrastructure & Operations

  • Operate and expand our AWS-based infrastructure.
  • Manage Kubernetes environments supporting production and development workloads.
  • Improve infrastructure reliability, scalability, and operational efficiency.
  • Support networking, access control, and secure service communication.

CI/CD & Developer Platform

  • Design, maintain, and improve GitHub Actions CI/CD pipelines.
  • Manage self-hosted runners and deployment automation workflows.
  • Improve developer productivity through automation and internal tooling.
  • Support release management and deployment processes.

Infrastructure as Code & Automation

  • Build and maintain infrastructure using Infrastructure-as-Code principles.
  • Develop automation tooling using Python, Bash, and related technologies.
  • Reduce operational overhead through automation-first solutions.
  • Improve environment provisioning and operational consistency.

Observability & Reliability

  • Operate and evolve centralized observability platforms.
  • Build and maintain monitoring, alerting, logging, and tracing systems.
  • Define operational metrics and reliability standards.
  • Participate in incident response, troubleshooting, and root-cause analysis.

Security & Platform Engineering

  • Support secure machine identity, TLS/mTLS, certificate lifecycle management, and secrets handling.
  • Assist in implementing security controls and compliance initiatives.
  • Improve platform security posture across cloud and runtime environments.
  • Collaborate with engineering teams on secure-by-design architectures.

Why work with us?

  • People-first management with minimal bureaucracy
  • A friendly company culture, proven by employees who choose to return
  • Flexible working hours
  • 29 days of PTO (18 working days per year plus all national holidays)
  • 10 paid recovery days
  • Full financial and legal support for independent contractors
  • Free English classes, with native speakers or Ukrainian teachers
  • Dedicated HR support

Our next steps

✅ Intro call with a Recruiter — ✅ Technical Interview — ✅ Interview with CTO and CEO — ✅ Offer