We are looking for a Senior Platform / DevOps Engineer to join our existing Platform team. We run a modern AWS + Kubernetes platform, but we also integrate with traditional / on-prem infrastructure.
You will help design, build and operate the core platform used by our product teams: Kubernetes clusters, networking, observability, GitOps, and infrastructure as code. You will work closely with other platform engineers and the Lead DevOps Engineer, own specific areas and drive improvements end-to-end.
This role is fully remote within Germany and requires close collaboration with engineers across multiple teams.
In addition, you'll be responsible for the following tasks:
Platform engineering
Design, build and operate our AWS- and Kubernetes-based platform.
Own one or more areas (e.g. Kubernetes, observability, networking, IaC) and act as the go-to person for those topics in the team.
Cloud & Kubernetes
Operate production AWS environments (multi-account, multi-environment).
Operate Kubernetes clusters (upgrades, capacity, reliability, security).
GitOps & delivery
Design and maintain Argo CD–based GitOps workflows (multi-cluster, multi-env).
Contribute to CI/CD patterns and deployment strategies together with the team.
Observability
Evolve and operate our observability stack:
Metrics: Prometheus, Thanos
Logs: Grafana Loki
Traces: Grafana Tempo
Instrumentation: OpenTelemetry
Help define SLOs, dashboards and alerting that are actually useful for teams.
Networking & connectivity
Work on Kubernetes networking, Ingress controllers and traffic routing.
Contribute to designs using Envoy (or Envoy-based components), API gateways and SD-VPN solutions.
Infrastructure as Code
Build and maintain Terraform modules and state layouts for AWS, Kubernetes and related infrastructure.
Help migrate existing/manual infrastructure to Terraform with minimal disruption.
(Bonus) Use Crossplane where it makes sense for Kubernetes-driven provisioning.
Hybrid & on-prem integration
Support connectivity and integration between cloud workloads and on-prem systems.
Collaboration
Work as part of the existing Platform team – participate in design reviews, incident reviews and on-call.
Support product teams as internal customers by providing clear platform interfaces, documentation and examples.
We are searching for a motivated candidate with the following qualifications:
10+ years of experience in infrastructure / operations / platform / DevOps roles.
Strong experience with AWS in production:
Multi-account setups (e.g. separation of dev / test / prod).
Networking (VPC, subnets, routing, security groups, load balancers).
Landing Zones.
IAM and security best practices.
Strong experience with Kubernetes in production:
Operating clusters (EKS or similar).
Workloads, deployments, autoscaling, upgrades and cluster lifecycle.
Solid observability experience with several of the following:
Prometheus, Thanos, Grafana.
Grafana Loki, Grafana Tempo or similar logging/tracing systems.
OpenTelemetry for metrics/logs/traces instrumentation and pipelines.
Solid networking knowledge:
TCP/IP, DNS, TLS, HTTP, routing, VPNs, firewalls, load balancing.
Experience with Envoy (directly or via service mesh / API gateway), Ingress controllers and API gateways.
Experience with SD-VPN solutions (e.g. AWS VPN, Tailscale or similar).
Strong experience with Terraform and Terragrunt:
Modular design, state management, remote backends.
Managing AWS/Kubernetes infrastructure as code at team or org scale.
Hands-on experience with Argo CD and GitOps principles:
Application structure, app-of-apps patterns, promotion across environments, RBAC and ABAC.
Experience with traditional/on-prem infrastructure (VMs, networks, VPNs, storage, legacy systems) and connecting it to cloud environments.
Comfortable working in a remote-first environment in European time zones (good written communication, async collaboration).
Nice to have:
Experience with Crossplane in production.
Experience in a Platform/Enablement team serving multiple product teams.
Experience with Argo Workflows or similar workflow/orchestration tools.
Experience with AWS Control Tower and Account Factory.
German language skills (B2).