DevOps/Infrastructure Engineer

Bulgaria, Poland, Portugal, Romania

Remote

Who we are:

Adaptiq is a technology hub specializing in building, scaling, and supporting R&D teams for high-end, fast-growing product companies in a wide range of industries.

About the Product:

Our client develops and operates a globally distributed connectivity platform and core network engineered for large-scale IoT and mobile services. The solution combines on-premises deployments with cloud-hosted components—including a connectivity management portal, charging systems, and multi-IMSI SIM integrations—to deliver seamless coverage and billing across dozens of international sites. The platform handles high volumes of signaling and user traffic, enforces regional compliance, and demands sub-second reliability for critical services. Maintaining network security, ensuring low-latency data flows, and scaling infrastructure to meet unpredictable demand are ongoing technical challenges. Senior engineers are essential to design resilient architectures, troubleshoot complex cross-domain issues, and drive continuous improvements in an environment where downtime is not an option.

About the Role:

This is a hands-on senior position on a compact infrastructure team, placing you in direct ownership of production-grade, hybrid (on-prem + cloud) systems. You will operate and evolve Kubernetes clusters, manage virtual servers, design Infrastructure as Code, and optimize database performance at scale. You will lead incident response efforts, perform deep network-level troubleshooting, and collaborate with development teams to improve reliability and security. The role offers genuine autonomy, responsibility for high-availability services, and the opportunity to influence technical strategy for a mission-critical, global connectivity platform.

Key Responsibilities:

Design, build and operate highly available, scalable infrastructure across both on-premises data centers and public cloud environments
Operate and maintain physical servers, including hardware replacements, NIC configurations, basic troubleshooting, and coordination with data center vendors
Own and maintain production Kubernetes clusters: lifecycle management (upgrades, scaling, troubleshooting), StatefulSets, persistent volumes, networking, certificates, and ingress components
Implement and maintain Infrastructure as Code using Terraform, Ansible, or similar tools to standardize deployments
Operate and troubleshoot production databases (PostgreSQL, MySQL, ClickHouse): backups, restores, migrations, and performance tuning
Design, deploy and maintain monitoring and observability solutions (Prometheus, Grafana, alerting rules, logging stacks)
Participate in 24/7 incident response: diagnose complex production issues, drive root cause analysis and postmortems
Collaborate with software teams on system design, deployment automation and reliability engineering initiatives
Contribute to security best practices: network segmentation, access control, infrastructure hardening
Participate in on-call rotation (1 week per month) to ensure continuous support of critical services

Required Competence and Skills:

4+ years of hands-on experience in DevOps, SRE or infrastructure engineering roles
Strong experience operating Linux-based production systems at scale
Proven hands-on Kubernetes experience in a production environment
Familiarity with Git-based workflows and scripting for automation (Bash, Python or similar)
Deep understanding of networking fundamentals: TCP/IP, DNS, routing, NAT, VPNs, firewalls and load balancing
Proven experience managing production databases: PostgreSQL, MySQL
Proficiency with Infrastructure as Code tools such as Terraform and Ansible
Experience with monitoring and observability stacks (Prometheus, Grafana, logging solutions)
Strong troubleshooting skills and the ability to work independently in high-pressure production scenarios