Telecommunications
Bulgaria, Poland, Portugal, Romania
Remote
DevOps/Infrastructure Engineer
Who we are:
Adaptiq is a technology hub specializing in building, scaling, and supporting R&D teams for high-end, fast-growing product companies in a wide range of industries.
About the Product:
Our client develops and operates a globally distributed connectivity platform and core network engineered for large-scale IoT and mobile services. The solution combines on-premises deployments with cloud-hosted components (including a connectivity management portal, charging systems, and multi-IMSI SIM integrations) to deliver seamless coverage and billing across dozens of international sites. The platform handles high volumes of signaling and user traffic, enforces regional compliance, and demands sub-second reliability for critical services. Maintaining network security, ensuring low-latency data flows, and scaling infrastructure to meet unpredictable demand are ongoing technical challenges. Senior engineers are essential to designing resilient architectures, troubleshooting complex cross-domain issues, and driving continuous improvements in an environment where downtime is not an option.
About the Role:
This is a hands-on senior position on a compact infrastructure team, placing you in direct ownership of production-grade, hybrid (on-prem + cloud) systems. You will operate and evolve Kubernetes clusters, manage virtual servers, design Infrastructure as Code, and optimize database performance at scale. You will lead incident response efforts, perform deep network-level troubleshooting, and collaborate with development teams to improve reliability and security. The role offers genuine autonomy, responsibility for high-availability services, and the opportunity to influence technical strategy for a mission-critical, global connectivity platform.
Key Responsibilities:
- Design, build and operate highly available, scalable infrastructure across both on-premises data centers and public cloud environments
- Operate and maintain physical servers, including hardware replacements, NIC configurations, basic troubleshooting, and coordination with data center vendors
- Own and maintain production Kubernetes clusters: lifecycle management (upgrades, scaling, troubleshooting), StatefulSets, persistent volumes, networking, certificates, and ingress components
- Implement and maintain Infrastructure as Code using Terraform, Ansible, or similar tools to standardize deployments
- Operate and troubleshoot production databases (PostgreSQL, MySQL, ClickHouse): backups, restores, migrations, and performance tuning
- Design, deploy and maintain monitoring and observability solutions (Prometheus, Grafana, alerting rules, logging stacks)
- Participate in 24/7 incident response: diagnose complex production issues, drive root cause analysis and postmortems
- Collaborate with software teams on system design, deployment automation and reliability engineering initiatives
- Contribute to security best practices: network segmentation, access control, infrastructure hardening
- Participate in on-call rotation (1 week per month) to ensure continuous support of critical services
Required Competence and Skills:
- 4+ years of hands-on experience in DevOps, SRE or infrastructure engineering roles
- Strong experience operating Linux-based production systems at scale
- Proven hands-on Kubernetes experience in a production environment
- Familiarity with Git-based workflows and scripting for automation (Bash, Python or similar)
- Deep understanding of networking fundamentals: TCP/IP, DNS, routing, NAT, VPNs, firewalls and load balancing
- Proven experience managing production databases: PostgreSQL, MySQL
- Proficiency with Infrastructure as Code tools such as Terraform and Ansible
- Experience with monitoring and observability stacks (Prometheus, Grafana, logging solutions)
- Strong troubleshooting skills with the ability to work independently under high-pressure production scenarios
Nice to Have:
- Experience with hybrid infrastructure spanning on-premises and cloud
- Experience with bare-metal or virtualization platforms
- Knowledge of Kafka or other distributed stream-processing systems
- Background in telecom, IoT or other high-availability environments
- Familiarity with repository managers such as Artifactory or Nexus
Why Us:
We provide 20 days of vacation leave per calendar year (plus the official national holidays of the country you are based in).
We provide full accounting and legal support in all countries where we operate.
We use a fully remote work model and provide a powerful workstation, plus a co-working space if you need one.
We offer a highly competitive package with yearly performance and compensation reviews.