Job Description
We are looking for a motivated Zabbix Support Engineer to operate, maintain, and evolve the enterprise Zabbix Monitoring Platform deployed across cloud infrastructure (AWS / Azure). The engineer will be responsible for day-to-day monitoring operations, request handling, and platform maintenance, while ensuring stability, automation, and continuous service improvement within a 24×7 managed service environment.
Responsibilities
- Monitor, manage, and troubleshoot Zabbix servers, proxies, and agents across cloud and hybrid environments, ensuring platform stability and performance.
- Handle alerts, incidents, and service requests; perform configuration changes, alert tuning, and device onboarding in line with SLA targets.
- Support patching, version upgrades, and configuration management of Zabbix; maintain HA setup and agent lifecycle in collaboration with infrastructure teams.
- Develop scripts (Python/Shell) and lightweight tools to automate monitoring tasks; create and enhance Zabbix templates for new services and cloud components.
- Integrate Zabbix with AWS/Azure, CI/CD pipelines, and external tools (Grafana, ServiceNow, Jira) to enable unified, proactive monitoring.
- Ensure secure access, system hardening, and adherence to organizational security policies while operating via approved VDI environments.
Requirements
- 3–5 years of hands-on experience with Zabbix administration and monitoring operations.
- Strong knowledge of Linux (RHEL / CentOS) environments — installation, tuning, and troubleshooting.
- Familiarity with cloud infrastructure monitoring (AWS / Azure) and hybrid setups.
- Working understanding of network protocols (SNMP, ICMP, HTTP, TCP/IP).
- Scripting ability in Python, Bash, or PowerShell for automation and alert enrichment.
- Basic understanding of ITIL processes (Incident, Problem, Change).
- Experience integrating Zabbix with ITSM tools (ServiceNow, Jira) and visualization tools (Grafana).
- Strong analytical and problem-solving capability.
- Zabbix Certified Specialist / Professional certification.
- Red Hat Certified Engineer (RHCE) or RHCSA.
- Exposure to containerized environments (Docker, Kubernetes) and CI/CD pipeline monitoring.
- Experience in monitoring automation using REST APIs or webhook-based integrations.
- Excellent coordination and communication with onsite/offshore stakeholders.
- Capable of working in a global, multi-vendor support model.
- Commitment to continuous improvement, process discipline, and proactive monitoring.
- Ability to work in rotational shifts (16×5 / 24×7) with focus on service continuity.