Multinational Company in Telecommunication Industry
Annual Package: Negotiable
Responsibilities
· Monitor system health, availability, and performance, and assist in identifying and troubleshooting infrastructure and application issues.
· Implement, configure, and maintain monitoring, logging, and alerting dashboards to ensure proactive detection of incidents.
· Participate in on-call rotations, respond to production incidents, and follow established incident management processes.
· Contribute to blameless postmortems by documenting root causes and recommending preventive and systemic improvements.
· Collaborate with development teams to support application deployments, patches, and configuration changes in production environments.
· Assist in capacity planning activities by analyzing usage trends and forecasting infrastructure needs.
· Automate repetitive operational tasks using scripting and infrastructure as code practices to improve reliability and efficiency.
· Design and maintain service level indicators and service level objectives to ensure system reliability and performance targets are met.
· Support containerized and cloud-native environments, including orchestration platforms and service mesh implementations.
· Continuously improve system reliability, scalability, and security by applying best practices in networking, load balancing, and infrastructure design.
Requirements
· Bachelor’s degree in Computer Science, Software Engineering, a related field, or equivalent practical experience.
· At least 2 years of experience in software development or site reliability engineering roles.
· Proficiency in at least one programming language such as C++, C#, Java, Python, or JavaScript.
· Strong scripting and automation skills using languages such as Python, Shell, or Go.
· Experience with infrastructure as code tools, preferably Terraform.
· Hands-on experience with container orchestration platforms such as Kubernetes and familiarity with service mesh concepts.
· Solid understanding of networking fundamentals, load balancing, Linux operating systems, and security best practices.
· Experience with observability tools, monitoring systems, and designing SLI/SLO frameworks.
· Familiarity with cloud platforms or virtualization technologies, and exposure to OpenStack environments is a plus.
· Strong communication, documentation, and problem-solving skills, with the ability to work independently and mentor junior engineers.