Site Reliability Engineer/System Administrator Jobs – ENGIE Energy Access
Job Title: Site Reliability Engineer/System Administrator
Organisation: ENGIE Energy Access
Duty Station: Kampala, Uganda
About Us:
ENGIE Energy Access is one of the leading Pay-As-You-Go (PAYGo) and mini-grid solutions providers in Africa, with a mission to deliver affordable, reliable and sustainable energy solutions and life-changing services with exceptional customer experience. The company is the result of the integration of Fenix International, ENGIE Mobisol and ENGIE PowerCorner, and develops innovative, off-grid solar solutions for homes, public services and businesses, giving customers and distribution partners access to clean, affordable energy. The PAYGo solar home systems are financed through affordable installments from $0.14 per day, and the mini-grids foster economic development by enabling productive use of electricity and creating business opportunities for entrepreneurs in rural communities. With over 1,700 employees, operations in 9 countries across Africa (Benin, Cote d’Ivoire, Kenya, Mozambique, Nigeria, Rwanda, Tanzania, Uganda and Zambia), over 1 million customers and more than 5 million lives impacted so far, ENGIE Energy Access aims to remain the leading clean energy company, serving millions of customers across Africa by 2025.
Job Summary: We are seeking a talented and experienced System Administrator/Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems and services. You will collaborate with cross-functional teams to implement and maintain robust infrastructure solutions, focusing on automation, monitoring, and incident response. The ideal candidate is passionate about optimizing and enhancing system reliability, possesses strong problem-solving skills, and is committed to driving excellence in operational practices.
Key Duties and Responsibilities:
Infrastructure Automation:
- Develop and maintain automation tools and scripts for provisioning, configuration, and deployment.
- Implement infrastructure as code (IaC) practices to ensure consistency and reproducibility.
Monitoring and Incident Response:
- Set up and maintain monitoring systems to detect and respond to performance issues and outages.
- Participate in on-call rotations, respond promptly to incidents, troubleshoot them, and implement solutions to prevent recurrence.
Performance Optimization:
- Optimize system performance through continuous analysis and tuning.
Reliability Engineering:
- Implement best practices for reliability, such as error budgeting, SLIs/SLOs, and blameless post-mortems.
- Work towards minimizing manual intervention through automation.
System Administration:
- Manage and maintain server infrastructure, including installation, configuration, and troubleshooting of operating systems.
- Implement and maintain security measures, such as firewalls and intrusion detection systems.
- Perform regular system backups and recovery procedures.
Collaboration and Communication:
- Collaborate with cross-functional teams to align infrastructure and operational requirements.
- Provide technical guidance and support to colleagues in areas related to reliability.
Qualifications, Skills and Experience:
- The applicant must hold a Bachelor’s degree in Computer Science, Information Technology, or a related field.
- Proven experience as a Site Reliability Engineer or System Administrator.
- Strong Linux and Bash scripting skills.
- Proficiency in cloud platforms (e.g., AWS, Azure, GCP, Linode, DigitalOcean).
- Experience with container and orchestration tools (e.g., Kubernetes, Docker, LXD).
- In-depth knowledge of networking, security, and system administration.
- Familiarity with infrastructure as code tools (e.g., Terraform, Ansible).
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
Preferred Qualifications:
- Experience with CI/CD pipelines and related tools.
- Knowledge of distributed systems and microservices architecture.
- Familiarity with observability tools (e.g., Prometheus, Grafana, ELK stack).
- Familiarity with programming languages (e.g., Python, Ruby).
How to Apply:
All suitably qualified and interested applicants should apply online at the link below.