Job Description
Are you a seasoned technical leader with a passion for operational excellence and system stability? Monee is seeking a highly skilled IT Delivery and Production Support (DPS) Lead to join our high-performing team supporting MariBank. In this pivotal role, you will act as the bridge between development, infrastructure, and business stakeholders, ensuring that our digital banking platforms operate at peak performance 24/7.
As the DPS Lead, you will spearhead incident management, oversee deployment pipelines, and drive continuous improvement initiatives to enhance system reliability. We are looking for a proactive leader who thrives in fast-paced fintech environments and is dedicated to mentoring engineering talent while maintaining rigorous production standards. If you have a deep understanding of cloud infrastructure, SRE principles, and delivery orchestration, we want to hear from you.
Responsibilities
- Lead and mentor the Production Support team to ensure high availability and stability of banking services.
- Manage end-to-end incident response, including root cause analysis (RCA) and post-incident reviews to prevent recurrence.
- Oversee the release management process, ensuring seamless deployments and minimal service disruption.
- Collaborate with cross-functional teams to monitor production environments and proactively address performance bottlenecks.
- Define and track Key Performance Indicators (KPIs) for system reliability, such as MTTR and incident frequency.
- Implement automation strategies to streamline manual support tasks and enhance operational efficiency.
- Serve as a primary point of escalation for critical production issues, ensuring effective stakeholder communication.
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- 5+ years of experience in Production Support, SRE, or DevOps roles, ideally within the financial services or banking sector.
- Strong technical proficiency in cloud platforms (AWS/GCP/Azure) and container orchestration tools such as Kubernetes.
- Expertise in monitoring and observability tools (e.g., ELK Stack, Prometheus, Grafana, Datadog).
- Proven experience in CI/CD pipeline management and scripting (Python, Shell, or Go).
- Excellent communication and stakeholder management skills, with the ability to translate technical issues for non-technical audiences.
- Strong analytical mindset with a passion for problem-solving and process optimization.