Platform SRE and Reliability Engineer - Abu Dhabi, United Arab Emirates - Deeplight

    Deeplight
    Deeplight Abu Dhabi, United Arab Emirates

    1 day ago

    Description

    DeepLight AI is a specialist AI and data consultancy with extensive experience implementing intelligent enterprise systems across multiple industries, with particular depth in financial services and banking. Our team combines deep expertise in data science, statistical modeling, AI/ML technologies, workflow automation, and systems integration with a practical understanding of complex business operations.

    The Platform SRE and Reliability Engineer is responsible for ensuring the absolute quality, resilience, and performance of the Bank's next-generation AI and digital platforms. This role focuses on the high-stakes intersection of Site Reliability Engineering (SRE) and AI Quality Assurance, designing automated frameworks to validate everything from Conversational AI agents and RAG pipelines to core banking microservices. By implementing robust continuous testing pipelines and reliability governance, you will guarantee that the Bank's AI-driven experiences remain secure, scalable, and deterministically accurate under real-world conditions.

    As the Platform SRE & Reliability Engineer, your responsibilities include:

    • Building reusable automation frameworks to test the accuracy, stability, latency, and safety of Conversational AI platforms (voice and chat) and LLM-based agents.
    • Validating multi-agent orchestration, human-in-the-loop escalation logic, and the integrity of RAG pipelines and vector search results.
    • Testing AI/ML platform components for scaling behavior, failover resilience, high availability, and disaster recovery.
    • Integrating automated test pipelines into CI/CD workflows for MLOps, focusing on drift detection, retraining validation, and model registry integrity.
    • Verifying AI/ML pipelines on Azure AI Foundry and AWS SageMaker, ensuring data integrity across storage services (S3/Blobs) and serverless functions.
    • Conducting load testing for AI services and ensure engineering guardrails for fairness, explainability, and regulatory compliance are enforced.
    • Acting as a bridge between engineering and business, translating complex technical reliability requirements into actionable quality narratives.

    As an AI consultancy, our greatest asset is the expertise of our people.


    While technical mastery is the foundation of what we do, the ability to bridge the gap between complex data science and actionable business value is what defines your success with Deeplight.


    We're looking for individuals who are not only world-class in their fields of specialism, but also compelling communicators and persuasive advocates for their own skills.


    You will be the face of our firm, tasked with building trust, articulating the "why" behind your technical decisions, and effectively "selling" your vision to high-level stakeholders.


    If you thrive on the challenge of presenting cutting-edge solutions as much as you do on building them, you will fit right in.

    Requirements

    To be successful in this role, we need you to have:

    • A Bachelor's degree in Computer Science, AI, Software Engineering, or a related quantitative field. A Master's degree in AI/ML is highly preferred.
    • 5+ years in QA, Application Testing, or Reliability Engineering, ideally for a large-scale brand or digital-only bank.
    • Proven track record in deploying AI/ML QA solutions at an enterprise scale within the financial services sector.
    • Experience testing distributed architectures, microservices, and large-scale data platforms (Vector DBs, Data Lakes).
    • Expertise in Python-based automation frameworks and tools such as Selenium, Playwright, PyTest, JMeter, and Locust.
    • A deep understanding of LLM evaluation frameworks, prompt stability testing, and hallucination avoidance validation.
    • Hands-on experience testing and validating services across both Azure and AWS cloud environments.
    • Strong SQL/NoSQL validation skills (Postgres, MongoDB) and experience testing REST, GraphQL, and FastAPI integrations.
    • Be proficient in testing within Docker and Kubernetes (EKS/AKS) environments.


    It would be beneficial if you also had:

    • An ability to evaluate and adopt emerging QA tools for AI frameworks like LangChain, CrewAI, and Bedrock.
    • An understanding of cutting-edge quality trends, including multimodal QA and RLHF (Reinforcement Learning from Human Feedback) output evaluation.
    • A proactive approach to identifying edge cases in AI agents that could impact banking compliance or customer experience.
    • A strong ability to coordinate with different functional teams to implement models and monitor outcomes.
    Benefits

    Benefits & Growth Opportunities:

    • Competitive salary.
    • Visa Sponsorship for the successful individual.
    • Comprehensive health insurance for the successful individual.
    • Professional development and certification support.
    • Opportunity to work on cutting-edge AI projects.
    • Career advancement opportunities in a rapidly growing AI company.

    This position offers a unique opportunity to shape the future of AI implementation while working with a talented team of professionals at the forefront of technological innovation. The successful candidate will play a crucial role in driving our company's success in delivering transformative AI solutions to our clients.

    At DeepLight AI, we recognise that diversity drives innovation. We are committed to fostering an inclusive environment where individuals with different thinking styles can thrive and contribute their unique strengths to our specialised AI and data solutions.

    Our goal is to ensure our application and interview process is accessible, predictable, and fair for all candidates.

    If you require any specific adjustments to the application process, or if you require any reasonable adjustments should you be successful in being processed to the interview stage, please do let us know. This information will be kept strictly confidential and will not impact hiring decisions.


  • Work in company

    Engineer (Reliability)

    Only for registered members

    KBR is looking for Engineers to support the maintenance of crude flexibility project critical equipment at Adnoc Refining, Ruwais. · ...

    Abu Dhabi, Abu Dhabi Emirate

    1 month ago

  • Work in company

    Engineer (Reliability)

    Only for registered members

    KBR is looking for Engineers to support the maintenance of crude flexibility project critical equipment at Adnoc Refining, · Ruwais, · Abu Dhabi.Bachelors or masters degree in mechanical, Electrical, Control, Corrosion · ,or Metallurgy Engineering. · CERTIFIED RELIABILITY ENGINEE ...

    Abu Dhabi

    1 month ago

  • Work in company

    Engineer Reliability

    Only for registered members

    KBR is looking for Engineers to support the maintenance of crude flexibility project critical equipment at Adnoc Refining, Ruwais. · Designing, developing, testing, and maintaining systems. · ...

    Abu Dhabi

    1 month ago

  • Work in company

    Reliability and Maintenance Engineer

    Only for registered members

    The RME Coordinator serves as a vital member of the Engineering RME function, · working with the RMEAM , RME AE's, RMEP and directly under the guidance of · the RMEM. · ...

    Abu Dhabi, Abu Dhabi Emirate

    1 month ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    ++We are looking for an experienced Site Reliability Engineer (SRE) to help design operate and continuously improve highly reliable scalable and cost-efficient systems.+ · + ...

    Abu Dhabi, Abu Dhabi Emirate

    3 weeks ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    +Job summary · We are partnering with a leading UAE-based technology organisation delivering mission-critical national platforms to recruit an experienced Site Reliability Engineer (SRE).+ · +Designing and operating Azure infrastructure (AKS, Blob, secure integrations) · Managing ...

    Abu Dhabi, Abu Dhabi Emirate

    1 week ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    Orbitworks is revolutionizing access to space by building reliable, shareable satellites that drastically reduce the time and complexity traditionally required to get to orbit. · We operate satellites, fly customer payloads, and handle entire missions from end-to-end. Orbitworks ...

    Abu Dhabi

    1 month ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    We are partnering with a leading UAE-based technology organisation delivering mission-critical national platforms to recruit an experienced Site Reliability Engineer (SRE). This is not a standard cloud role. · ...

    Abu Dhabi

    1 week ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    We are looking for an experienced Site Reliability Engineer (SRE) to help design operate and continuously improve highly reliable scalable and cost-efficient systems. · ...

    Abu Dhabi

    3 weeks ago

  • Work in company

    Specialist Engineer Reliability

    Only for registered members

    Expert in a specific area of engineering, often with extensive experience and knowledge in that field. Ability to interpret asset performance data and present findings to stakeholders. · KBR is looking for Specialist Engineers to support the SMS Reliability Analysis Services Proj ...

    Abu Dhabi

    1 week ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    We're looking for a talented Site Reliability Engineer (SRE) to keep our systems running smoothly, reliably, · and at scale. · ...

    Abu Dhabi

    1 month ago

  • Work in company

    Senior Reliability Engineer

    Only for registered members

    We are seeking a Senior Reliability Engineer to join our team. The successful candidate will have a strong background in reliability engineering and be able to apply their skills to diverse industries and client organizations. · ​Applying wide variety of reliability techniques ac ...

    Abu Dhabi

    1 month ago

  • Work in company

    Reliability and Maintenance Engineer

    Only for registered members

    Experience in material handling systems (MHS) installation, operation and maintenance. · ...

    Abu Dhabi

    1 month ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    ++Orbitworks is revolutionizing access to space by building reliable, shareable satellites that drastically reduce the time and complexity traditionally required to get to orbit. · +As a Senior Site Reliability Engineer on our Infrastructure Team, you'll play a pivotal role in ma ...

    Abu Dhabi Full time

    1 month ago

  • Work in company

    Reliability and Maintenance Engineer

    Only for registered members

    The RME Coordinator serves as a vital member of the Engineering RME function, · working with the RMEAM , RME AE's, RMEP and directly under the guidance of the RMEM. · Serve as a champion for safety, · ensuring all operations meet or exceed safety standards and regulations. · ...

    Abu Dhabi

    1 month ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    We're expanding our engineering operations in the United Arab Emirates and are looking for a Senior Technical Site Engineer to take a key role in supporting mission-critical drone fleet management and UAS traffic management systems. · This is a high-impact, hands-on technical rol ...

    Abu Dhabi

    3 weeks ago

  • Work in company

    Reliability and Maintenance Engineer

    Only for registered members

    The RME Coordinator serves as a vital member of the Engineering RME function working with the RMEAM , RMEM under guidance of RMEM.This role is strategically positioned to drive management continuous improvement of Amazon Robotics reliability through disciplined application applic ...

    Abu Dhabi Full time

    1 month ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    +Job summary · The Site Reliability Engineer – L2 is responsible for supporting and maintaining Open Innovation AI Products and deployments across customer environments, including secure and isolated on-premises infrastructures.This role requires strong troubleshooting skills acr ...

    Abu Dhabi

    1 week ago

  • Work in company

    Senior Reliability Engineer

    Only for registered members

    Join our Team to be part of a successful team in Industrial Energy Technology (IET) part of Baker Hughes Company. As a Senior Reliability Engineer you will apply reliability techniques across diverse industries and client organizations. · Contemporary work-life balance policies a ...

    Abu Dhabi

    1 month ago

  • Work in company

    Reliability and Maintenance Engineer

    Only for registered members

    The RME Coordinator serves as a vital member of the Engineering RME function. · ...

    Abu Dhabi

    1 month ago

  • Work in company

    Site Reliability Engineer

    Only for registered members

    About the Role: Orbitworks is revolutionizing access to space by building reliable, shareable satellites that drastically reduce the time and complexity traditionally required to get to orbit. · Collaborate with developers, test engineers and satellite operators to foster a stron ...

    Abu Dhabi

    1 month ago

Jobs
>
Abu Dhabi