High Performance Computing Software Engineer - Abu Dhabi - Institute of Foundation Models

    Institute of Foundation Models
    Description
    About the Institute of Foundation Models
    We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next generation of AI builders, and drive transformative contributions to a knowledge-driven economy.
    As part of our team, you'll have the opportunity to work on the core of cutting-edge foundation model training, alongside world-class researchers, data scientists, and engineers, tackling the most fundamental and impactful challenges in AI development. You will participate in the development of groundbreaking AI solutions that have the potential to reshape entire industries. Strategic and innovative problem-solving skills will be instrumental in establishing MBZUAI as a global hub for high-performance computing in deep learning, driving impactful discoveries that inspire the next generation of AI pioneers.
    The Role
    IFM is building the foundational compute infrastructure that will power tomorrow's breakthroughs in AI and computational science. We're looking for a High Performance Computing Software Engineer to help us design, develop, and operate the software systems that run our large-scale AI workloads.
    In this role, you'll work at the intersection of high-performance computing and machine learning. You'll be part of a team responsible for crafting the software stack that enables training of cutting-edge ML models—spanning 1000+ GPUs—and ensuring our infrastructure is robust, performant, and developer-friendly.

    Job Responsibilities

    • Design and implement high-performance, distributed software solutions for large-scale AI/ML training.
    • Optimize low-level system components, including the Linux kernel, GPU/accelerator kernels, and interconnects.
    • Develop and tune communication libraries such as NCCL, MPI, UCX, RCCL, and RDMA-based systems.
    • Partner with ML researchers and engineers to support frameworks like PyTorch, Megatron-LM, and DeepSpeed in large-scale production environments.
    • Contribute to our scheduling, orchestration, and job management systems, including Slurm and Kubernetes.
    • Debug and resolve complex issues across the stack—from kernel to container to model.
    • Work closely with hardware vendors, upstream open-source communities, and internal teams to drive performance and reliability improvements.

    Skills & Experience

    • Proven experience developing and optimizing software for large-scale ML workloads (GPUs preferred).
    • Deep understanding of Linux kernel internals and accelerator (GPU) kernel development.
    • Proficiency with distributed communication libraries (e.g., NCCL, RCCL, MPI, UCX, SHARP, Libfabric).
    • Experience with ML frameworks like PyTorch, TensorFlow, JAX, or Megatron-LM.
    • Strong knowledge of HPC job scheduling and orchestration tools (e.g., Slurm, Kubernetes, Pyxis).
    • Excellent debugging and systems performance tuning skills.
    • A collaborative mindset with a focus on shared success and technical excellence.

  • Star Services Abu Dhabi, United Arab Emirates

    Implement and optimize code in languages such as C, C++, Python, etc. Integrate software with hardware and perform functionality testing. · Degree in Software Engineering, Computer Science, or a related field. · Experience in embedded software development or industrial applications. ...

  • Only for registered members Abu Dhabi, Abu Dhabi Emirate

    This person will contribute to the development of all aspects of embedded software applicable to navigation software. · Responsible for designing, developing, and optimizing embedded software for advanced navigation technologies such as GPS, INS, and other sensor fusion-based solu ...

  • Only for registered members Abu Dhabi

    This person will contribute to the development of all aspects of embedded software applicable to navigation software. · Must be very proficient and independent at developing software in an embedded environment in languages such as C/C++. · ...

  • Only for registered members Abu Dhabi

    We are seeking a highly skilled Senior Fullstack Engineer to architect, develop, and maintain robust, scalable web applications that integrate advanced AI capabilities. This role involves close collaboration with product teams, AI researchers, designers, and backend specialists to deliv ...

  • Only for registered members Abu Dhabi, Abu Dhabi Emirate

    The Big Data Analytics Center (BIDAC) at the United Arab Emirates University (UAEU) is seeking a talented and motivated Software Developer to support the design, development and deployment of advanced AI-driven platforms and data applications. · Design, develop and maintain web a ...

  • Only for registered members Abu Dhabi

    Design and develop high-quality software applications by translating user requirements into functional code. · ...

  • Only for registered members Abu Dhabi, Abu Dhabi Emirate

    This is an on-site contract role for a Software Engineer - R&D Robotics based in Abu Dhabi. In this position, you will collaborate with cross-functional teams to develop, implement and optimize software systems used in robotics technology. · ...

  • Only for registered members Abu Dhabi

    This software engineer will own end-to-end delivery of product features from discovery to deployment and iteration. · ...

  • Only for registered members Abu Dhabi

    Create software solutions using K2 BPMS, maintain business analysis skills, customize and configure workflows. Design, Develop, Maintain: Create software solutions using K2 Business Process Management System (BPMS). · Business Analysis: Collaborate with stakeholders to gather and ...

  • Only for registered members Abu Dhabi, Abu Dhabi Emirate

    We are looking for a Software Engineer (.NET) – Contractor to develop, maintain, and optimize software solutions using Microsoft .NET technologies. · Bachelor's degree in Computer Science, IT, or a related field (preferred). · 5+ years of experience in software development with exper ...

  • Only for registered members Abu Dhabi

    We are seeking a highly skilled Senior Fullstack Engineer to architect, develop, and maintain robust, scalable web applications that integrate advanced AI capabilities. · ...

  • Only for registered members Abu Dhabi

    Design and develop high-quality software applications. · Collaborate with cross-functional teams to define and ship new features. · Conduct thorough testing and debugging of applications to ensure optimal performance. ...

  • VAMS Abu Dhabi, United Arab Emirates

    VAM Systems is a Business Consulting and IT Technology Solutions company with operations in the UAE, USA, Australia, Singapore, and India. We are looking for a Software Engineer (MicroStrategy) for our UAE operations with experience in MicroStrategy, Data warehousing, SQL Database, Oracle database, good ...

  • Only for registered members Abu Dhabi, Abu Dhabi Emirate

    We are seeking a highly skilled Senior Fullstack Engineer to architect, develop, and maintain robust, scalable web applications that integrate advanced AI capabilities. · Design, build, and maintain scalable fullstack applications with modern frameworks and cloud services. · ...

  • Only for registered members Abu Dhabi

    This person will contribute to the development of all aspects of embedded software applicable to navigation software. Responsible for designing, developing, and optimizing embedded software for advanced navigation technologies, such as GPS, INS, and other sensor fusion-based sol ...

  • Only for registered members Abu Dhabi

    The Engineer will report directly to the on-shore Software Engineering Manager for the assigned project within the UAE. · Contribute to breaking down the specification into development stories (under PO/Scrum Master supervision): clarify scop ...

  • Only for registered members Abu Dhabi

    The Big Data Analytics Center (BIDAC) at the United Arab Emirates University (UAEU) is seeking a talented and motivated Software Developer to support the design, development, and deployment of advanced AI-driven platforms and data applications. This position offers an exceptiona ...

  • Only for registered members Abu Dhabi

    A stealth start-up is looking for a full-stack software engineer to join the team and help build out an initial MVP for a FinTech product leveraging Digital Assets & Analytics. · ...

  • Only for registered members Abu Dhabi

    The Big Data Analytics Center (BIDAC) at the United Arab Emirates University (UAEU) is seeking a talented and motivated Software Developer to support the design, development, and deployment of advanced AI-driven platforms and data applications. · This position offers an excepti ...

  • Only for registered members Abu Dhabi

    We are seeking a highly skilled Senior Fullstack Engineer to architect, develop, and maintain robust, scalable web applications that integrate advanced AI capabilities. This role involves close collaboration with product teams, AI researchers, designers, and backend specialists to delive ...

  • Only for registered members Abu Dhabi

    We are seeking a Fullstack Software Engineer who prioritizes system reliability and product impact over specific framework loyalty. · We value engineers who operate with a high degree of autonomy, technical pragmatism, and a commitment to operational excellence. · We hire for ...
