PhD in Computational Biology - MBZUAI PhD in Computational Biology
Apply Now More Information
img

Doctor of Philosophy in

Computational Biology

Overview

The goal of the Doctor of Philosophy in Computational Biology is to prepare students to become leading scientists in academia and industry, educating them to be highly competent in their chosen area of research while also providing them with a broad knowledge foundation in bioinformatics and computational biology.

Upon graduation students will have independently planned and conducted computational research in their chosen area, and will be able to conduct original interdisciplinary research across the life sciences. The program aims to be a collaborative hub for researchers and practitioners to drive global research and ethical and responsible innovation in the UAE and on a global scale.

  • icon Mode: Full-time
  • icon 60 credits
  • icon Location: On campus

The Computational Biology Department at MBZUAI sits at the cutting edge of biology, medicine, and AI. Our faculty lead efforts to model complex biological systems and extract insights from large-scale molecular data. Flagship initiatives like the Human Phenotype Project and analysis of the Emirati Genome Project provide rich, multi-omic datasets that fuel research in systems biology, genetics, biomarker discovery, and AI-driven disease prediction. Through this work, we aim to transform precision medicine and prepare the next generation of scientists to lead in data-driven healthcare innovation.

Eran Segal

Department Chair and Professor of Computational Biology

Read Bio

Meet the faculty

Aziz Khan

Aziz Khan

Assistant Professor of Computational Biology

Eduardo Beltrame

Eduardo Beltrame

Assistant Professor of Computational Biology

Imran Razzak

Imran Razzak

Associate Professor of Computational Biology

Natasa Przulj

Natasa Przulj

Professor of Computational Biology

Yu Li

Yu Li

Affiliated Assistant Professor of Computational Biology

National Qualifications Framework – three strands

The program learning outcomes (PLOs) are aligned with the Emirates Qualifications Framework and, as such, are divided into the following learning outcomes strands: knowledge (K), skills (S), and responsibility (R).

Upon completion of the program requirements, graduates will be able to:

  • PLO 01: Design scientific experiments, especially those related to biological systems, using the scientific method, while functioning as a team leader managing research projects of varying complexity.
  • PLO 02:Critically evaluate and understand scientific literature across computational biology, life sciences, and engineering.
  • PLO 03:Elaborate complete research manuscripts that demonstrate expert self-evaluation and advanced skills in communicating highly complex ideas.
  • PLO 04: Develop, analyze, improve and apply computational tools and technologies including artificial intelligence to solving problems of varying complexity.
  • PLO 05:Utilize methods and tools to integrate, analyze and interpret biological data from various sources, with respect to ethical norms and personal data privacy.

The PLOs are mapped to the National Qualifications Framework Level Eight (8) qualification and categorized into three domains (knowledge, skill, and responsibility) as per the National Qualifications Framework set by the UAE National Qualifications Centre (NQC) and the Ministry of Higher Education and Scientific Research (MoHESR):

PLOs Knowledge Skill Responsibility
PLO 01 K S R
PLO 02 K R
PLO 03 K S R
PLO 04 K S
PLO 05 K S R

The minimum degree requirements for the Doctor of Philosophy in Computational Biology program is 60 credits, distributed as follows:

Number of courses Credit hours
Core 4 16
Electives 2 8
Internship At least one internship of up to six weeks duration must be satisfactorily completed as a graduation requirement 2
Advanced research methods 1 2
Research thesis 1 32

The Doctor of Philosophy in Computational Biology program is primarily a research-based degree. The purpose of coursework is to equip students with the right skill set, so they can successfully accomplish their research project (thesis).

Code Course title Credit hours
CB803 Introduction to Single Cell Biology and Bioinformatics

Course description: This course provides a broad overview of bioinformatics for single cell omics technologies, a new and fast-growing family of biological assays that enables measuring the molecular contents of individual cells with very high resolution and is key to advancing precision medicine. The course starts with an accessible introduction to basic molecular biology: the cell structure, the central dogma of molecular biology, the flow of biological information in the cell, the different types of molecules in the cell, and how we can measure them. This course then introduces students to the diverse landscape of biological data, including its types and characteristics and explores the foundational principles of single-cell omics bioinformatics, encompassing key methodologies, tools, and computational workflows, with an emphasis on the development of foundation models for single cell omics data.

4
CB804 Introduction to Molecular Biology for Machine Learning

Course description: This interdisciplinary course is designed for students and professionals with a background in machine learning who seek to understand the fundamentals of molecular biology. The course explores key biological concepts and their applications to machine learning, particularly in the fields of bioinformatics, genomics, and computational biology. By bridging the gap between molecular biology and machine learning, students will gain the knowledge to apply machine learning techniques to biological data and solve complex problems in medicine, genetics, and drug discovery.

4
CB805 Analyzing Multi-Omics Network Data in Biology and Medicine

Course description: Computational biology has become an important discipline in the intersection of computing, mathematics, biology and medicine. Biological data sets produced by modern biotechnologies are large and hence they can only be understood by using mathematical modeling and computational techniques. Starting from analysis of genetic sequences, the field has progressed towards analysis and modeling of entire biological systems. A way of abstracting the vast amount of biomedical information is by modeling and analyzing these data by using networks (or graphs). Such approaches have been used to model phenomena in other research domains, apart from computational and systems biology and medicine.

4
CB806 Computational Genomics and Epigenomics

Course description: This interdisciplinary graduate course introduces students to the core concepts, tools, and emerging methods in computational genomics and epigenomics. It is designed for students from biology, computer science, machine learning, and related fields who are interested in applying their skills to cutting-edge questions in biology and medicine. Students will explore how genomic and epigenomic data, such as DNA, RNA, chromatin accessibility, and methylation, are generated and analyzed using statistical and machine learning techniques. The course covers foundational topics such as genome organization, gene regulation, and high-throughput sequencing technologies. It extends to advanced applications including multi-omic data integration, deep learning for genomics, and cancer omics.
Course description: This course examines the intersection of artificial intelligence (AI) and computer systems, focusing on how systems enable efficient AI workloads and how AI techniques enhance system performance. The course covers fundamental principles and emerging trends in system design, hardware-software co-design, resource management, and AI-driven system optimization. Through lectures, hands-on labs, and research projects, students will explore techniques for scaling AI applications, improving system efficiency, and integrating AI into traditional computing environments. By engaging with real-world case studies and cutting-edge frameworks, students will develop the skills needed to contribute to the rapidly evolving field of AI systems in both academia and industry. 

4

Students will select a minimum of two elective courses, with a total of eight (or more) credit hours. One must be selected from a list based on interest, proposed research thesis, and career aspirations, in consultation with their supervisory panel. The elective courses available for the Doctor of Philosophy in Computational Biology are listed in the tables below:

Code Course title Credit hours
ML801 Foundations and Advanced Topics in Machine Learning

This course provides the students a comprehensive introduction to artificial intelligence. It builds upon fundamental concepts in machine learning. Students will learn about supervised and unsupervised learning, various learning algorithms, and the basics of the neural network, deep learning, and reinforcement learning.

4
ML802 Advanced Machine Learning

This course provides a comprehensive overview of different concepts and methods related to deep learning. Students will first learn the foundations of deep learning, after which they will be introduced to a series of deep models: convolutional neural networks, autoencoders, recurrent neural network, and deep generative models. Students will work on case studies of deep learning in different fields such as computer vision, medical imaging, natural language processing, etc.

4
ML803 Advanced Probabilistic and Statistical Inference

This course provides a comprehensive introduction to the basics of human visual system and color perception, image acquisition and processing, linear and nonlinear image filtering, image features description and extraction, classification, and segmentation strategies. Moreover, students will be introduced to quality assessment methodologies for computer vision and image processing algorithms.
Prerequisite courses:  CS7101 Algorithms and Data Structures 

Course description: This course discusses the advanced concepts in operating system design and implementation. The operating system provides a convenient and efficient interface between user programs and the hardware of the computer on which they run. 

4
ML804 Advanced Topics in Continuous Optimization

The course provides a comprehensive introduction to the concepts, principles and methods of geometry-aware computer vision which helps in describing the shape and structure of the world. In particular, the objective of the course is to introduce the formal tools and techniques that are necessary for estimating depth, motion, disparity, volume, pose and shapes in 3D scenes.

4
ML806 Advanced Topics in Reinforcement Learning

This course provides a comprehensive overview of different concepts and methods related to visual object recognition and detection. In particular, the students will learn a large family of successful and recent state-of-the-art architectures of deep neural networks to solve the tasks of visual recognition, detection, and tracking.

4
ML807 Federated Learning

This course is an introductory course on data mining, which is the process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.
Course description: This course provides an overview of foundational principles and contemporary topics in information security. Students will examine system protection strategies, structural security frameworks, software resilience, and detection of security threats. The course integrates theoretical concepts with practical applications to enhance the understanding of securing complex information systems. 

4
ML808 Advanced Topics in Causality and Machine Learning

This course is an introductory course on big data processing, which is the process of analyzing and utilizing big data. The course involves methods at the intersection of parallel computing, machine learning, statistics, database systems, etc.

4
ML812 Advanced Topics in Algorithms for Big Data

This course is an introductory course on Information Retrieval (IR). The explosive growth of available digital information (e.g., Web pages, emails, news, Tweets, Wikipedia pages) demands intelligent information agents that can sift through all available information and find out the most valuable and relevant information. Web search engines, such as Google and Bing, are several examples of such tools. This course studies the basic principles and practical algorithms used for information retrieval and text mining. It will cover algorithms, design, and implementation of modern information retrieval systems. Topics include: retrieval system design and implementation, text analysis techniques, retrieval models (e.g., Boolean, vector space, probabilistic, and learning-based methods), search evaluation, retrieval feedback, search log mining, and applications in web information management.

4
NLP801 Deep Learning for Language Processing

This course covers the fundamentals of theoretical statistics, which are the foundation for the analysis of the properties of machine learning algorithms. Covered topics include statistical models, statistical inference, maximum likelihood estimation, optimal hypothesis testing, decision theory and Bayesian inference, non-parametric statistics, and Bootstrap, (generalized) linear model and high dimensional statistics. All necessary tools from Probability theory: deviation inequalities, type of convergence, law of large numbers, central limit theorem, properties of the Gaussian distribution (etc) will be introduced whenever needed and their proofs given at the end of each chapter.

2
NLP802 Current Topics in Natural Language Processing

This course provides a graduate-level introduction to the principles and methods of medical imaging, with thorough grounding in the physics of the imaging problems. This course covers the fundamentals of X-ray, CT, MRI, ultrasound, and PET imaging. In addition, the course provides an overview of 3D geometry of medical images and a few classical problems in medical images analysis including classification, segmentation, registration, quantification, reconstruction and radiomics.

4
NLP803 Advanced Speech Processing

Assumed knowledge: Basic concepts in linear algebra, calculus, probability, and statistics. Programming in Python or similar language. 

Course description: This course provides a comprehensive introduction to natural language processing (NLP). It builds upon fundamental concepts in mathematics, specifically probability and statistics, linear algebra, and calculus, and assumes familiarity with programming. 

4
NLP804 Deep Learning for Natural Language Generation

This course provides a comprehensive introduction to machine learning. It builds upon fundamental concepts in mathematics, specifically probability and statistics, linear algebra, and calculus. Students will learn about supervised and unsupervised learning, various learning algorithms, and basics of learning theory, graphical models, and reinforcement learning.

4

Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one year.

Code Course title Credit hours
CS899 PhD Research Thesis

Course description: Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one (1) year. Master’s thesis research helps train graduates to pursue more advanced research in their Ph.D. degree. Further, it enables graduates to independently pursue an industrial project involving research component independently.

12

The MBZUAl internship with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning.

Code Course title Credit hours
INT899 Internship

Assumed knowledge: Prior to undertaking an internship, students must have successfully completed 24 credit hours. 

Course description: The MBZUAI Internship (up to six weeks) with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning. 

2

MBZUAI accepts applicants who hold a completed degree in a STEM field such as computer science, electrical engineering, computer engineering, mathematics, physics, or other relevant science or engineering major that demonstrates academic distinction in a discipline appropriate for the doctoral degree – either: 

  • Bachelor’s degree with a minimum CGPA of 3.5 (on a 4.0 scale) or equivalent, or 
  • Master’s degree with a minimum CGPA of 3.0 (on a 4.0 scale) or equivalent.  

Applicants must provide their completed degree certificates and official transcripts when submitting their application. Senior-level students can apply initially with a copy of their official transcript and expected graduation letter and upon admission must submit the official completed degree certificate and transcript. A degree attestation from UAE Ministry of Higher Education and Scientific Research (for degrees from the UAE) or Certificate of Recognition (for degrees acquired outside the UAE) should also be furnished within students’ first semester at MBZUAI. 

All applicants whose first language is not English must demonstrate proficiency in English through one of the following:

IELTS Academic: Minimum overall score of 6.5

TOEFL iBT: Minimum score of 90 

*Exams must be administered at an approved physical test center. Home Edition exams are not accepted. 

English language proficiency waiver eligibility

Applicants may qualify for a waiver if they meet one of the following conditions: 

  • Full exemption: Completed a degree entirely taught and assessed in English at a University located in a country where English is both the national language and the dominant language of instruction in higher education. This includes:
    • American Samoa, Australia, Botswana, Canada (excluding Quebec), Fiji, Ghana, Guyana, Ireland, Jamaica, Kenya, Lesotho, Liberia, New Zealand, Nigeria, Papua New Guinea, Samoa, Singapore, Solomon Islands, South Africa, Tonga, Trinidad and Tobago, United Kingdom, United States , Zambia, Zimbabwe. 
  • Conditional exemption: If your degree was completed in another country, you may still request an exemption if you can provide official documentation (on University letterhead, signed by an academic official) confirming that your entire program of study was taught and assessed in English. 

English language requirement deadline: The English language requirement should be submitted within the application deadline. However, for those who require more time to satisfy this requirement, there is a final deadline of March 1. 

Submission of Graduate Record Examination (GRE) scores is optional for all applicants but will be considered a plus during the evaluation.

In a 500 to 1000-word essay, explain why you would like to pursue a graduate degree at MBZUAI and include the following information: 

  • Motivation for applying to the University  
  • Personal and academic background and how it makes you suitable for the program you are applying for 
  • Experience in completing a diverse range of projects related to artificial intelligence 
  • Stand-out achievements, e.g. awards, distinction, etc. 
  • Goals as a prospective student 
  • Preferred career path and plans after graduation 
  • Any other details that will support the application.  

The research statement is a document summarizing the potential research project an applicant is interested in working on and clearly justify the research gap which the applicant would like to fill in during the course of his/her study. It must be presented in the context of currently existing literature and provide an overview of how the applicant aims to investigate the underlying research project as well as predict the expected outcomes. It should mention the relevance and suitability of the applicant’s background and experience to the project and highlight the project’s scientific and commercial significance.

The research statement should include the following details: 

  • Title 
  • Problem definition 
  • Literature review 
  • Proposed research/methods/solution (optional) 
  • Study timeline (a table, figure or a small paragraph presenting your plans for the four years in the Ph.D. program) 
  • List of references 
  • Applicants are expected to write the research statement independently. MBZUAI faculty will NOT help write it for the purpose of the application. The MBZUAI Admission Committee will review the submitted document and use it as one of the measures to gauge and assess applicants’ skills. 

Within 10 days of submitting your application, you will receive an invitation to book and complete an online screening exam that assesses knowledge and skills relevant to your chosen field. While you may choose to opt out of the screening exam, this is only recommended for applicants whose profiles already demonstrate strong evidence of the skills assessed in the exam. 

Exam topics

Math: Calculus, probability theory, linear algebra, trigonometry and optimization.

Programming: Knowledge surrounding specific programming concepts and principles such as algorithms, data structures, logic, OOP, and recursion as well as language–specific knowledge of Python.

Specialization topics: Knowledge and understanding of the theory of computation, computational complexity, databases, computer architecture and operating systems.

Applicants are highly encouraged to complete the following online courses to further improve their qualifications:

For more information regarding the screening exam (e.g. process, opting out criteria, and technical specifications), register on the application portal here, and view this knowledge article. 

A select number of applicants may be invited to an interview with faculty as part of the screening process. The time and instructions for this will be communicated to applicants on timely bases. 

Only one application per admission cycle must be submitted; multiple submissions are discouraged. 

Application portal opens   Priority deadline*   Final deadline   Decision notification date  Offer response deadline  
September 1, 2025 

(8 a.m. GST) 

 

 

November 15, 2025 

(5 p.m. GST) 

 

December 15, 2025 

(5 p.m. GST) 

March 15, 2026 

(5 p.m. GST)  

April 15, 2026 

 

 

 

 

*Applications submitted by the priority deadline will be reviewed first. While all applications submitted by the final deadline (December 15, 2025) will be considered, applying by the priority deadline is strongly encouraged. Admissions are highly competitive and space in the incoming cohort is limited. 

Detailed information on the application process and scholarships is available here. 

 

Year 1

Semester 1

CB8103 Introduction to Single Cell Biology and Bioinformatics (4 CR)
CB8104 Introduction to molecular biology for machine learning (4 CR)
One elective from the list (4 CR)

SEMESTER 2

CB8105 Analyzing Multi-Omics Network Data in Biology and Medicine (4 CR)
CB8106 Computational Genomics and Epigenomics (4 CR)
One elective from the list (4 CR)

SUMMER

INT8199 PhD Internship (2 CR)

Year 2

Semester 1

RES899 Advanced Research Methods (2 CR)
CB8199 Ph.D. Research Thesis

Semester 2

CB8199 Ph.D. Research Thesis

Year 3

Semester 1

CB8199 Ph.D. Research Thesis

Semester 2

CB8199 Ph.D. Research Thesis

Year 4

Semester 1

CB8199 Ph.D. Research Thesis

Semester 2

CB8199 Ph.D. Research Thesisx

Become a leader of the future

img

AI innovation

More information
img

AI startups

More information
img

The Node

More information

Register your interest for the Ph.D. in Computational Biology

We’ll keep you up-to-date with the latest news and when applications open.