Master of Science in Computational Biology - MBZUAI Master of Science in Computational Biology
Apply Now More Information
img

Master of Science in

Computational Biology

Overview

The goal of the Master of Science in Computational Biology is to equip students with knowledge and skills in both biology and computational sciences including programming, machine learning, statistics, and bioinformatics. The program prepares students for collaborative work with biologists, chemists, and clinicians by fostering interdisciplinary communication and teamwork. The program aims to prepare students to join the workforce in biotechnology and healthcare or to pursue a Ph.D..

The program supports the development of the UAE biotechnology and health ecosystem by developing a highly skilled workforce in computational biology and bioinformatics. It makes significant contributions to both the academic world and the broader scientific community while supporting the UAE societal and economic goals and transformation agenda, and bridges gaps between disciplines and industries and build a collaborative platform for the UAE biotechnology and healthcare ecosystem, with strong connections to the state-of-the-art international biotechnology research and industry.

  • icon Mode: Full-time
  • icon 36 credits
  • icon Location: On campus

The Computational Biology Department at MBZUAI sits at the cutting edge of biology, medicine, and AI. Our faculty lead efforts to model complex biological systems and extract insights from large-scale molecular data. Flagship initiatives like the Human Phenotype Project (HPP) and analysis of the Emirati Genome Project provide rich, multi-omic datasets that fuel research in systems biology, genetics, biomarker discovery, and AI-driven disease prediction. Through this work, we aim to transform precision medicine and prepare the next generation of scientists to lead in data-driven healthcare innovation.

Eran Segal

Department Chair and Professor of Computational Biology

Read Bio

Meet the faculty

Aziz Khan

Aziz Khan

Assistant Professor of Computational Biology

Eduardo Beltrame

Eduardo Beltrame

Assistant Professor of Computational Biology

Imran Razzak

Imran Razzak

Associate Professor of Computational Biology

Natasa Przulj

Natasa Przulj

Professor of Computational Biology

Yu Li

Yu Li

Affiliated Assistant Professor of Computational Biology

National Qualifications Framework – three strands

The program learning outcomes (PLOs) are aligned with the Emirates Qualifications Framework and, as such, are divided into the following learning outcomes strands: knowledge (K), skills (S), and responsibility (R).

Upon completion of the program requirements, graduates will be able to:

  • PLO 01: Design scientific experiments, especially those related to biological systems, using the scientific method.
  • PLO 02: Apply computational tools and technologies, including Artificial Intelligence, to solving problems of varying complexity, especially computational and experimental tools and methods utilized in life sciences, biotechnology and healthcare.
  • PLO 03: Utilize methods and tools to integrate, analyze and interpret biological data from various sources, respecting ethical norms and personal data privacy.
  • PLO 04: Function effectively as a member or leader of a team engaged in collaborative research projects.
  • PLO 05: Appraise and communicate data and scientific findings effectively to any audience, orally, visually, and in written format, individually or in teams.

The PLOs are mapped to the National Qualifications Framework Level Seven (7) qualification and categorized into three domains (knowledge, skill, and responsibility) as per the National Qualifications Framework set by the UAE National Qualifications Centre (NQC) and the Ministry of Higher Education and Scientific Research (MoHESR):

PLOs Knowledge Skill Responsibility
PLO 01 K S
PLO 02 S R
PLO 03 S R
PLO 04 S R
PLO 05 K S R

    The minimum degree requirements for the Master of Science in Computational Biology program is 36 credits, distributed as follows:

    Number of courses Credit hours
    Core 4 16
    Electives 2 8
    Internship At least one internship of up to six weeks duration must be satisfactorily completed as a graduation requirement 2
    Introduction to research methods 1 2
    Research thesis 1 8
    Total 9 36

    The Master of Science in Computational Biology program is primarily a research-based degree. The purpose of coursework is to equip students with the right skill set, so they can successfully accomplish their research project (thesis).

    All students must take the following courses:

    Code Course title Credit hours
    CB703 Introduction to Single Cell Biology and Bioinformatics

    Course description: This course provides a broad overview of bioinformatics for single cell omics technologies, a new and fast-growing family of biological assays that enables measuring the molecular contents of individual cells with very high resolution and is key to advancing precision medicine. The course starts with an accessible introduction to basic molecular biology: the cell structure, the central dogma of molecular biology, the flow of biological information in the cell, the different types of molecules in the cell, and how we can measure them. This course then introduces students to the diverse landscape of biological data, including its types and characteristics and explores the foundational principles of single-cell omics bioinformatics, encompassing key methodologies, tools, and computational workflows, with an emphasis on the development of foundation models for single cell omics data.

    4
    CB704 Introduction to Molecular Biology for Machine Learning

    Course description: This interdisciplinary course is designed for students and professionals with a background in machine learning who seek to understand the fundamentals of molecular biology. The course explores key biological concepts and their applications to machine learning, particularly in the fields of bioinformatics, genomics, and computational biology. By bridging the gap between molecular biology and machine learning, students will gain the knowledge to apply machine learning techniques to biological data and solve complex problems in medicine, genetics, and drug discovery.

    4
    CB705 Analyzing Multi-omics Network Data in Biology and Medicine

    Course description: Computational biology has become an important discipline in the intersection of computing, mathematics, biology and medicine. Biological datasets produced by modern biotechnologies are large and hence they can only be understood by using mathematical modeling and computational techniques. Starting from analysis of genetic sequences, the field has progressed towards analysis and modeling of entire biological systems. A way of abstracting the vast amount of biomedical information is by modeling and analyzing these data by using networks (or graphs). Such approaches have been used to model phenomena in other research domains, apart from computational and systems biology and medicine.

    4
    CB706 Computational Genomics and Epigenomics

    Course description: This interdisciplinary graduate course introduces students to the core concepts, tools, and emerging methods in computational genomics and epigenomics. It is designed for students from biology, computer science, machine learning, and related fields who are interested in applying their skills to cutting-edge questions in biology and medicine. Students will explore how genomic and epigenomic data, such as DNA, RNA, chromatin accessibility, and methylation, are generated and analyzed using statistical and machine learning techniques. The course covers foundational topics such as genome organization, gene regulation, and high-throughput sequencing technologies. It extends to advanced applications including multi-omic data integration, deep learning for genomics, and cancer omics.

    4

    Students will select a minimum of three elective courses, totaling twelve (or more) credit hours, based on their interests, proposed research thesis, and career aspirations, in consultation with their supervisory panel. The elective courses available for the Master of Science in Computer Science are listed below:

    Code Course title Credit hours
    AI701 Foundations of Artificial Intelligence

    Assumed knowledge: Basic concepts in calculus, linear algebra, and programming.

    Course description: This course provides a comprehensive introduction to artificial intelligence. It builds upon fundamental concepts in machine learning. Students will learn about supervised and unsupervised learning, various learning algorithms, and the basics of the neural network, deep learning, and reinforcement learning.

    4
    AI7102 Deep Learning

    Assumed knowledge: Basics of linear algebra, calculus, and probability and statistics. Proficiency in Python.

    Course description: This course provides a comprehensive overview of different concepts and methods related to deep learning. Students will first learn the foundations of deep learning, after which they will be introduced to a series of deep models: convolutional neural networks, autoencoders, recurrent neural network, and deep generative models. Students will work on case studies of deep learning in different fields such as computer vision, medical imaging, natural language processing, etc.

    4
    CV701 Human and Computer Vision

    Assumed knowledge: Basics of linear algebra, calculus, and probability and statistics. Proficiency in Python.

    Course description: This course provides a comprehensive introduction to the basics of human visual system and color perception, image acquisition and processing, linear and nonlinear image filtering, image features description and extraction, classification and segmentation strategies. Moreover, students will be introduced to quality assessment methodologies for computer vision and image processing algorithms.

    Prerequisite courses:  CS7101 Algorithms and Data Structures 

    Course description: This course discusses the advanced concepts in operating system design and implementation. The operating system provides a convenient and efficient interface between user programs and the hardware of the computer on which they run. 

    4
    CV702 Geometry for Computer Vision

    Assumed knowledge: Hands-on experience with Python and Pytorch.

    Prerequisite course/s: CV 701 Human and Computer Vision or equivalent

    Course description: The course provides a comprehensive introduction to the concepts, principles and methods of geometry-aware computer vision which helps in describing the shape and structure of the world. In particular, the objective of the course is to introduce the formal tools and techniques that are necessary for estimating depth, motion, disparity, volume, pose, and shapes in 3D scenes.

    4
    CV703 Visual Object Recognition and Detection

    Assumed knowledge: Basics of linear algebra, calculus, probability, and statistics demonstrated through relevant coursework. Proficiency in Python and Pytorch.

    Prerequisite course/s: CV701 Human and Computer Vision or equivalent

    Course description: This course provides a comprehensive overview of different concepts and methods related to visual object recognition and detection. In particular, the students will learn a large family of successful and recent state-of-the-art architectures of deep neural networks to solve the tasks of visual recognition, detection, and tracking.

    4
    DS701 Data Mining

    Assumed knowledge: Discrete mathematics, and probability and statistics. Proficiency in Java or Python.

    Course description: This course is an introductory course on data mining, which is the process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.

    Course description: This course provides an overview of foundational principles and contemporary topics in information security. Students will examine system protection strategies, structural security frameworks, software resilience, and detection of security threats. The course integrates theoretical concepts with practical applications to enhance the understanding of securing complex information systems. 

    4
    DS702 Big Data Processing

    Assumed knowledge: Databases. Proficiency in Java or Python. Basic knowledge of calculus, linear algebra, and probability and statistics.

    Course description: This course is an introductory course on big data processing, which is the process of analyzing and utilizing big data. The course involves methods at the intersection of parallel computing, machine learning, statistics, database systems, etc.

    4
    DS703 Information Retrieval

    Assumed knowledge: Discrete mathematics, and probability and statistics. Proficiency in Python or Java or C++.

    Course description: This course is an introductory course on information retrieval (IR). The explosive growth of available digital information (e.g., web pages, emails, news, social, Wikipedia pages) demands intelligent information agents that can sift through all available information and find out the most valuable and relevant information. Web search engines, such as Google and Bing, are several examples of such tools. This course studies the basic principles and practical algorithms used for information retrieval and text mining. It will cover algorithms, design, and implementation of modern information retrieval systems. Topics include retrieval system design and implementation, text analysis techniques, retrieval models (e.g., Boolean, vector space, probabilistic, and learning-based methods), search evaluation, retrieval feedback, search log mining, and applications in web information management.

    4
    DS704 Statistical Aspects of Machine Learning Theory

    Assumed knowledge: Familiarity with the fundamental concepts of probability theory, linear algebra, and real analysis. A first course in statistics would be a plus.

    Prerequisite course/s: ML701 Machine Learning or AI701 Foundations of Artificial Intelligence

    Course description: This course covers the fundamentals of theoretical statistics, which are the foundation for the analysis of the properties of machine learning algorithms. Covered topics include statistical models, statistical inference, maximum likelihood estimation, optimal hypothesis testing, decision theory and Bayesian inference, non-parametric statistics, and Bootstrap, (generalized) linear model and high dimensional statistics. All necessary tools from probability theory: deviation inequalities, type of convergence, law of large numbers, central limit theorem, properties of the Gaussian distribution (etc.) will be introduced whenever needed and their proofs given at the end of each chapter.

    4
    HC701 Medical Imaging: Physics and Analysis

    Assumed knowledge: Familiarity with Python programming. Undergraduate course in signal processing or signals and systems.

    Course description: This course provides a graduate-level introduction to the principles and methods of medical imaging, with thorough grounding in the physics of imaging problems. This course covers the fundamentals of X-ray, CT, MRI, ultrasound, and PET imaging. In addition, the course provides an overview of 3D geometry of medical images and a few classical problems in medical images analysis including classification, segmentation, registration, quantification, reconstruction, and radiomics.

    4
    ML701 Machine Learning

    Assumed knowledge: Basic concepts in calculus, linear algebra, and programming.

    Course description: This course provides a comprehensive introduction to machine learning. It builds upon fundamental concepts in mathematics, specifically probability and statistics, linear algebra, and calculus. Students will learn about supervised and unsupervised learning, various learning algorithms, and the basics of learning theory, graphical models, and reinforcement learning.

    4
    ML7101 Probabilistic and Statistical Inference

    Assumed knowledge: Basic concepts in linear algebra, calculus, probability, and statistics. Programming in Python or similar language. 

    Prerequisite course: NLP701 Natural Language Processing  

    Course description: This course provides a comprehensive introduction to speech processing. It builds upon fundamental concepts in speech processing and assumes familiarity with mathematical and signal processing concepts. 

    2
    MTH702 Optimization

    Assumed knowledge: Linear algebra, matrix analysis, and probability and statistics.

    Course description: This course provides a graduate-level introduction to the principles and methods of optimization, with a thorough grounding in the mathematical formulation of optimization problems. The course covers fundamentals of convex functions and sets, first order and second order optimization methods, problems with equality and/or inequality constraints, and other advanced problems.

    4
    MTH703 Mathematics for Computer Science

    Course description: The course is designed to comprehensively understand various mathematical concepts and their applications for theoretical computer science. The lectures will cover topics such as asymptotics, the Central Limit Theorem, Chernoff bounds, mathematical problem solving, computational models, spectral graph theory, linear programming, semidefinite programming, error correcting codes, de-randomization, expander graphs, constraint satisfaction problems, treewidth, analysis of Boolean functions, communication complexity, information theory, LP hierarchies and proof complexity, quantum computation, cryptography, hardness assumptions, and the sketch of the PCP Theorem.

    4
    MTH7101 Mathematical Foundations of Artificial Intelligence

    Assumed knowledge: Students enrolling in this course should possess proficiency with high-school–level algebra and trigonometry, along with an introductory treatment of single-variable calculus (limits, basic differentiation, and elementary integration). Familiarity with manipulating simple vectors and matrices—as encountered in a first-year linear algebra or applied mathematics course—is beneficial but not strictly required; key concepts will be refreshed before deeper exploration. No prior coursework in probability or statistics is assumed.

    Course description: This seven-week course equips students with the essential mathematical tools required for modern artificial intelligence and machine learning study. Topics include single- and multivariate calculus (limits, derivatives, gradients, Jacobians, Hessians), linear algebra (vectors, matrices, eigenvalues, decompositions), and probability and statistics (discrete and continuous distributions, Bayes’ theorem, expectation, covariance, law of large numbers, central limit theorem, maximum-likelihood estimation). Emphasis is placed on conceptual understanding, geometric intuition, and hands-on problem solving that link theory to common AI algorithms such as optimization and probabilistic inference.

    2
    NLP701 Natural Language Processing

    Assumed knowledge: Basic concepts in linear algebra, calculus, probability, and statistics. Programming in Python or similar language.

    Course description: This course provides a comprehensive introduction to natural language processing (NLP). It builds upon fundamental concepts in mathematics, specifically probability and statistics, linear algebra, and calculus, and assumes familiarity with programming.

    4
    NLP702 Advanced Natural Language Processing

    Assumed knowledge: Basic concepts in linear algebra, calculus, probability, and statistics. Programming in Python or similar language. At least intermediate knowledge of PyTorch.

    Prerequisite course/s: NLP701 Natural Language Processing

    Course description: This course provides a methodological and an in-depth background on key core natural language processing (NLP) areas based on deep learning. It builds upon fundamental concepts in NLP integrating advances on large language models (LLMs). It assumes familiarity with mathematical and machine learning concepts and programming.

    4
    NLP703 Speech Processing

    Assumed knowledge: Basic concepts in linear algebra, calculus, probability, and statistics. Programming in Python or similar language.

    Prerequisite course/s: NLP701 Natural Language Processing

    Course description: This course provides a comprehensive introduction to speech processing. It builds upon fundamental concepts in speech processing and assumes familiarity with mathematical and signal processing concepts.

    4

    Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one year.

    Code Course title Credit hours
    CB799 Computational Biology Master's Research Thesis

    Course description: Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one (1) year. Master’s thesis research helps train graduates to pursue more advanced research in their Ph.D. degree. Further, it enables graduates to independently pursue an industrial project involving research component.

    32
    RES799 Introduction to Research Methods

    Course description: The course teaches research methods applicable to scientific research in general, and AI research in particular. It covers various topics including scientific methods, measurement and metrics in experimental research, critical appraisal and peer review, public communication, and ethical issues in AI research. Students will gain knowledge in selecting, evaluating, and collecting data and suitable research methods to address specific research questions. Additionally, they will learn design thinking skills to connect their research-based topic to practicality. After completing the course, students will have the skills to develop a full research methodology that is rigorous, entrepreneurial, and ethical.

    2

    The MBZUAl internship with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning.

    Code Course title Credit hours
    INT799 Master of Science Internship (up to six weeks)

    Assumed knowledge: Prior to undertaking an internship, students must have successfully completed 24 credit hours. 

    Course description: The MBZUAI internship (up to six weeks) with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning. 

    2

    MBZUAI accepts applicants from all nationalities who hold a completed bachelor’s degree in a STEM field such as computer science, electrical engineering, computer engineering, mathematics, physics or other relevant science or engineering major from a University accredited or recognized by the UAE Ministry of Education (MoE) with a minimum CCGPA of 3.0 (on a 4.0 scale) or equivalent.

    <p>Applicants must provide their completed degree certificates and official transcripts when submitting their application. Senior-level students can apply initially with a copy of their official transcript and expected graduation letter and upon admission must submit the official completed degree certificate and transcript. A degree attestation from UAE MoE (for degrees from the UAE) or Certificate of Recognition from UAE MoE (for degrees acquired outside the UAE) should also be furnished within students’ first semester at MBZUAI.

    All applicants whose first language is not English must demonstrate proficiency in English through one of the following:

    • IELTS Academic: Minimum overall score of 6.5
    • TOEFL iBT: Minimum score of 90

    *Exams must be administered at an approved physical test center. Home Edition exams are not accepted.

    English language proficiency waiver eligibility:
    Applicants may qualify for a waiver if they meet one of the following conditions:

    • Full exemption: Completed a degree entirely taught and assessed in English at a University located in a country where English is both the national language and the dominant language of instruction in higher education. This includes:
      • American Samoa, Australia, Botswana, Canada (excluding Quebec), Fiji, Ghana, Guyana, Ireland, Jamaica, Kenya, Lesotho, Liberia, New Zealand, Nigeria, Papua New Guinea, Samoa, Singapore, Solomon Islands, South Africa, Tonga, Trinidad and Tobago, United Kingdom, United States , Zambia, Zimbabwe.
    • Conditional exemption: Completed a degree in an English-medium institution in a non-English-speaking country.

    English language requirement deadline: The English language requirement should be submitted within the application deadline. However, for those who require more time to satisfy this requirement, there is a final deadline of March 1.

    The Graduate Record Examination (GRE) General score is a plus and would be considered in evaluating the applicants (optional).

    In an 800-word essay, please explain why you would like to pursue a graduate degree at MBZUAI and include the following information:

    • Motivation for applying to the University
    • Personal and academic background and how it makes you suitable for the program you are applying for
    • Experience in completing a diverse range of projects related to artificial intelligence
    • Stand-out achievements, e.g. awards, distinction, etc.
    • Goals as a prospective student
    • Preferred career path and plans after graduation
    • Any other details that will support the application.

    Selected applicants will be invited to participate in an entry exam that will include questions related to the following topics:

    • Admission interview

    Prior coursework in natural sciences is considered an advantage.

    A typical study plan is as follows:

    SEMESTER 1

    CB703 Introduction to Single Cell Biology and Bioinformatics (4CR)
    CB704 Introduction to Molecular Biology for Machine Learning (4CR)
    CB705 Analyzing Multi-Omics Network Data in Biology and Medicine (4CR)

    SEMESTER 2

    CB706 Genomics and Gene Regulation
    One elective from the list (4CR)
    One elective from the list (4CR)

    SUMMER

    INT799 Master of Science Internship (2CR)

    SEMESTER 3

    CB799 Computational Biology Master's Research Thesis (4CR)
    RES799 Introduction to Research Methods (2CR)

    SEMESTER 4

    CB799 Computational Biology Master's Research Thesis (4CR)
    Disclaimer: Subject to change.

    Become a leader of the future

    img

    AI innovation

    More information
    img

    AI startups

    More information
    img

    The Node

    More information

    Register your interest for the M.Sc. in Computational Biology

    We’ll keep you up-to-date with the latest news and when applications open.