Master of Science in Statistics and Data Science - MBZUAI Master of Science in Statistics and Data Science
Apply Now More Information
img

Master of Science in

Statistics and Data Science

Overview

The goals of the Master of Science in Statistics and Data Science are to train the next generation of data scientists with a focus on the field of statistics. This master’s program gives a solid foundation in statistical theory and a wide range of methods for working with data in many different application areas. In an increasingly data-driven world, being able to model, interpret and translate information into meaningful insights that organizations can use is a valuable skill. Our graduates are prepared for the pursuit of doctoral degrees as well as careers in business, health, and academia.

  • icon Mode: Full-time
  • icon 36 credits
  • icon Location: On campus

Deadlines for Applications for Fall 2025:

Priority Deadline for Early Decision: 28 February 2025 (5:00 PM UAE time)
Regular Deadline: 31 May 2025 (5:00 PM UAE time)
Decision Notification Deadline: 30 June 2025 (5:00 PM UAE time)

Welcome to the Department of Statistics and Data Science at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI). Our department uniquely integrates rigorous statistical foundations with cutting-edge artificial intelligence. At MBZUAI, our faculty and researchers emphasize the critical role of statistical theory and modern data science in ensuring reliable and effective AI solutions. We equip our students with the expertise to tackle real-world challenges, including efficient data handling, uncertainty quantification, and responsible decision-making.

As AI increasingly transforms industries and society, statistics plays a pivotal role in guiding the responsible and informed development of these technologies. Our department leads the development of innovative AI methods, advancing their reliability and extending their applications to diverse domains such as life sciences, social sciences, healthcare, economics, and environmental studies. We actively explore how statistical approaches can enhance AI’s potential, ensuring trustworthy and impactful solutions across these critical areas.

Join us at MBZUAI in advancing the critical synergy between statistics, data science, and artificial intelligence.

Mladen Kolar

Department Chair of Statistics and Data Science, and Visiting Professor of SDS

Meet the faculty

Mladen Kolar

Mladen Kolar

Department Chair of Statistics and Data Science, and Visiting Professor of SDS

BIO
Qiang Sun

Qiang Sun

Associate Professor of Statistics and Data Science

BIO
Souhaib Ben Taieb

Souhaib Ben Taieb

Associate Professor of Statistics and Data Science

BIO
Eduard Gorbunov

Eduard Gorbunov

Assistant Professor of Statistics and Data Science

BIO
Jian Kang

Jian Kang

Assistant Professor of Statistics and Data Science, and Computer Science

BIO

National Qualifications Framework – three strands 

The program learning outcomes (PLOs) are aligned with the Emirates Qualifications Framework and, as such, are divided into the following learning outcomes strands: knowledge (K), skills (S), and responsibility (R). 

Program learning outcomes 

Upon completion of the program requirements, graduates will be able to:

  • PLO 01: Apply statistical inference and modeling to practical industry challenges.
  • PLO 02: Analyze datasets with supervised learning methods for functional approximation, classification, and forecasting and unsupervised learning methods for dimensionality reduction and clustering.
  • PLO 03: Formulate and use models of data analysis to provide innovative and sustainable solutions to business-related challenges.
  • PLO 04: Identify critical and emerging challenges in data analysis, formulate innovative research questions, and design experiments or studies that yield meaningful insights.
  • PLO 05: Conduct independent research, communicate data findings effectively to any audience, orally, visually, and in written format, individually or in teams, and apply ethical principles in professional settings.

The PLOs are mapped to the National Qualifications Framework Level Seven (7) qualification and categorized into three domains (knowledge, skill and Responsibility) as per the National Qualifications Framework set by the UAE National Qualifications Centre (NQC) and the Ministry of Higher Education and Scientific Research (MoHESR): 

PLOs Knowledge Skill Responsibility
PLO 01 K S R
PLO 02 K S R
PLO 03 K S R
PLO 04 K S R
PLO 05 S

The minimum degree requirements for the Master of Science in Statistics and Data Science program is 36 credits, distributed as follows:

Number of courses Credit hours
Core 4 16
Electives At least two dependent on credit hours 8
Internship At least one internship of up to six weeks duration must be satisfactorily completed as a graduation requirement 2
Introduction to research methods 1 2
Research thesis 1 8
Total 9 36

The Master of Science in Statistics and Data Science is primarily a research-based degree. The purpose of coursework is to equip students with the correct skillset so that they can successfully accomplish their research project (thesis). Students are required to take core courses as mandatory courses. They can select a minimum of two electives.

To accommodate a diverse group of students, coming from different academic backgrounds, students have been provided with flexibility in course selection. The decision on the courses to be taken will be made in consultation with the students’ supervisory panel, which will comprise two or more faculty members. Essentially, the student’s supervisory panel will help design a personalized coursework plan for each individual student by considering their prior academic track record and experience, as well as the planned research project.



The following core courses must be taken by all students:

Code Course title Credit hours
INT799 Master of Science Internship

Assumed knowledge: Prior to undertaking an internship, students must have successfully completed 24 credit hours. 

Course description: The MBZUAI Internship (up to six weeks) with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning. 

2
RES799 Introduction to Research Methods

Course description: The course teaches research methods applicable to scientific research in general, and AI research in particular. It covers various topics including quantitative, qualitative, and mixed methods, measurement and metrics in experimental research, critical appraisal and peer review, public communication, reproducibility and open science, and ethical issues in AI research. Students will gain knowledge in selecting, evaluating, collecting, and sharing data and suitable research methods to address specific research questions. After completing the course, students will have the skills to develop a full research methodology that is rigorous and ethical. 

2
SDS799 Statistics and Data Science Master’s Research Thesis

Course description: Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one (1) year. Master’s thesis research helps train graduates to pursue more advanced research in their Ph.D. degree. Further, it enables graduates to independently pursue an industrial project involving research component independently.

8
SDS7101 Probability and Statistics

Assumed knowledge: Calculus and linear algebra. A calculus-based introduction to probability theory. An introductory course in statistics is recommended. 

Course description: Mathematical statistics is crucial because it forms the backbone of data analysis and enable the development of models to infer, describe, and predict real-world phenomena. It provides the theoretical foundation needed to understand complex data sets, evaluate uncertainty, and make informed decisions. The course is an attempt to provide a reasonable balance between mathematical rigor and statistical practice. It provides a detailed account on the foundations of statistics and the required basics of probability theory but also emphasizes the applicability of statistics to real-world problems.

4
SDS7102 Linear Models and Extensions

Assumed knowledge: Calculus and linear algebra. A calculus-based introduction to probability theory, An introductory course in statistics is recommended. 

Course description: Linear models (LM) and generalized linear models (GLM) are central to data science. These models are critical for analyzing complex data sets, enabling accurate predictions and providing insights into underlying patterns and trends. They facilitate decision making and strategy development in various fields such as finance, healthcare, and social sciences. In this course we present a modern introduction to regression, covering the basics of classical inference methods and introducing newer results from sparse estimation (LASSO).  

4
SDS7103 Data Analysis and Visualization

Assumed knowledge: Familiarity with the fundamental concepts of linear algebra, real analysis, probability theory and statistical inference, and linear model and extension. 

Course description: This course offers a deep dive into advanced techniques in data analysis, focusing on density estimation, clustering, dimension reduction, visualization and practical problem-solving in data science. Students will explore diverse methods for density estimation, including histograms and kernels, and delve into the intricacies of clustering techniques such as K-means, mixture models, hierarchical clustering, and spectral clustering. The course also covers various dimension reduction and visualization methods like principal component analysis (PCA), kernel PCA, t-SNE, and autoencoders. Moreover, students will tackle practical challenges in data science, including outlier detection, handling missing values, and proposing an interpretation of machine learning models. Through practical exercises and real-world examples, students will develop essential skills for analyzing complex datasets and extracting valuable insights. 

4
SDS7104 Prediction and Forecasting

Assumed knowledge: Fundamental course on mathematical statistics. 

Course description: This intensive course provides a comprehensive exploration of advanced statistical learning techniques for prediction and forecasting. Through a blend of theoretical understanding and hands-on implementation, students will delve into fundamental concepts such as model evaluation metrics, penalized regressions, support vector machines, ensemble methods, and specialized algorithms for time series data. 

4

Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one (1) year.

Code Course title Credit hours
STADS799 Statistics and Data Science Master's Research Thesis

Course description: Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one (1) year. Master’s thesis research helps train graduates to pursue more advanced research in their Ph.D. degree. Further, it enables graduates to independently pursue an industrial project involving research component independently.

8
RES799 Introduction to Research Methods

Course description: The course teaches research methods applicable to scientific research in general, and AI research in particular. It covers various topics including scientific methods, measurement and metrics in experimental research, critical appraisal and peer review, public communication, and ethical issues in AI research. Students will gain knowledge in selecting, evaluating, and collecting data and suitable research methods to address specific research questions. Additionally, they will learn design thinking skills to connect their research-based topic to practicality. After completing the course, students will have the skills to develop a full research methodology that is rigorous, entrepreneurial, and ethical.

2

The MBZUAl internship with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning.

Code Course title Credit hours
INT799 Master of Science Internship (up to six weeks)

Assumed knowledge: Prior to undertaking an internship, students must have successfully completed 24 credit hours. 

Course description: The MBZUAI Internship (up to six weeks) with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning. 

2

Students will select a minimum of two (2) elective courses, with a total of eight (8) (or more) credit hours based on interest, proposed research thesis, and career aspirations, in consultation with their supervisory panel. The elective courses available for the Master of Science in Statistics and Data Science are listed below.

Code Course title Credit hours
AI7101 Machine Learning with Python

Assumed knowledge: Linear algebra, mathematical analysis, algorithms. At least intermediate programming skills are necessary. 

Course description: The course gives an introduction to the main topics of modern machine learning such as classification, regression, clustering, and dimensionality reduction. Each topic is accompanied by a survey of key machine learning algorithms solving the problem and is illustrated with a set of real-world examples. The primary objective of the course is giving a broad overview of major machine learning techniques. Particular attention is paid to the modern Python machine learning libraries which allow solving efficiently the problems mentioned above. 

2
AI7102 Introduction to Deep Learning

Assumed knowledge: Basics of linear algebra, calculus, probability and statistics, and basic machine learning concepts. Proficiency in Python. 

Course description: This course covers key concepts and methods in deep learning. Students will begin by learning the foundational principles of deep learning, including the universal approximation theorem, strategies for modeling complex patterns using neural architectures, and the specifics of training deep networks. The course then introduces a range of deep models, including convolutional neural networks, recurrent neural networks, and transformer-based architectures. Students will gain hands-on experience in building and training deep neural networks across various domains such as computer vision, medical imaging, and natural language processing.

2
DS701 Data Mining

Assumed knowledge: Discrete mathematics, probability, and statistics. Proficiency in Java or Python. 

Course description: This course is an introductory course on data mining, which is the process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems. 

4
DS702 Big Data Processing

Assumed knowledge: Databases. Proficiency in Java or Python. Basic knowledge of calculus, linear algebra, probability, and statistics. 

Course description: This course is an introductory course on big data processing, which is the process of analyzing and utilizing big data. The course involves methods at the intersection of parallel computing, machine learning, statistics, database systems, etc. 

4
ENT799 Entrepreneurship in Action

Course description: This course provides a comprehensive introduction to entrepreneurship in the AI era, focusing on transforming technical expertise into viable business ventures. Students will learn to identify market opportunities, validate ideas, develop prototypes, and communicate their vision effectively. The course emphasizes experiential learning through real-world applications, industry engagement, and culminates in a demo day presentation to investors and industry experts. Students will work in teams to develop and pitch AI-driven startup concepts while learning essential entrepreneurial skills, including design thinking, customer validation, storytelling, and fundraising fundamentals. 

2
NLP701 Natural Language Processing

Assumed knowledge: Basic concepts in linear algebra, calculus, probability, and statistics. Programming in Python or similar language. 

Course description: This course provides a comprehensive introduction to natural language processing (NLP). It builds upon fundamental concepts in mathematics, specifically probability and statistics, linear algebra, and calculus, and assumes familiarity with programming. 

4
SDS7501 Numerical Methods for Optimization

Assumed knowledge: Basic concepts in calculus, linear algebra, and programming.  

Course description: The course delves into the world of optimization, linear algebra, and numerical methods, and is specifically tailored to machine learning and data science applications. It aims to bridge the gap between general courses in linear algebra, optimization, and numerical methods and the specific needs of data science practitioners. The emphasis is on understanding and using the underlying mathematical principles, including numerical techniques, rather than naively applying data science solutions.

4
SDS7502 Bayesian Statistics for Data Science

Assumed knowledge: Fundamental course on mathematical statistics. 

Course description: The course introduces Bayesian statistical modeling and inference, focusing on computational methods such as Markov chain Monte Carlo (MCMC) and other simulation methods to aid in the analysis of both low and high-dimensional datasets. 

4

MBZUAI accepts applicants who hold a completed bachelor’s degree in a STEM field such as computer science, electrical engineering, computer engineering, mathematics, physics or other relevant science or engineering major from a University accredited or recognized by the UAE Ministry of Higher Education and Scientific Research (MoHESR) with a minimum CCGPA of 3.0 (on a 4.0 scale) or equivalent.

Applicants must provide their completed degree certificates and official transcripts when submitting their application. Senior-level students can apply initially with a copy of their official transcript and expected graduation letter and upon admission must submit the official completed degree certificate and transcript. A degree attestation from UAE MoHESR (for degrees from the UAE) or Certificate of Recognition (for degrees acquired outside the UAE) should also be furnished within students’ first semester at MBZUAI.

All applicants whose first language is not English must demonstrate proficiency in English through one of the following:

  • IELTS Academic: Minimum overall score of 6.5
  • TOEFL iBT: Minimum score of 90

*Exams must be administered at an approved physical test center. Home Edition exams are not accepted.

English language proficiency waiver eligibility

Applicants may qualify for a waiver if they meet one of the following conditions:

  • Full exemption: Completed a degree entirely taught and assessed in English at a University located in a country where English is both the national language and the dominant language of instruction in higher education. This includes:
    • American Samoa, Australia, Botswana, Canada (excluding Quebec), Fiji, Ghana, Guyana, Ireland, Jamaica, Kenya, Lesotho, Liberia, New Zealand, Nigeria, Papua New Guinea, Samoa, Singapore, Solomon Islands, South Africa, Tonga, Trinidad and Tobago, United Kingdom, United States , Zambia, Zimbabwe.
  • Conditional exemption: If your degree was completed in another country, you may still request an exemption if you can provide official documentation (on University letterhead, signed by an academic official) confirming that your entire program of study was taught and assessed in English. 

English language requirement deadline: The English language requirement should be submitted within the application deadline. However, for those who require more time to satisfy this requirement, there is a final deadline of March 1. 

Submission of GRE scores is optional for all applicants but will be considered a plus during the evaluation.

In a 500- to 1000-word essay, explain why you would like to pursue a graduate degree at MBZUAI and include the following information:

  • Motivation for applying to the university
  • Personal and academic background and how it makes you suitable for the program you are applying for
  • Experience in completing a diverse range of projects related to artificial intelligence
  • Stand-out achievements, e.g. awards, distinction, etc.
  • Goals as a prospective student
  • Preferred career path and plans after graduation
  • Any other details that will support the application.

Applicants will be required to nominate referees who can recommend their application. M.Sc. applicants should have a minimum of two (2) referees wherein one was a previous course instructor or faculty/research advisor and the other a current or previous work supervisor.

To avoid issues and delays in the provision of the recommendation, applicants have to inform their referees of their nomination beforehand and provide the latter’s accurate information in the online application portal. Automated notifications will be sent out to the referees upon application submission.

Within 10 days of submitting your application, you will receive an invitation to book and complete an online screening exam that assesses knowledge and skills relevant to your chosen field. While you may choose to opt out of the screening exam, this is only recommended for applicants whose profiles already demonstrate strong evidence of the skills assessed in the exam. 

Exam topics

Math: Calculus, probability theory, linear algebra, and trigonometry.

Programming: Knowledge surrounding specific programming concepts and principles such as algorithms, data structures, logic, OOP, and recursion as well as language–specific knowledge of Python.

Applicants are highly encouraged to complete the following online courses to further improve their qualifications:

For more information regarding the screening exam (e.g. process, opting out criteria, and technical specifications), register on the application portal here, and view this knowledge article. 

A select number of applicants may be invited to an interview with faculty as part of the screening process. The time and instructions for this will be communicated to applicants in a timely manner.

Only one application per admission cycle must be submitted; multiple submissions are discouraged.

Application portal opens Priority deadline* Final deadline Decision notification date Offer response deadline
September 1, 2025
(8 a.m. GST)
November 15, 2025
(5 p.m. GST)
December 15, 2025
(5 p.m. GST)
March 15, 2026
(5:00 p.m. GST)
April 15, 2026

* Applications submitted by the priority deadline will be reviewed first. While all applications submitted by the final deadline (December 15, 2025) will be considered, applying by the priority deadline is strongly encouraged. Admissions are highly competitive and space in the incoming cohort is limited.

Detailed information on the application process and scholarships is available here.

Students are expected to complete coursework in the first year of degree and focus more on the research project and thesis writing in the second year. However, this is an indicative plan, and students have the flexibility to take a light course load in the second year as well and, similarly, can start research in the first year (e.g., literature review, background study, data collection or initial framework design) with the approval of their supervisory panel.

A typical study plan is as follows:

SEMESTER 1

SDS7101 Probability and Statistics (4 CR)
SDS7102 Linear Models and Extensions (4 CR)
SDS7103 Data Analysis and Visualization (4 CR)

SEMESTER 2

SDS7104 Prediction and Forecasting (4 CR)
Two electives from the list (8 CR)

SUMMER

INT799 Master of Science Internship (2 CR)

SEMESTER 3

RES799 Introduction to Research Methods (2 CR)
SDS799 Statistics and Data Science Master's Research Thesis (4 CR)

SEMESTER 4

SDS799 Statistics and Data Science Master's Research Thesis (4 CR)
Disclaimer: Subject to change.

Become a leader of the future

img

AI innovation

More information
img

AI startups

More information
img

The Node

More information

Register your interest for the M.Sc. in Statistics and Data Science

We’ll keep you up-to-date with the latest news and when applications open.