The goals of the Master of Science in Statistics and Data Science are to train the next generation of data scientists with a focus on the field of statistics. This master’s program gives a solid foundation in statistical theory and a wide range of methods for working with data in many different application areas. In an increasingly data-driven world, being able to model, interpret and translate information into meaningful insights that organizations can use is a valuable skill. Our graduates are prepared for the pursuit of doctoral degrees as well as careers in business, health, and academia.
Deadlines for Applications for Fall 2025:
Priority Deadline for Early Decision: 28 February 2025 (5:00 PM UAE time)
Regular Deadline: 31 May 2025 (5:00 PM UAE time)
Decision Notification Deadline: 30 June 2025 (5:00 PM UAE time)
Welcome to the Department of Statistics and Data Science at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI). Our department uniquely integrates rigorous statistical foundations with cutting-edge artificial intelligence. At MBZUAI, our faculty and researchers emphasize the critical role of statistical theory and modern data science in ensuring reliable and effective AI solutions. We equip our students with the expertise to tackle real-world challenges, including efficient data handling, uncertainty quantification, and responsible decision-making.
As AI increasingly transforms industries and society, statistics plays a pivotal role in guiding the responsible and informed development of these technologies. Our department leads the development of innovative AI methods, advancing their reliability and extending their applications to diverse domains such as life sciences, social sciences, healthcare, economics, and environmental studies. We actively explore how statistical approaches can enhance AI’s potential, ensuring trustworthy and impactful solutions across these critical areas.
Join us at MBZUAI in advancing the critical synergy between statistics, data science, and artificial intelligence.
Department Chair of Statistics and Data Science, and Visiting Professor of SDS
National Qualifications Framework – three strands
The program learning outcomes (PLOs) are aligned with the Emirates Qualifications Framework and, as such, are divided into the following learning outcomes strands: knowledge (K), skills (S), and responsibility (R).
Program learning outcomes
Upon completion of the program requirements, graduates will be able to:
The PLOs are mapped to the National Qualifications Framework Level Seven (7) qualification and categorized into three domains (knowledge, skill and Responsibility) as per the National Qualifications Framework set by the UAE National Qualifications Centre (NQC) and the Ministry of Higher Education and Scientific Research (MoHESR):
PLOs | Knowledge | Skill | Responsibility |
---|---|---|---|
PLO 01 | K | S | R |
PLO 02 | K | S | R |
PLO 03 | K | S | R |
PLO 04 | K | S | R |
PLO 05 | – | S | – |
The minimum degree requirements for the Master of Science in Statistics and Data Science program is 36 credits, distributed as follows:
Number of courses | Credit hours | |
---|---|---|
Core | 4 | 16 |
Electives | At least two dependent on credit hours | 8 |
Internship | At least one internship of up to six weeks duration must be satisfactorily completed as a graduation requirement | 2 |
Introduction to research methods | 1 | 2 |
Research thesis | 1 | 8 |
Total | 9 | 36 |
The Master of Science in Statistics and Data Science is primarily a research-based degree. The purpose of coursework is to equip students with the correct skillset so that they can successfully accomplish their research project (thesis). Students are required to take core courses as mandatory courses. They can select a minimum of two electives.
To accommodate a diverse group of students, coming from different academic backgrounds, students have been provided with flexibility in course selection. The decision on the courses to be taken will be made in consultation with the students’ supervisory panel, which will comprise two or more faculty members. Essentially, the student’s supervisory panel will help design a personalized coursework plan for each individual student by considering their prior academic track record and experience, as well as the planned research project.
The following core courses must be taken by all students:Code | Course title | Credit hours |
---|---|---|
INT799 |
Master of Science Internship
Assumed knowledge: Prior to undertaking an internship, students must have successfully completed 24 credit hours. Course description: The MBZUAI Internship (up to six weeks) with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning. |
2 |
RES799 |
Introduction to Research Methods
Course description: The course teaches research methods applicable to scientific research in general, and AI research in particular. It covers various topics including quantitative, qualitative, and mixed methods, measurement and metrics in experimental research, critical appraisal and peer review, public communication, reproducibility and open science, and ethical issues in AI research. Students will gain knowledge in selecting, evaluating, collecting, and sharing data and suitable research methods to address specific research questions. After completing the course, students will have the skills to develop a full research methodology that is rigorous and ethical. |
2 |
SDS799 |
Statistics and Data Science Master’s Research Thesis
Course description: Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one (1) year. Master’s thesis research helps train graduates to pursue more advanced research in their Ph.D. degree. Further, it enables graduates to independently pursue an industrial project involving research component independently. |
8 |
SDS7101 |
Probability and Statistics
Assumed knowledge: Calculus and linear algebra. A calculus-based introduction to probability theory. An introductory course in statistics is recommended. Course description: Mathematical statistics is crucial because it forms the backbone of data analysis and enable the development of models to infer, describe, and predict real-world phenomena. It provides the theoretical foundation needed to understand complex data sets, evaluate uncertainty, and make informed decisions. The course is an attempt to provide a reasonable balance between mathematical rigor and statistical practice. It provides a detailed account on the foundations of statistics and the required basics of probability theory but also emphasizes the applicability of statistics to real-world problems. |
4 |
SDS7102 |
Linear Models and Extensions
Assumed knowledge: Calculus and linear algebra. A calculus-based introduction to probability theory, An introductory course in statistics is recommended. Course description: Linear models (LM) and generalized linear models (GLM) are central to data science. These models are critical for analyzing complex data sets, enabling accurate predictions and providing insights into underlying patterns and trends. They facilitate decision making and strategy development in various fields such as finance, healthcare, and social sciences. In this course we present a modern introduction to regression, covering the basics of classical inference methods and introducing newer results from sparse estimation (LASSO). |
4 |
SDS7103 |
Data Analysis and Visualization
Assumed knowledge: Familiarity with the fundamental concepts of linear algebra, real analysis, probability theory and statistical inference, and linear model and extension. Course description: This course offers a deep dive into advanced techniques in data analysis, focusing on density estimation, clustering, dimension reduction, visualization and practical problem-solving in data science. Students will explore diverse methods for density estimation, including histograms and kernels, and delve into the intricacies of clustering techniques such as K-means, mixture models, hierarchical clustering, and spectral clustering. The course also covers various dimension reduction and visualization methods like principal component analysis (PCA), kernel PCA, t-SNE, and autoencoders. Moreover, students will tackle practical challenges in data science, including outlier detection, handling missing values, and proposing an interpretation of machine learning models. Through practical exercises and real-world examples, students will develop essential skills for analyzing complex datasets and extracting valuable insights. |
4 |
SDS7104 |
Prediction and Forecasting
Assumed knowledge: Fundamental course on mathematical statistics. Course description: This intensive course provides a comprehensive exploration of advanced statistical learning techniques for prediction and forecasting. Through a blend of theoretical understanding and hands-on implementation, students will delve into fundamental concepts such as model evaluation metrics, penalized regressions, support vector machines, ensemble methods, and specialized algorithms for time series data. |
4 |
Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one (1) year.
Code | Course title | Credit hours |
---|---|---|
STADS799 |
Statistics and Data Science Master's Research Thesis
Course description: Master’s thesis research exposes students to an unsolved research problem, where they are required to propose new solutions and contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of one (1) year. Master’s thesis research helps train graduates to pursue more advanced research in their Ph.D. degree. Further, it enables graduates to independently pursue an industrial project involving research component independently. |
8 |
RES799 |
Introduction to Research Methods
Course description: The course teaches research methods applicable to scientific research in general, and AI research in particular. It covers various topics including scientific methods, measurement and metrics in experimental research, critical appraisal and peer review, public communication, and ethical issues in AI research. Students will gain knowledge in selecting, evaluating, and collecting data and suitable research methods to address specific research questions. Additionally, they will learn design thinking skills to connect their research-based topic to practicality. After completing the course, students will have the skills to develop a full research methodology that is rigorous, entrepreneurial, and ethical. |
2 |
The MBZUAl internship with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning.
Code | Course title | Credit hours |
---|---|---|
INT799 |
Master of Science Internship (up to six weeks)
Assumed knowledge: Prior to undertaking an internship, students must have successfully completed 24 credit hours. Course description: The MBZUAI Internship (up to six weeks) with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning. |
2 |
Students will select a minimum of two (2) elective courses, with a total of eight (8) (or more) credit hours based on interest, proposed research thesis, and career aspirations, in consultation with their supervisory panel. The elective courses available for the Master of Science in Statistics and Data Science are listed below.
Code | Course title | Credit hours |
---|---|---|
AI7101 |
Machine Learning with Python
Assumed knowledge: Linear algebra, mathematical analysis, algorithms. At least intermediate programming skills are necessary. Course description: The course gives an introduction to the main topics of modern machine learning such as classification, regression, clustering, and dimensionality reduction. Each topic is accompanied by a survey of key machine learning algorithms solving the problem and is illustrated with a set of real-world examples. The primary objective of the course is giving a broad overview of major machine learning techniques. Particular attention is paid to the modern Python machine learning libraries which allow solving efficiently the problems mentioned above. |
2 |
AI7102 |
Introduction to Deep Learning
Assumed knowledge: Basics of linear algebra, calculus, probability and statistics, and basic machine learning concepts. Proficiency in Python. Course description: This course covers key concepts and methods in deep learning. Students will begin by learning the foundational principles of deep learning, including the universal approximation theorem, strategies for modeling complex patterns using neural architectures, and the specifics of training deep networks. The course then introduces a range of deep models, including convolutional neural networks, recurrent neural networks, and transformer-based architectures. Students will gain hands-on experience in building and training deep neural networks across various domains such as computer vision, medical imaging, and natural language processing. |
2 |
DS701 |
Data Mining
Assumed knowledge: Discrete mathematics, probability, and statistics. Proficiency in Java or Python. Course description: This course is an introductory course on data mining, which is the process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems. |
4 |
DS702 |
Big Data Processing
Assumed knowledge: Databases. Proficiency in Java or Python. Basic knowledge of calculus, linear algebra, probability, and statistics. Course description: This course is an introductory course on big data processing, which is the process of analyzing and utilizing big data. The course involves methods at the intersection of parallel computing, machine learning, statistics, database systems, etc. |
4 |
ENT799 |
Entrepreneurship in Action
Course description: This course provides a comprehensive introduction to entrepreneurship in the AI era, focusing on transforming technical expertise into viable business ventures. Students will learn to identify market opportunities, validate ideas, develop prototypes, and communicate their vision effectively. The course emphasizes experiential learning through real-world applications, industry engagement, and culminates in a demo day presentation to investors and industry experts. Students will work in teams to develop and pitch AI-driven startup concepts while learning essential entrepreneurial skills, including design thinking, customer validation, storytelling, and fundraising fundamentals. |
2 |
NLP701 |
Natural Language Processing
Assumed knowledge: Basic concepts in linear algebra, calculus, probability, and statistics. Programming in Python or similar language. Course description: This course provides a comprehensive introduction to natural language processing (NLP). It builds upon fundamental concepts in mathematics, specifically probability and statistics, linear algebra, and calculus, and assumes familiarity with programming. |
4 |
SDS7501 |
Numerical Methods for Optimization
Assumed knowledge: Basic concepts in calculus, linear algebra, and programming. Course description: The course delves into the world of optimization, linear algebra, and numerical methods, and is specifically tailored to machine learning and data science applications. It aims to bridge the gap between general courses in linear algebra, optimization, and numerical methods and the specific needs of data science practitioners. The emphasis is on understanding and using the underlying mathematical principles, including numerical techniques, rather than naively applying data science solutions. |
4 |
SDS7502 |
Bayesian Statistics for Data Science
Assumed knowledge: Fundamental course on mathematical statistics. Course description: The course introduces Bayesian statistical modeling and inference, focusing on computational methods such as Markov chain Monte Carlo (MCMC) and other simulation methods to aid in the analysis of both low and high-dimensional datasets. |
4 |
MBZUAI accepts applicants who hold a completed bachelor’s degree in a STEM field such as computer science, electrical engineering, computer engineering, mathematics, physics or other relevant science or engineering major from a University accredited or recognized by the UAE Ministry of Higher Education and Scientific Research (MoHESR) with a minimum CCGPA of 3.0 (on a 4.0 scale) or equivalent.
Applicants must provide their completed degree certificates and official transcripts when submitting their application. Senior-level students can apply initially with a copy of their official transcript and expected graduation letter and upon admission must submit the official completed degree certificate and transcript. A degree attestation from UAE MoHESR (for degrees from the UAE) or Certificate of Recognition (for degrees acquired outside the UAE) should also be furnished within students’ first semester at MBZUAI.
All applicants whose first language is not English must demonstrate proficiency in English through one of the following:
*Exams must be administered at an approved physical test center. Home Edition exams are not accepted.
English language proficiency waiver eligibility
Applicants may qualify for a waiver if they meet one of the following conditions:
English language requirement deadline: The English language requirement should be submitted within the application deadline. However, for those who require more time to satisfy this requirement, there is a final deadline of March 1.
Submission of GRE scores is optional for all applicants but will be considered a plus during the evaluation.
In a 500- to 1000-word essay, explain why you would like to pursue a graduate degree at MBZUAI and include the following information:
Applicants will be required to nominate referees who can recommend their application. M.Sc. applicants should have a minimum of two (2) referees wherein one was a previous course instructor or faculty/research advisor and the other a current or previous work supervisor.
To avoid issues and delays in the provision of the recommendation, applicants have to inform their referees of their nomination beforehand and provide the latter’s accurate information in the online application portal. Automated notifications will be sent out to the referees upon application submission.
Within 10 days of submitting your application, you will receive an invitation to book and complete an online screening exam that assesses knowledge and skills relevant to your chosen field. While you may choose to opt out of the screening exam, this is only recommended for applicants whose profiles already demonstrate strong evidence of the skills assessed in the exam.
Exam topics
Math: Calculus, probability theory, linear algebra, and trigonometry.
Programming: Knowledge surrounding specific programming concepts and principles such as algorithms, data structures, logic, OOP, and recursion as well as language–specific knowledge of Python.
Applicants are highly encouraged to complete the following online courses to further improve their qualifications:
For more information regarding the screening exam (e.g. process, opting out criteria, and technical specifications), register on the application portal here, and view this knowledge article.
A select number of applicants may be invited to an interview with faculty as part of the screening process. The time and instructions for this will be communicated to applicants in a timely manner.
Only one application per admission cycle must be submitted; multiple submissions are discouraged.
Application portal opens | Priority deadline* | Final deadline | Decision notification date | Offer response deadline |
---|---|---|---|---|
September 1, 2025 (8 a.m. GST) |
November 15, 2025 (5 p.m. GST) |
December 15, 2025 (5 p.m. GST) |
March 15, 2026 (5:00 p.m. GST) |
April 15, 2026 |
* Applications submitted by the priority deadline will be reviewed first. While all applications submitted by the final deadline (December 15, 2025) will be considered, applying by the priority deadline is strongly encouraged. Admissions are highly competitive and space in the incoming cohort is limited.
Detailed information on the application process and scholarships is available here.
AI is reshaping industries worldwide and MBZUAI’s research continues to highlight the impact of AI advancements across key industries.
More informationThe Incubation and Entrepreneurship Center (IEC) is a leading AI-native incubator with the aim to nurture and support the next generation of AI-driven startups.
More informationWe’ll keep you up-to-date with the latest news and when applications open.