The goal of the Doctor of Philosophy in Machine Learning is to produce highly trained researchers for industry and academia. The program prepares students to apply the research techniques and knowledge they have gained to solve complex problems in the field of machine learning and artificial intelligence (AI).
Machine learning is the scientific study of algorithms and statistical models that computer systems use to effectively perform a specific task without using explicit instructions, relying on patterns and inference instead. These algorithms are based on mathematical models learned automatically from data, thus allowing machines to intelligently interpret and analyze input data to derive useful knowledge and arrive at important conclusions. Machine learning is heavily used for enterprise applications (e.g., business intelligence and analytics), effective web search, robotics, smart cities and understanding of the human genome.
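To make the idea concrete, the short Python sketch below (illustrative only; the data, coefficients, and variable names are invented for demonstration) infers a linear pattern from noisy observations by ordinary least squares, rather than from explicitly programmed rules:

```python
import numpy as np

# Synthetic observations: a hidden linear pattern plus noise.
# The "true" coefficients below are invented purely for illustration.
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(200, 2))               # input features
true_w = np.array([3.0, -2.0])
y = X @ true_w + 0.5 + 0.1 * rng.standard_normal(200)    # noisy targets

# "Learning": estimate the pattern from data via least squares,
# instead of programming the rule explicitly.
X_design = np.hstack([X, np.ones((200, 1))])             # add a bias column
w_hat, *_ = np.linalg.lstsq(X_design, y, rcond=None)

print(np.round(w_hat, 2))  # approximately [3.0, -2.0, 0.5], recovered from data alone
```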
The Machine Learning (ML) Department at MBZUAI is dedicated to imparting a world-class education in ML to our students. From foundational principles to advanced applications, our research-intensive education model provides students with theoretical concepts to test, under the supervision of senior AI researchers, as they tackle real-world problems and produce meaningful results.
National Qualifications Framework – three strands
The program learning outcomes (PLOs) are aligned with the Emirates Qualifications Framework and, as such, are divided into the following learning outcomes strands: knowledge (K), skills (S), and responsibility (R).
Program learning outcomes
Upon completion of the program requirements, graduates will be able to:
The PLOs are mapped to the National Qualifications Framework Level Eight (8) qualification and categorized into three domains (knowledge, skill, and responsibility) as per the National Qualifications Framework set by the UAE National Qualifications Centre (NQC) and the Ministry of Higher Education and Scientific Research (MoHESR):
PLOs | Knowledge | Skill | Responsibility |
---|---|---|---|
PLO 01 | K | – | – |
PLO 02 | K | S | – |
PLO 03 | K | S | R |
PLO 04 | – | S | – |
PLO 05 | K | S | – |
PLO 06 | – | – | R |
PLO 07 | K | S | – |
PLO 08 | K | S | – |
PLO 09 | K | S | – |
The minimum degree requirement for the Doctor of Philosophy in Machine Learning is 60 credit hours, distributed as follows:
Requirement | Number of courses | Credit hours |
---|---|---|
Core | 3 | 8 |
Electives | At least four, depending on credit hours | 16 |
Internship | At least one internship of a minimum of three months’ duration must be satisfactorily completed as a graduation requirement | 2 |
Advanced research methods | 1 | 2 |
Research thesis | 1 | 32 |
Total | 10 | 60 |
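As a quick sanity check of the distribution above (an illustrative snippet; the figures are copied directly from the table), the components sum to the required 60 credit hours:

```python
# Credit-hour distribution copied from the table above.
credits = {
    "Core": 8,
    "Electives": 16,
    "Internship": 2,
    "Advanced research methods": 2,
    "Research thesis": 32,
}

total = sum(credits.values())
print(total)        # 60
assert total == 60  # matches the stated minimum of 60 credit hours
```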
The Doctor of Philosophy in Machine Learning is primarily a research-based degree. The purpose of coursework is to equip students with the skills needed to complete their research project (thesis) successfully. All core courses are mandatory.
To accommodate students from diverse academic backgrounds, the program allows flexibility in course selection. Courses are chosen in consultation with the student’s supervisory panel, which comprises two or more faculty members. The panel helps design a personalized coursework plan for each student based on their prior academic record, experience, and planned research project.
All students must take the following core courses:
Code | Course title | Credit hours |
---|---|---|
INT899 |
Ph.D. Internship
Assumed knowledge: Prior to undertaking an internship opportunity, students must have successfully completed 24 credit hours. Course description: The MBZUAI internship with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning. |
2 |
ML899 |
Machine Learning Ph.D. Research Thesis
Assumed knowledge: Coursework plus a pass in the qualifying exam. Course description: Ph.D. thesis research exposes students to cutting-edge and unsolved research problems, where they are required to propose new solutions and significantly contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of three (3) to four (4) years. Ph.D. thesis research helps train graduates to become leaders in their chosen area of research through partly supervised study, eventually transforming them into researchers who can work independently or interdependently to carry out cutting-edge research. |
32 |
ML8101 |
Foundations of Machine Learning
Assumed knowledge: Linear algebra and probability. Proficiency in Python. Basic knowledge of machine learning. Course description: This course focuses on building foundations and introducing recent advances in machine learning, and on developing skills for performing research to advance the state-of-the-art in machine learning. This course builds upon basic concepts in machine learning and additionally assumes familiarity with fundamental concepts in optimization and math. The course covers foundations and advanced topics in probability, statistical machine learning, supervised and unsupervised learning, deep neural networks, and optimization. Students will be engaged through coursework, assignments, and projects. |
2 |
ML8102 |
Advanced Machine Learning
Assumed knowledge: This Ph.D.-level course assumes familiarity with core concepts in probability theory, including random variables and basic stochastic processes. Students should have prior exposure to fundamental machine learning methods and neural networks, as well as comfort with Python programming, ideally using frameworks such as PyTorch. Basic understanding of ordinary differential equations (ODEs) and elementary numerical methods is advantageous but not strictly required. Advanced concepts such as measure theory, stochastic differential equations, and optimal transport will be briefly reviewed during the course, though some prior exposure would be beneficial to fully engage with the material. Prerequisite course/s: ML8101 Foundations of Machine Learning Course description: This advanced course offers an in-depth exploration of diffusion models, flow matching, and consistency models, essential tools for state-of-the-art generative AI. Beginning with foundational principles, students will gain rigorous understanding of diffusion processes, stochastic differential equations, and discrete Markov chains, ensuring a robust conceptual framework. We will examine classical and modern diffusion-based generative techniques, such as denoising diffusion probabilistic models (DDPM) and score-based generative modeling (SGM), alongside detailed mathematical derivations and convergence analysis. The course progresses to flow matching, elucidating the connections and contrasts with diffusion methods through explicit mathematical formulations, focusing on optimal transport theory, continuous normalizing flows, and numerical solutions of differential equations governing generative processes. We will then dive deeply into consistency models, analyzing their theoretical foundations, fast sampling techniques, and how they bridge diffusion and flow-based approaches. The course incorporates practical implementations and case studies in Python, ensuring students achieve both theoretical depth and applied proficiency. Upon completion, participants will be equipped with comprehensive knowledge of the mathematics, theory, and practical applications behind diffusion and flow-based generative models, preparing them for advanced research or industry innovation. |
2 |
Select one course from ML808 or ML8103: |
ML808 |
Causality and Machine Learning
Assumed knowledge: Basic knowledge of linear algebra, probability, and statistical inference. Basics of machine learning. Basics of Python (or Matlab) or Pytorch. Course description: In the past decades, interesting advances were made in machine learning, philosophy, and statistics for tackling long-standing causality problems, including how to discover causal knowledge from observational data, known as causal discovery, and how to infer the effect of interventions. Furthermore, it has recently been shown that the causal perspective may facilitate understanding and solving various machine learning / artificial intelligence problems such as transfer learning, semi-supervised learning, out-of-distribution prediction, disentanglement, and adversarial vulnerability. This course is concerned with understanding causality, learning causality from observational data, and using causality to tackle a large class of learning problems. The course will include topics like graphical models, causal inference, causal discovery, and counterfactual reasoning. It will also discuss how we can learn causal representations, perform transfer learning, and understand deep generative models. |
4 |
ML8103 |
Sequential Decision Making
Assumed knowledge: Strong mathematical background including probability theory, statistics, linear algebra, and optimization. Proficiency in Python programming and deep learning frameworks. Basic understanding of machine learning and neural networks. Course description: This course provides comprehensive coverage of sequential decision–making and reinforcement learning (RL), from foundational concepts to cutting-edge applications. The course covers multi-armed bandits, Markov decision processes, value-based and policy-based methods, model-based reinforcement learning, and modern topics including distributional RL, offline learning, and the intersection of RL with large language models. Students will develop both theoretical understanding and practical research skills through weekly paper presentations and a semester-long research project. |
4 |
RES899 |
Advanced Research Methods
Course description: This course will prepare students to produce professional-quality academic research and solve practical research challenges based on innovative and ethical research principles. This course will provide exposure to a variety of research topics related to AI, research integrity, AI ethics, and organizational challenges. Students will learn to assess their own research projects and scrutinize the research methods and metrics used in their research and critically examine the ethical implications of their work. They will learn about the peer-reviewing process, participate in reviewing their classmates’ work, and learn best-practice for oral and written presentation of research. After completing the course, students will have the skills to develop a research methodology and conduct research that is rigorous and ethical. |
2 |
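For a flavor of the material named in the ML8102 description above, the following is a minimal, illustrative PyTorch sketch (not course material) of the closed-form forward noising step used in denoising diffusion probabilistic models, x_t = sqrt(abar_t) * x_0 + sqrt(1 - abar_t) * eps, where abar_t is the cumulative product of (1 - beta_s); the noise schedule, tensor shapes, and variable names are assumptions chosen only for demonstration:

```python
import torch

# Illustrative DDPM-style forward (noising) process. The linear schedule
# and image-like tensor shapes below are invented for demonstration only.
T = 1000
betas = torch.linspace(1e-4, 0.02, T)          # linear noise schedule
alphas = 1.0 - betas
alpha_bars = torch.cumprod(alphas, dim=0)      # abar_t = prod_s (1 - beta_s)

def q_sample(x0: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
    """Sample x_t ~ q(x_t | x_0) = N(sqrt(abar_t) x_0, (1 - abar_t) I)."""
    abar = alpha_bars[t].view(-1, *([1] * (x0.dim() - 1)))
    noise = torch.randn_like(x0)
    return abar.sqrt() * x0 + (1.0 - abar).sqrt() * noise

# Example: noise a batch of stand-in "images" at random timesteps.
x0 = torch.randn(8, 3, 32, 32)
t = torch.randint(0, T, (8,))
x_t = q_sample(x0, t)
print(x_t.shape)                               # torch.Size([8, 3, 32, 32])
```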
The Ph.D. research thesis exposes students to cutting-edge and unsolved research problems in the field of machine learning, where they are required to propose new solutions and significantly contribute to the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of three (3) to four (4) years.
Code | Course title | Credit hours |
---|---|---|
ML899 |
Machine Learning Ph.D. Research Thesis
Assumed knowledge or preparation: Coursework and a pass in qualifying exam. Course description: Ph.D. thesis research exposes students to cutting-edge and unsolved research problems, where they are required to propose new solutions and significantly contribute towards the body of knowledge. Students pursue an independent research study, under the guidance of a supervisory panel, for a period of three (3) to four (4) years. Ph.D. thesis research helps train graduates to become leaders in their chosen area of research through partly supervised study, eventually transforming them into researchers who can work independently or interdependently to carry out cutting-edge research. |
32 |
RES899 |
Advanced Research Methods
Course description: This course will prepare students to produce professional-quality academic research and solve practical research challenges based on innovative and ethical research principles. This course will provide exposure to a variety of research topics related to AI, research integrity, AI ethics, and organizational challenges. Students will learn to assess their own research projects and scrutinize the research methods and metrics used in their research and critically examine the ethical implications of their work. They will learn about the peer-reviewing process, participate in reviewing their classmates’ work, and learn best-practice for oral and written presentation of research. After completing the course, students will have the skills to develop a research methodology and conduct research that is rigorous and ethical. |
2 |
The MBZUAI internship with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning.
Code | Course title | Credit hours |
---|---|---|
INT899 |
Ph.D. Internship (up to four months)
Assumed knowledge: Prior to undertaking an internship opportunity, students must have successfully completed 24 credit hours. Course description: The MBZUAI internship with industry is intended to provide the student with hands-on experience, blending practical experiences with academic learning. |
2 |
Students will select a minimum of 16 credit hours of elective courses, based on their interests, proposed research thesis, and career aspirations, in consultation with their supervisory panel. The elective courses available for the Doctor of Philosophy in Machine Learning are listed below.
Code | Course title | Credit hours |
---|---|---|
Students must select at least eight (8) credit hours from the ML specialized electives below: |
ML804 |
Advanced Topics in Continuous Optimization
Assumed knowledge: Basic optimization class. Basics of linear algebra, calculus, trigonometry, and probability and statistics. Proficiency in Python and PyTorch. Course description: The course covers advanced topics in continuous optimization, such as stochastic gradient descent and its variants, methods that use more than first-order information, primal-dual methods, and methods for composite problems. Participants will read the current state-of-the-art relevant literature and prepare presentations to the other students. Participants will explore how the presented methods work for optimization problems that arise in various fields of machine learning and test them in real-world optimization formulations to get a deeper understanding of the challenges being discussed. |
4 |
ML808 |
Causality and Machine Learning
Assumed knowledge: Basic knowledge of linear algebra, probability, and statistical inference. Basics of machine learning. Basics of Python (or Matlab) or Pytorch. Course description: In the past decades, interesting advances were made in machine learning, philosophy, and statistics for tackling long-standing causality problems, including how to discover causal knowledge from observational data, known as causal discovery, and how to infer the effect of interventions. Furthermore, it has recently been shown that the causal perspective may facilitate understanding and solving various machine learning/artificial intelligence problems such as transfer learning, semi-supervised learning, out-of-distribution prediction, disentanglement, and adversarial vulnerability. This course is concerned with understanding causality, learning causality from observational data, and using causality to tackle a large class of learning problems. The course will include topics like graphical models, causal inference, causal discovery, and counterfactual reasoning. It will also discuss how we can learn causal representations, perform transfer learning, and understand deep generative models. |
4 |
ML8103 |
Sequential Decision Making
Assumed knowledge: Strong mathematical background including probability theory, statistics, linear algebra, and optimization. Proficiency in Python programming and deep learning frameworks. Basic understanding of machine learning and neural networks. Course description: This course provides comprehensive coverage of sequential decision making and reinforcement learning (RL), from foundational concepts to cutting-edge applications. The course covers multi-armed bandits, Markov decision processes, value-based and policy-based methods, model-based reinforcement learning, and modern topics including distributional RL, offline learning, and the intersection of RL with large language models. Students will develop both theoretical understanding and practical research skills through weekly paper presentations and a semester-long research project. |
4 |
ML8505 |
Tiny Machine Learning
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Proficiency in Python or similar language. Basic knowledge of machine learning and deep learning. Course description: This comprehensive Ph.D.-level course explores the intricacies of modern machine learning (ML), with a specific focus on TinyML, efficient machine learning and deep learning. Through an integration of lectures, readings, and practical labs, students will be exposed to the evolution of TinyML from its traditional roots to the deep learning era. This course will introduce efficient AI computing strategies that facilitate robust deep learning applications on devices with limited resources. We will explore various techniques including model compression, pruning, quantization, neural architecture search, as well as strategies for distributed training, such as data and model parallelism, gradient compression, and methods for adapting models directly on devices. Additionally, the course will focus on specific acceleration approaches tailored for large language models and diffusion models. Participants will gain practical experience in implementing large language models (LLMs) on standard laptops. |
2 |
ML8506 |
Interpretable AI
Assumed knowledge: Linear algebra and basic probability. Proficiency in Python programming. Fundamental understanding of machine learning concepts. Course description: What does it mean for an AI system to be interpretable? How can we design, analyze, and align machine learning models so that their inner workings are transparent, controllable, and trustworthy? In this course, we explore the cutting edge of interpretable AI—spanning mechanistic probing, explicit and implicit model constraints, and the alignment of representations and functions. Through the lens of both technical rigor and human-centered design, we will examine how interpretability impacts control, human-computer interaction, and the social context of intelligent systems. This is a research-driven course suitable for students from diverse backgrounds. If you are a strong machine learning student or a systems thinker, your analytical and technical skills will be invaluable. If you are interested in the philosophy or ethics of AI, your perspective will help ground our discussions in real-world impact. Students will engage with recent literature, hands-on model analysis, and collaborative projects. By the end of the course, you will be able to critically assess and build AI systems with interpretable structures and understand their implications for both science and society. |
2 |
ML8507 |
Predictive Statistical Inference and Uncertainty Quantification
Assumed knowledge: Basic knowledge of linear algebra, probability and statistics, calculus, and statistical inference. Course description: The study of probabilistic and statistical inference deals with the process of drawing useful conclusions about data populations or scientific truths from uncertain and noisy data. This course will cover some highly specialized topics related to statistical inference and their application to real-world problems. The main topics covered in this course are classical frequentist and Bayesian inference methods, approximate Bayesian inference, and distribution-free uncertainty quantification. |
2 |
ML8508 |
Privacy and Fairness
Assumed knowledge: Probability and statistics, linear algebra, and calculus. Course description: This course introduces the fundamental principles and practice of privacy enhancing technologies and algorithmic fairness methods for responsible AI. Students will learn about differential privacy, secure multi-party computation, homomorphic encryption and algorithmic fairness methods for individual fairness, group fairness, demographic fairness and counterfactual fairness from theoretical to practical viewpoints with a special emphasis on responsible AI for large language models (LLMs). |
2 |
ML8509 |
Collaborative Learning
Assumed knowledge: Understanding of machine learning (ML) principles and basic algorithms. Good knowledge of multivariate calculus, linear algebra, optimization, probability, and algorithms. Proficiency in some ML frameworks, e.g., PyTorch and TensorFlow. Course description: This graduate course explores a modern branch of machine learning: collaborative learning (CL). In CL, models are trained across multiple devices or organizations without requiring centralized data collection, with an emphasis on efficiency, robustness, and privacy preservation. CL encompasses approaches such as federated learning, split learning, and decentralized training. It integrates ideas from supervised and unsupervised learning, distributed and edge computing, optimization, communication compression, privacy preservation, and systems design. The field is rapidly evolving, with early production frameworks (e.g., TensorFlow Federated, Flower) and active research addressing both theoretical foundations and practical challenges. This course familiarizes students with key developments and practices including: Evaluation is based primarily on students’ paper presentations and a final project chosen by each student, encouraging hands-on engagement with cutting-edge research and applications. |
4 |
ML8510 |
Graph Machine Learning and Topics in Generative AI
Assumed knowledge: Machine learning, deep learning, graph mining, large language models (LLMs), and probability and statistics. Course description: This course offers a comprehensive introduction to graph machine/deep learning and generative AI techniques for graphs. Students will learn to represent, analyze, and model graph data using traditional algorithms, graph kernels, and neural networks. The course covers cutting-edge topics including graph similarity, graph neural networks (GNNs), unsupervised graph representation learning, and generative models like variational graph autoencoders and GraphRNN. Practical applications in biomedical data, and recommender systems are emphasized throughout. With hands-on labs using tools like NetworkX, PyTorch Geometric, and GraKel, students will gain the skills to apply graph-based machine learning to real-world challenges and emerging AI research. |
2 |
ML8511 |
Machine Learning for Industry
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Proficiency in Python or similar language. Basic knowledge of machine learning and deep learning. Course description: This comprehensive course delves into the intricacies of effectively working and succeeding in the industry. The course covers aspects often overlooked in traditional curriculum. This includes understanding of industrial workflows, and access to industrial problem-solving experience through real-world industrial projects and case-studies. This includes imparting an understanding of continuous integration/development workflows (referred to as CI/CD), systems aspects of using schedulers and resource management tools, Rest API infrastructures, micro-services, model deployment tools such as FLASK along with data management skills, container architectures and machine learning (ML) workflow management and monitoring skills needed to succeed and fit seamlessly in the industry. The course provides access to real-world industrial case studies and projects. A diverse array of industrial projects that are covered include routing and job shop scheduling problems, real-world statistical hypothesis testing with applications to identifying bad actors on social media, projects in healthcare and pharmaceutical industry such as remote healthcare monitoring, dose-response curve modeling and clinical trials for drug development, public sector projects for public safety and crime analytics, responsible AI projects for collecting demographics, privacy-preserving collection of usage-statistics from smart phones along with secure and private deployments of federated learning. |
2 |
AND select the remaining eight (8) credit hours from the list below: | ||
CBIO803 |
Single Cell Biology and Bioinformatics
Assumed knowledge: Programming in Python and Jupyter Notebooks. Familiarity with command line and GitHub. Basic artificial intelligence (AI)/machine learning (ML) knowledge. Course description: This course introduces students to the diverse landscape of biological data, including its types and characteristics and explores the foundational principles of single-cell omics bioinformatics, encompassing key methodologies, tools, and computational workflows, with an emphasis on the development of foundation models for single cell omics data from a research perspective. Single cell omics technologies are a new and fast-growing family of biological assays that enables measuring the molecular contents of individual cells with very high resolution and is key to advancing precision medicine. The course covers essential bioinformatics aspects for working with single cell omics data. |
4 |
CV801 |
Advanced Computer Vision
Assumed knowledge: Understanding of basic image processing and computer vision concepts. Hands-on experience with Python and Pytorch or equivalent language/library. Course description: This course provides a comprehensive introduction to Advanced computer vision techniques. The students will develop skills to critique the state-of-the-art computer vision research papers. The course aims at building foundation concepts for modern computer vision as well as developing expertise in several specialized areas of research in computer vision. The following topics will be covered in the course: (i) deep learning for computer vision; (ii) recent developments in convolutional neural networks and transformers; (iii) advanced techniques in object detection and segmentation; (iv) advanced vision applications such as medical image segmentation and remote sensing change detection; (v) development of efficient computer vision architectures; (vi) human centric vision; and (vii) introduction to vision language models and diffusion models. |
4 |
CV802 |
Advanced 3D Computer Vision
Assumed knowledge: Linear algebra and numerical methods, or equivalent hands-on experience with Python and C++ or an equivalent language/library. Basic knowledge of computer vision. Course description: The course provides in-depth coverage of special topics in 3D computer vision. Students will critique state-of-the-art methods on multi-view stereo, 3D reconstruction, 3D shape analysis, and 3D deep learning and synthesis. Students will implement papers to accomplish the following goals: (1) reproduce results reported in the papers; and (2) improve the performance of published peer-reviewed works. This course assumes that the students are familiar with the basic concepts of computer vision, linear algebra, and numerical methods. |
4 |
CV803 |
Advanced Techniques in Visual Object Recognition and Detection
Assumed knowledge: Hands-on experience with Python and Pytorch or equivalent language/library. Prerequisite course/s: CV801 Advanced Computer Vision or equivalent Course description: This course provides focused coverage of special topics on visual object recognition (image classification), detection and segmentation. The students will develop skills to critique the state-of-the-art works on visual object recognition, detection and segmentation. Moreover, students will be required to implement papers with the following aims: (1) reproduce results reported in the seminal research papers; and (2) improve the performance of the published works. This course assumes familiarity with fundamental concepts in computer vision and machine learning. |
4 |
CV805 |
Life-long Learning Agents for Vision
Assumed knowledge: Basics of linear algebra, calculus, computer vision/machine learning, and probability and statistics demonstrated through relevant coursework. Proficiency in Python and Pytorch. Course description: In the field of computer vision, models have typically been trained to perform well on a specific task or dataset by maximizing performance on a validation set. However, this approach only represents a small part of the types of scenarios that are of interest in real-world applications. In recent years, there has been growing interest in exploring different approaches to learning that can be applied in more diverse and dynamic environments. These approaches, which include lifelong learning, continual learning, meta-learning, transfer learning, multi-task learning, and out-of-distribution generalization, aim to enable models to be more robust, efficient, versatile, and well-behaved in non-stationary settings. This graduate course will focus on these emerging learning paradigms and how they can be applied to computer vision and multimodal learning tasks. |
4 |
CV806 |
Advanced Topics in Vision and Language
Assumed knowledge: Basics of linear algebra, calculus, computer vision/machine learning, and probability and statistics demonstrated through relevant coursework. Proficiency in Python and Pytorch. Course description: Vision and language encode complementary information and have long been studied together. With the advent of large language models (LLMs), vision-language models are now more popular than ever and represent one of the most active areas of modern computer vision. This course will cover learning methods and joint models for image and text modalities and will address a wide range of problems including vision-language pre-training, text-based image search, image and video captioning, visual question answering, visual dialog, text-to-image synthesis as well as vision-language navigation and manipulation. |
4 |
CV807 |
Safe and Robust Computer Vision
Assumed knowledge: Knowledge of linear algebra, calculus, probability, and statistics. Basics of computer vision/machine learning demonstrated through relevant coursework. Proficiency in Python and Pytorch or equivalent library. Course description: Computer vision/machine learning systems are typically designed to operate under benign scenarios by trusted users. Recently, several studies have shown that computer vision systems have vulnerabilities, which can be exploited by adversaries to compromise the integrity and availability of such systems. These vulnerabilities include both inference-time evasion (adversarial) attacks and training-time poisoning (backdoor) attacks. Many techniques have also been proposed to counter these threats. This advanced graduate course will focus on analyzing these adversarial security threats and potential countermeasures. |
4 |
CV8502 |
Trustworthy Medical Vision
Assumed knowledge: A solid understanding of machine learning and deep learning principles, including supervised and unsupervised learning, neural networks, and convolutional architectures; familiarity with probability, statistics, and linear algebra; and practical experience with Python programming and deep learning frameworks such as PyTorch or TensorFlow. Prior exposure to medical imaging or healthcare data is helpful but not required. Course description: This advanced Ph.D.-level course explores the foundations and frontiers of building trustworthy AI systems for medical imaging and healthcare applications. Key topics include robustness to distribution shifts, fairness across patient subgroups, interpretability and explainability, privacy-preserving machine learning, and the regulatory and ethical frameworks surrounding clinical AI deployment. The course emphasizes both theoretical understanding and practical implementation, with students critically engaging with recent research papers, exploring open challenges, and developing practical solutions using real-world medical datasets. |
2 |
CV8503 |
Recent Topics in Large Multimodal Models
Assumed knowledge: Basics of linear algebra, calculus, computer vision/machine learning, and probability and statistics demonstrated through relevant coursework. Proficiency in Python and Pytorch. Course description: Building on the advances of large language models, conversational large multimodal models (LMMs) have recently become increasingly popular for multimodal understanding and interaction. By integrating visual perception with natural language processing, these conversational LMMs are redefining how machines effectively interpret and analyze complex multimodal information across diverse domains. The course examines recent advances in computer vision, with a focus on conversational LMMs for multimodal understanding. The course covers fundamentals of LMMs, LMMs for discriminative vision tasks (e.g., region and pixel grounding in images and videos), challenges in LMMs such as hallucination and bias (including across languages and cultures), and emerging areas such as LMMs for open-vocabulary perception and visual reasoning. |
2 |
ML806 |
Advanced Topics in Reinforcement Learning
Assumed knowledge: Good understanding of basic reinforcement learning (RL). Basics of linear algebra, calculus, trigonometry, and probability and statistics. Proficiency in Python and good knowledge of the PyTorch library. Course description: The course covers advanced topics in reinforcement learning (RL). Participants will read the current state-of-the-art relevant literature and prepare presentations to the other students. Participants will explore how the presented methods work in simplified computing environments to get a deeper understanding of the challenges that are being discussed. Topics discussed include exploration, imitation learning, hierarchical RL, and multi-agent RL in both competitive and collaborative settings. The course will also explore multitask and transfer learning in the RL setting. |
4 |
ML808 |
Causality and Machine Learning
Assumed knowledge: Basic knowledge of linear algebra, probability, and statistical inference. Basics of machine learning. Basics of Python (or Matlab) or Pytorch. Course description: In the past decades, interesting advances were made in machine learning, philosophy, and statistics for tackling long-standing causality problems, including how to discover causal knowledge from observational data, known as causal discovery, and how to infer the effect of interventions. Furthermore, it has recently been shown that the causal perspective may facilitate understanding and solving various machine learning/artificial intelligence problems such as transfer learning, semi-supervised learning, out-of-distribution prediction, disentanglement, and adversarial vulnerability. This course is concerned with understanding causality, learning causality from observational data, and using causality to tackle a large class of learning problems. The course will include topics like graphical models, causal inference, causal discovery, and counterfactual reasoning. It will also discuss how we can learn causal representations, perform transfer learning, and understand deep generative models. |
4 |
ML809 |
Advanced Learning Theory
Assumed knowledge: Understanding of machine learning (ML) principles. Good knowledge of multivariate calculus, linear algebra, optimization, probability, and algorithms. Proficiency in some ML frameworks e.g., PyTorch and TensorFlow. Course description: This course is an introduction to the core ideas and theories of statistical learning theory, and their uses in designing and analyzing machine learning systems. Statistical learning theory studies how to fit predictive models to training data, usually by solving an optimization problem, in such a way that the model will predict well, on average, on new data. |
4 |
ML813 |
Dimensionality Reduction and Manifold Learning
Assumed knowledge: Advanced calculus, and probability and statistics. Proficiency in programming. Foundations of machine learning. Good knowledge of optimization tools. Course description: The course focuses on building foundations and introducing recent advances in dimensionality reduction and manifold learning, which are key topics in machine learning. This course builds upon fundamental concepts in machine learning and assumes familiarity with concepts in optimization and advanced calculus. The course covers advanced topics in spectral, probabilistic, and neural network-based dimensionality reduction and manifold learning, as well as contrastive learning and disentangled representation learning. Students will be engaged through coursework, assignments, presentations, and projects. |
4 |
ML814 |
Selected Topics in Machine Learning
Assumed knowledge: Foundational knowledge in machine learning. Proficiency in mathematics and statistics. Strong programming skills. Experience with deep learning. Prerequisite course/s: ML8101 Foundations of Machine Learning Course description: This advanced course offers an in-depth exploration of state-of-the-art techniques and concepts in machine learning. Covering a broad spectrum of topics, the course delves into advanced optimization methods, causality in machine learning, and the intricacies of large model architectures. Through a combination of lectures and lab sessions, students will gain hands-on experience in designing, implementing, and optimizing cutting-edge machine learning models. The course also emphasizes critical evaluation of current research and the application of multimodal approaches to complex problems. This curriculum is tailored to equip students with the theoretical knowledge and practical skills necessary to contribute to and lead in the evolving field of machine learning. |
4 |
ML815 |
Advanced Parallel and Distributed Machine Learning Systems
Assumed knowledge: Familiarity with fundamental concepts in machine learning. Familiarity with writing machine learning programs. Course description: Training the largest machine learning (ML) programs requires petaFLOPs (10^15) to exaFLOPs (10^18) of computing operations, as well as multiple terabytes (10^12 bytes) of hardware accelerator memory. Accordingly, 100s to 1000s of these accelerators are needed to satisfy both the computing and memory requirements of large-scale ML. This course covers systems architecture design, communication strategies and algorithmic modifications required to execute ML training in a parallel and distributed fashion across many network-connected hardware accelerators. In the first part of the course, students will learn a comprehensive set of principles, representations, and performance metrics for parallelizing ML programs and learning algorithms, as well as learn how to compare and evaluate different parallel ML strategies composed out of basic parallel ML “aspects”. In the second part of the course, students will apply these skills to read and critique peer-reviewed literature on parallel and distributed ML systems. |
4 |
ML817 |
AI for Science and Engineering
Assumed knowledge: Proficiency in Python/PyTorch. Solid understanding of calculus. Fundamental knowledge of algebra. Grasp of probability and statistics. Course description: This comprehensive course covers a wide array of topics, including classical paradigms for AI in science, active learning, constraints handling in machine learning, AI-driven solutions for renewable energy sources and smart grids, physics-informed AI, and AI-driven approaches in material science and catalysis. Through a balanced blend of theoretical lectures and practical lab sessions, students will delve into the latest AI methodologies, learning how to apply these advanced techniques to address complex challenges in scientific research and engineering solutions. The course emphasizes hands-on experience with real-world datasets, preparing students to become leaders in leveraging AI for innovative solutions across diverse domains. It is designed for those seeking to make a significant impact at the intersection of AI and domain-specific applications such as physics, material science and renewable energy, driving forward the frontiers of research and industry practices. |
4 |
ML818 |
Emerging Topics in Trustworthy ML
Assumed knowledge: Knowledge of linear algebra, calculus, probability, and statistics. Basics of computer vision/machine learning demonstrated through relevant coursework. Proficiency in Python and Pytorch or equivalent library Course description: This course will provide students with a deep dive into key issues related to trustworthy and responsible machine learning. Students will learn about adversarial/poisoning and privacy/confidentiality attacks against machine learning systems and defense mechanisms to mitigate them. The course will approach adversarial machine learning through an optimization and game-theoretic framework. The emphasis on privacy-preserving computation for data science and machine learning would be through a formal mathematical notion of privacy called differential privacy. The course will also cover other ethical issues in machine learning such as fairness, safe unbiased and responsible content generation, watermarking for content authentication and ownership verification (provenance), and attacks such as deep fakes, their detection and verification of robustness of relevant defenses (multimedia forensics). |
4 |
ML8103 |
Sequential Decision Making
Assumed knowledge: Strong mathematical background including probability theory, statistics, linear algebra, and optimization. Proficiency in Python programming and deep learning frameworks. Basic understanding of machine learning and neural networks. Course description: This course provides comprehensive coverage of sequential decision making and reinforcement learning (RL), from foundational concepts to cutting-edge applications. The course covers multi-armed bandits, Markov decision processes, value-based and policy-based methods, model-based reinforcement learning, and modern topics including distributional RL, offline learning, and the intersection of RL with large language models. Students will develop both theoretical understanding and practical research skills through weekly paper presentations, and a semester-long research project. |
2 |
ML8501 |
Algorithms for Big Data
Assumed knowledge: Good knowledge of calculus, linear algebra, and probability and statistics. Course description: This is an advanced course on algorithms for big data that involves the use of randomized methods, such as sketching, to provide dimensionality reduction. It also discusses topics such as subspace embeddings and low-rank approximation. The course lies at the intersection of machine learning and statistics. |
2 |
ML8502 |
Machine Learning Security
Assumed knowledge: Knowledge of linear algebra, calculus, and probability and statistics. Basics of computer vision/machine learning demonstrated through relevant coursework. Proficiency in Python and PyTorch or equivalent library. Course description: This course studies how to design patterns to enable reliable and secure deployments of large-scale machine learning systems. We will begin with fundamentals, such as training- and inference-time attacks, also called data poisoning and adversarial attacks, and discuss methods with provable robustness guarantees. Building on that foundation, we examine LMM-specific risks, such as indirect prompt injection, vulnerabilities of MCP servers, jailbreak attacks, red-teaming, and methods for responsible deployment (e.g., watermarking, safety, harmful content suppression), before expanding to multi-agent deception, strategic ML, and scalable human-in-the-loop oversight. The goal of this class is to build a deep foundational understanding of ML security, AI-governance tooling and frontier-model safety to prepare students in identifying vulnerabilities and develop forward-looking mitigations. |
4 |
ML8503 |
Probabilistic Graphical Models
Assumed knowledge: Basic knowledge of probability theory and statistical inference. Familiarity with basic algorithms and programming. Course description: This course provides a comprehensive introduction to probabilistic graphical models (PGMs), a powerful framework for representing and reasoning with uncertainty in complex systems. The course delves into the core concepts of representation, inference, and learning in graphical models. Topics include Bayesian networks (directed acyclic graphs), Markov networks (undirected graphs), and their extensions. The course will cover exact and approximate inference algorithms (e.g., variable elimination, message passing, sampling, variational methods) and learning paradigms for both parameters and model structure from data. The course will equip students with the theoretical foundation and practical skills to design, implement, and apply PGMs to real-world problems. |
2 |
ML8504 |
Data Synthesis
Assumed knowledge: Basic understanding of linear algebra, calculus, and probability and statistics. Familiarity with basic optimization concepts (e.g., gradient descent). Basic knowledge of machine learning fundamentals, including supervised learning and neural networks. Proficiency in Python programming, including use of libraries such as NumPy, PyTorch, etc. Course description: This Ph.D.-level course offers a comprehensive introduction to deep generative models and their applications in data synthesis across multiple modalities. Students will explore foundational probabilistic models and modern neural approaches, including variational autoencoders (VAEs), generative adversarial networks (GANs), flow-based models, and denoising diffusion probabilistic models (DDPMs). The course also covers advanced generative paradigms such as energy-based models, score-based methods, autoregressive transformers, and diffusion language models. A key focus is on the generation of high-fidelity synthetic data—images, text, audio, and video—and its implications for creativity, simulation, data augmentation, and reasoning. Students will gain hands-on experience with model design and training, as well as critical insight into evaluation, ethical considerations, and emerging applications. By the end of the course, students will be equipped to understand, implement, and innovate in the domain of generative AI. |
2 |
ML8505 |
Tiny Machine Learning
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Proficiency in Python or similar language. Basic knowledge of machine learning and deep learning. Course description: This comprehensive Ph.D.-level course explores the intricacies of modern machine learning (ML), with a specific focus on TinyML, efficient machine learning and deep learning. Through an integration of lectures, readings, and practical labs, students will be exposed to the evolution of TinyML from its traditional roots to the deep learning era. This course will introduce efficient AI computing strategies that facilitate robust deep learning applications on devices with limited resources. We will explore various techniques including model compression, pruning, quantization, neural architecture search, as well as strategies for distributed training, such as data and model parallelism, gradient compression, and methods for adapting models directly on devices. Additionally, the course will focus on specific acceleration approaches tailored for large language models and diffusion models. Participants will gain practical experience in implementing large language models (LLMs) on standard laptops. |
2 |
ML8506 |
Interpretable AI
Assumed knowledge: Linear algebra and basic probability. Proficiency in Python programming. Fundamental understanding of machine learning concepts. Course description: What does it mean for an AI system to be interpretable? How can we design, analyze, and align machine learning models so that their inner workings are transparent, controllable, and trustworthy? In this course, we explore the cutting edge of interpretable AI—spanning mechanistic probing, explicit and implicit model constraints, and the alignment of representations and functions. Through the lens of both technical rigor and human-centered design, we will examine how interpretability impacts control, human-computer interaction, and the social context of intelligent systems. This is a research-driven course suitable for students from diverse backgrounds. If you are a strong machine learning student or a systems thinker, your analytical and technical skills will be invaluable. If you are interested in the philosophy or ethics of AI, your perspective will help ground our discussions in real-world impact. Students will engage with recent literature, hands-on model analysis, and collaborative projects. By the end of the course, you will be able to critically assess and build AI systems with interpretable structures and understand their implications for both science and society. |
2 |
ML8507 |
Predictive Statistical Inference and Uncertainty Quantification
Assumed knowledge: Basic knowledge of linear algebra, probability and statistics, calculus, and statistical inference. Course description: The study of probabilistic and statistical inference deals with the process of drawing useful conclusions about data populations or scientific truths from uncertain and noisy data. This course will cover some highly specialized topics related to statistical inference and their application to real-world problems. The main topics covered in this course are classical frequentist and Bayesian inference methods, approximate Bayesian inference, and distribution-free uncertainty quantification. |
2 |
ML8508 |
Privacy and Fairness
Assumed knowledge: Probability and statistics, linear algebra, and calculus. Course description: This course introduces the fundamental principles and practice of privacy enhancing technologies and algorithmic fairness methods for responsible AI. Students will learn about differential privacy, secure multi-party computation, homomorphic encryption and algorithmic fairness methods for individual fairness, group fairness, demographic fairness and counterfactual fairness from theoretical to practical viewpoints with a special emphasis on responsible AI for large language models (LLMs). |
2 |
ML8509 |
Collaborative Learning
Assumed knowledge: Understanding of machine learning (ML) principles and basic algorithms. Good knowledge of multivariate calculus, linear algebra, optimization, probability, and algorithms. Proficiency in some ML frameworks, e.g., PyTorch and TensorFlow. Course description: This graduate course explores a modern branch of machine learning: collaborative learning (CL). In CL, models are trained across multiple devices or organizations without requiring centralized data collection, with an emphasis on efficiency, robustness, and privacy preservation. CL encompasses approaches such as federated learning, split learning, and decentralized training. It integrates ideas from supervised and unsupervised learning, distributed and edge computing, optimization, communication compression, privacy preservation, and systems design. The field is rapidly evolving, with early production frameworks (e.g., TensorFlow Federated, Flower) and active research addressing both theoretical foundations and practical challenges. This course familiarizes students with key developments and practices including: Evaluation is based primarily on students’ paper presentations and a final project chosen by each student, encouraging hands-on engagement with cutting-edge research and applications. |
4 |
ML8510 |
Graph Machine Learning and Topics in Generative AI
Assumed knowledge: Machine learning, deep learning, graph mining, large language models (LLMs), and probability and statistics. Course description: This course offers a comprehensive introduction to graph machine/deep learning and generative AI techniques for graphs. Students will learn to represent, analyze, and model graph data using traditional algorithms, graph kernels, and neural networks. The course covers cutting-edge topics including graph similarity, graph neural networks (GNNs), unsupervised graph representation learning, and generative models like variational graph autoencoders and GraphRNN. Practical applications in biomedical data, and recommender systems are emphasized throughout. With hands-on labs using tools like NetworkX, PyTorch Geometric, and GraKel, students will gain the skills to apply graph-based machine learning to real-world challenges and emerging AI research. |
2 |
ML8511 |
Machine Learning for Industry
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Proficiency in Python or similar language. Basic knowledge of machine learning and deep learning. Course description: This comprehensive course delves into the intricacies of effectively working and succeeding in the industry. The course covers aspects often overlooked in traditional curriculum. This includes understanding of industrial workflows, and access to industrial problem-solving experience through real-world industrial projects and case-studies. This includes imparting an understanding of continuous integration/development workflows (referred to as CI/CD), systems aspects of using schedulers and resource management tools, Rest API infrastructures, micro-services, model deployment tools such as FLASK along with data management skills, container architectures and machine learning (ML) workflow management and monitoring skills needed to succeed and fit seamlessly in the industry. The course provides access to real-world industrial case studies and projects. A diverse array of industrial projects that are covered include routing and job shop scheduling problems, real-world statistical hypothesis testing with applications to identifying bad actors on social media, projects in healthcare and pharmaceutical industry such as remote healthcare monitoring, dose-response curve modeling and clinical trials for drug development, public sector projects for public safety and crime analytics, responsible AI projects for collecting demographics, privacy-preserving collection of usage-statistics from smart phones along with secure and private deployments of federated learning. |
2 |
NLP805 |
Natural Language Processing - Ph.D.
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Programming in Python or similar language. Antirequisite course/s: NLP701 Natural Language Processing Course description: This course focuses on recent research in natural language processing (NLP) and on developing skills for performing research to advance the state-of-the-art in NLP. |
4 |
NLP806 |
Advanced Natural Language Processing - Ph.D.
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Programming in Python or similar language. Course description: This course focuses on recent topics in natural language processing (NLP) and on developing skills for performing research to advance the state-of-the-art in NLP. Specifically, this course will cover fundamentals of LLMs such as the transformer architecture, methods for training and evaluating LLMs via distributed training and efficiency techniques, and applications in multilinguality, translation, and multimodality. |
4 |
NLP807
Speech Processing - Ph.D.
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Programming in Python or a similar programming language. Course description: This course provides a comprehensive introduction to speech processing. It focuses on developing knowledge of the state of the art in a wide range of speech processing tasks and on building readiness to perform research that advances the state of the art in these areas. Topics include speech production, speech signal analysis, automatic speech recognition, speech synthesis, neural speech recognition and synthesis, and recent topics in foundation models and speech processing.
Credit hours: 4
NLP808
Current Topics in Natural Language Processing
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Programming in Python or a similar programming language. Understanding of natural language processing (NLP) methods. Prerequisite course/s: NLP805 Natural Language Processing – Ph.D. Course description: This course focuses on recent topics in natural language processing (NLP) and on developing the skills needed to perform research that advances the state of the art in NLP.
Credit hours: 4
NLP809
Advanced Speech Processing
Assumed knowledge: Understanding of calculus, algebra, and probability and statistics. Programming in Python or a similar programming language. Prerequisite course/s: NLP807 Speech Processing – Ph.D. Course description: This course explores cutting-edge techniques and methodologies in the field of speech processing. It covers advanced topics such as automatic speech recognition, language modeling and decoding, speech synthesis, speaker identification, speaker diarization, paralinguistic analysis, speech translation and summarization, multilinguality and low-resource languages, and spoken dialog systems. Students will delve into modern models and frameworks for these speech tasks. The course emphasizes both theoretical understanding and practical implementation, fostering the skills necessary for innovative research and development in speech technologies.
Credit hours: 4
NLP8501
Vision-to-Language Generation
Assumed knowledge: Basic concepts in linear algebra, calculus, and probability and statistics. Programming skills in Python. Course description: The course introduces students to the topic of vision-language generation and related emerging topics in the field. The course prepares them to perform research that advances the state of the art in this research area.
Credit hours: 2
MBZUAI accepts applicants who hold a completed degree in a STEM field such as computer science, electrical engineering, computer engineering, mathematics, physics, or another relevant science or engineering major, demonstrating academic distinction in a discipline appropriate for the doctoral degree – either:
Applicants must provide their completed degree certificates and official transcripts when submitting their application. Senior-level students can initially apply with a copy of their official transcript and an expected graduation letter; upon admission, they must submit the official completed degree certificate and transcript. A degree attestation from the UAE Ministry of Higher Education and Scientific Research (for degrees earned in the UAE) or a Certificate of Recognition (for degrees earned outside the UAE) must also be furnished within the student's first semester at MBZUAI.
All applicants whose first language is not English must demonstrate proficiency in English through one of the following:
*Exams must be administered at an approved physical test center. Home Edition exams are not accepted.
English language proficiency waiver eligibility
Applicants may qualify for a waiver if they meet one of the following conditions:
English language requirement deadline: The English language requirement should be met by the application deadline. However, applicants who need more time to satisfy this requirement have until a final deadline of March 1.
Submission of GRE scores is optional for all applicants but will be considered a plus during the evaluation.
In a 500- to 1000-word essay, explain why you would like to pursue a graduate degree at MBZUAI and include the following information:
The research statement is a document summarizing the potential research project an applicant is interested in working on; it should clearly identify the research gap the applicant would like to fill during the course of their study. It must be framed in the context of the existing literature, provide an overview of how the applicant aims to investigate the proposed research project, and predict the expected outcomes. It should explain the relevance and suitability of the applicant's background and experience to the project and highlight the project's scientific and commercial significance. The research statement should include the following details:
Applicants are expected to write the research statement independently. MBZUAI faculty will NOT help write it for the application. The MBZUAI Admission Committee will review the submitted document and use it as one of the measures to assess applicants' skills.
Applicants are required to nominate referees who can recommend their application. Ph.D. applicants should have a minimum of three (3) referees, of whom at least one should be a previous course instructor or faculty/research advisor and the others current or previous work supervisors.
To avoid issues and delays in the provision of recommendations, applicants should inform their referees of their nomination beforehand and provide the referees' accurate contact information in the online application portal. Automated notifications will be sent to the referees upon application submission.
Within 10 days of submitting your application, you will receive an invitation to book and complete an online screening exam that assesses knowledge and skills relevant to your chosen field. While you may choose to opt out of the screening exam, this is only recommended for applicants whose profiles already demonstrate strong evidence of the skills assessed in the exam.
Exam topics
Math: Calculus, probability theory, linear algebra, and trigonometry.
Programming: Specific programming concepts and principles such as algorithms, data structures, logic, object-oriented programming (OOP), and recursion, as well as language-specific knowledge of Python.
Machine learning: Supervised and unsupervised learning, neural networks, and optimization.
Applicants are highly encouraged to complete the following online courses to further improve their qualifications:
For more information regarding the screening exam (e.g., process, opt-out criteria, and technical specifications), register on the application portal here and view this knowledge article.
A select number of applicants may be invited to an interview with faculty as part of the screening process. The time and instructions for this will be communicated to applicants in a timely manner.
Only one application may be submitted per admission cycle; multiple submissions are discouraged.
Application portal opens | Priority deadline* | Final deadline | Decision notification date | Offer response deadline
---|---|---|---|---
September 1, 2025 (8 a.m. GST) | November 15, 2025 (5 p.m. GST) | December 15, 2025 (5 p.m. GST) | March 15, 2026 (5 p.m. GST) | April 15, 2026
* Applications submitted by the priority deadline will be reviewed first. While all applications submitted by the final deadline (December 15, 2025) will be considered, applying by the priority deadline is strongly encouraged. Admissions are highly competitive and space in the incoming cohort is limited.
Detailed information on the application process and scholarships is available here.
AI is reshaping industries worldwide and MBZUAI’s research continues to highlight the impact of AI advancements across key industries.
The Incubation and Entrepreneurship Center (IEC) is a leading AI-native incubator that aims to nurture and support the next generation of AI-driven startups.