Professor Cholakkal’s research lies at the intersection of computer vision and multimodal learning, with a focus on foundation models and large multimodal models (LMMs). His objective is to build omnimodal AI companions that seamlessly integrate vision, audio, speech, and text across different languages and cultures, and to deploy these systems in smart wearables such as smart glasses. He is also interested in applying LMMs and AI companions to healthcare and social good. To support this vision, his research program is structured around three interconnected pillars: multimodal learning, healthcare foundation models, and advanced visual recognition architectures.
Prior to joining MBZUAI, Professor Cholakkal held research and technical leadership positions at the Inception Institute of Artificial Intelligence (IIAI) in the UAE, Mercedes-Benz R&D India, BEL Central Research Laboratory (India), and the Advanced Digital Sciences Center in Singapore. With more than 12 years of experience in computer vision and multimodal AI, he bridges fundamental research, teaching, and AI product development at scale.
As a principal investigator at MBZUAI, Professor Cholakkal has secured more than eight research grants and awards, including the Meta Llama Impact Innovation Award (2024), NVIDIA Academic Grant (2025), Meta Regional Research Grant (2025), and Google Gift Research Award (2023). His research as a PI has received paper awards and recognitions, including the SAC Highlights Award at EMNLP 2025, awarded to selected top papers at the conference.
His teaching contributions at MBZUAI have been recognized through the inaugural MBZUAI Teaching Excellence Award (2025). His role in building the University as one of its founding faculty members was acknowledged through the MBZUAI Founding Service Award.
He serves in leadership roles at top AI conferences, including General Chair of ACM Multimedia Asia 2026 and Local Chair of ACCV 2028. He has also held Area Chair positions at leading conferences such as CVPR, ICLR, NeurIPS, ACM Multimedia, ECCV, and BMVC. In addition, he has organized workshops on foundation models and vision transformers at major venues including CVPR, ICCV, NeurIPS, ACCV, and ICME.
Professor Cholakkal has published more than 100 research papers and holds more than eight granted U.S. patents. This work spans three primary research pillars: multimodal learning, healthcare foundation models, and efficient visual recognition architectures. His representative research publications are listed below: