Knowledge distillation and the greening of LLMs

Tuesday, May 02, 2023

Large language models (LLMs) have burst onto the world stage with the introduction of OpenAI’s Generative Pre-trained Transformer (GPT), Google’s Language Model for Dialogue Applications (LaMDA), and many others. These models take a vast amount of data (much of the public internet, for example) and compress it into a decision-making engine that can power interfaces, answer questions, generate high-quality content, and quite a bit else.

These powerful systems have captured the global public imagination. They have also swallowed up city-sized helpings of electricity, water, and money in the process of being trained and used. Through instruction tuning and knowledge distillation, a team of researchers at MBZUAI, the University of British Columbia, and Monash University is working to drastically cut the electricity, water, and money required to train and use these models, and in the process deliver on the promise of LLMs. The team also aims to make LLMs far more secure.

A number of high-profile organizations have recently featured in the news because they compromised the security and privacy of their data by uploading it to an LLM. Associate Professor of Natural Language Processing Alham Fikri Aji and Visiting Associate Professor of Natural Language Processing and Machine Learning Muhammad Abdul-Mageed, along with team members Minghao Wu, Abdul Waheed, and Chiyu Zhang, have created LaMini-LM, a collection of language models intended for deployment in resource-constrained settings such as consumer laptops and mobile devices. Because data never has to leave the local device or network, this approach largely eliminates those security concerns while allowing institutions of all sizes to deploy the power of an LLM relatively efficiently.

“LaMini-LM is a collection of small-sized, efficient language models distilled from ChatGPT and trained on a large-scale dataset of 2.58M instructions,” Aji said. “We explore different model architectures and sizes, and extensively evaluate their performance across various NLP benchmarks and through human evaluation.”
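At a high level, the dataset behind this approach consists of instructions paired with the teacher model’s responses. The following is a minimal, hypothetical sketch of that collection step in Python; the ask_teacher() helper, the example instructions, and the output file name are placeholders for illustration, not the team’s actual pipeline or the ChatGPT API.

```python
# Hypothetical sketch of the dataset-creation step: collect instruction/response
# pairs from a teacher model. ask_teacher() is a placeholder for whatever
# ChatGPT API call is actually used; it is not part of the LaMini-LM code.
import json

def ask_teacher(instruction: str) -> str:
    """Placeholder for the real teacher call (e.g. a ChatGPT API request)."""
    return "<teacher response goes here>"

instructions = [
    "Explain knowledge distillation in one paragraph.",
    "Summarize the water cost of training a large language model.",
]

# Each instruction/response pair becomes one training example for the student.
with open("distillation_data.jsonl", "w", encoding="utf-8") as f:
    for instruction in instructions:
        response = ask_teacher(instruction)
        f.write(json.dumps({"instruction": instruction, "response": response}) + "\n")
```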

The team developed LaMini-LM by distilling knowledge from ChatGPT, much as a teacher passes on a condensed version of their knowledge to students. They asked ChatGPT questions, received answers, and used those answers to train the LaMini models. Despite taking much less time to train, these smaller LaMini models performed almost as well as their larger counterparts. The team proposes that, instead of having a workforce rely on cloud-based LLMs to answer questions and produce content, a solution such as LaMini-LM could be customized to fit an organization’s use case while helping to keep its data secure.
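The student training step can be pictured as standard fine-tuning on those instruction-response pairs. Below is a minimal sketch using Hugging Face Transformers with a small sequence-to-sequence model; the model choice, hyperparameters, and the distillation_data.jsonl file carried over from the previous sketch are illustrative assumptions, not the authors’ exact recipe.

```python
# Illustrative sketch of fine-tuning a small "student" model on the
# instruction/response pairs collected from the teacher. The model name and
# hyperparameters are assumptions for demonstration, not the LaMini-LM setup.
import json
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

model_name = "google/flan-t5-small"  # any small seq2seq model works for the sketch
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

# Load the instruction/response pairs produced in the previous step.
with open("distillation_data.jsonl", encoding="utf-8") as f:
    pairs = [json.loads(line) for line in f]

model.train()
for epoch in range(3):
    for example in pairs:
        inputs = tokenizer(example["instruction"], return_tensors="pt",
                           truncation=True, max_length=512)
        labels = tokenizer(example["response"], return_tensors="pt",
                           truncation=True, max_length=512).input_ids
        # The student learns to reproduce the teacher's response for each instruction.
        loss = model(**inputs, labels=labels).loss
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

model.save_pretrained("student-model")
tokenizer.save_pretrained("student-model")
```

Because the student only has to imitate the teacher’s outputs rather than learn from raw web-scale text, even a model small enough to run on a laptop can recover much of the teacher’s instruction-following behavior.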

Aji et al. have posted a paper on their work titled “LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions.” To learn more about the team’s data, the models they have developed, and the NLP and human evaluation results, visit https://mbzuai-nlp.github.io/LaMini/

Aji was quick to stress that this is the first iteration of LaMini-LM and that the team is actively working to improve it. LaMini-LM is part of a wider initiative at the university, carried out with a range of global collaborators, to decarbonize LLMs while making them more nimble and more secure; Vicuna is another prominent example of this kind of work.
