 
                    Around the world, people benefit, not only from the personalized care they receive, but also from the work of public health systems that provide population-level information germane to personalized treatment. AI researchers facilitate this work and enable the safe sharing of information while maintaining privacy using tools like federated learning.
Sharing anonymized health information helps us all benefit from the experiences, successes, and failures, of millions of our fellow humans, and helps doctors gain greater visibility of proven treatments and emerging threats to our health faster and more easily. But not everyone feels a responsibility to engage with such systems for the betterment of humanity.
AI system administrators must protect against hacking, which is an ongoing threat, but unfortunately, they must also battle with bad actors within the system who seek to skew the outcomes of their data. Whether malicious or not, skewing data undermines the very work researchers set out to do.
The research of a collaborative team from MBZUAI, KAUST, and Mila is aimed at helping administrators identify bad actors and minimize the negative impacts on results. What they found is enlightening about the state of information sharing systems, and how administrators of these systems might well protect the critical insights AI promises to deliver in human health.
In federated learning systems, a central administrator designs algorithms that use the data on our devices, runs computations, and then sends the anonymous, encrypted outcomes of that data to a central repository. This type of system is how we collect and analyze vast troves of health information and make safe, ethical use of it in modern, AI-driven health systems. It’s how, for example, scientists crush hundreds of millions of data points about cancer treatment down into the handful of insights and recommendations that could save lives.
MBZUAI Assistant Professor of Machine Learning Samuel Horváth and postdoctoral fellow Eduard Gorbunov work to better understand these systems, optimize them, and protect them from bad actors. On a routine basis, administrators of such a system will update the algorithm, and train the network of devices (your phone included) on the latest version of the math that will turn your data into insights and lives saved. But unfortunately, not everyone is as honest as you are.
“Some users will send back bad information, or even attacks,” Horváth said. “They’re not even necessarily trying to destroy the model, but the manner in which they skew data can be very disruptive to the findings they are based on.”
Gorbunov and Horváth, along with their co-authors, KAUST Professor of Computer Science Peter Richtárik, and Assistant Professor at Université de Montréal and core faculty member at Mila Gauthier Gidel, have a paper accepted at ICLR 2023 titled: “Variance Reduction is an Antidote to Byzantine Workers: Better Rates, Weaker Assumptions, and Communication Compression as a Cherry on the Top.”
The team found that often, defending against Byzantines is more disruptive to data than doing nothing at all. In response, Gorbunov et al. propose a new Byzantine-tolerant method that helps to stabilize training and increase speed, while outperforming benchmarks.
In the end, the team propose a solution that is worth implementing both because it improves outcomes, but also because it is moderately effective against various forms of attack. Essentially there is no “cost” associated with using their Byzantine reduction strategy, and there is no optimization “price” or speed loss either.
A team from MBZUAI is improving LLMs' performance across languages by helping them find the nuances of.....
The Arabic language is underrepresented in the digital world, making AI inaccessible for many of its 400.....
Martin Takáč and Zangir Iklassov's 'self-guided exploration' significantly improves LLM performance in solving combinatorial problems.