Lin provides critical insights at SIGIR 2023

Monday, August 21, 2023

MBZUAI affiliated professor of Machine Learning Chih-Jen Lin gave one of the keynotes at the recent 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, held in Taipei, Taiwan.

Lin, also a distinguished professor at the Department of Computer Science, National Taiwan University, joined a select group of keynote speakers that included Marc Najork, Distinguished Research Scientist, Google DeepMind; Ranjitha Kumar, Associate Professor, University of Illinois at Urbana-Champaign; and Ryen W. White, General Manager and Deputy Lab Director, Microsoft Research.

Titled ‘On the “Rough Use” of Machine Learning Techniques’, Lin’s address focused on instances where machine learning techniques were employed inappropriately. Lin emphasized that such challenges are not unusual and can sometimes arise unavoidably.

Navigating the complexities of machine learning

Introducing the concept of the “rough use” of these techniques, he used two real-life stories. Firstly, he explored the realm of graph representation learning, where the evaluation of obtained representations often involves a node classification problem with multiple labels. However, Lin revealed a common unrealistic assumption: many researchers assume that they know the number of labels for each test instance during prediction. He highlighted the rarity of having such ground truth information available in practical situations.

Secondly, Lin delved into the realm of deep neural networks and how they are trained. He exposed a common misunderstanding where users incorrectly combine training, validation, and test sets in certain scenarios. By sharing real stories, he highlighted the prevalent confusion around the relationship between these sets and the potential pitfalls that can arise.

“Although the rough use of machine learning methods is common and sometimes unavoidable, the community should work together to change the culture and improve the practical use,” Lin said.

Lin’s presentation concluded with a call to action. He argued that in the intricate landscape of machine learning, achieving perfection is elusive, and sometimes, missteps are inevitable. He argued that the key to improving the situation lies in developing high-quality, user-friendly software. Such software, he posited, would significantly enhance the practical application of machine learning techniques and mitigate instances of misuse.

Related

thumbnail
Tuesday, April 15, 2025

New test that recovers hidden relationships in data to be presented at ICLR

Boyang Sun explains how his team's research into variables addresses a fundamental problem in machine learning and.....

  1. ICLR ,
  2. machine learning ,
  3. research ,
  4. statistics ,
  5. conference ,
  6. data ,
  7. variables ,
Read More
thumbnail
Monday, March 24, 2025

MBZUAI and Berkeley explore the future of machine learning

Machine learning pioneer Michael I. Jordan was among the speakers discussing the cutting-edge ideas shaping the field.

  1. workshop ,
  2. berkeley ,
  3. ML ,
  4. collaborations ,
  5. innovation ,
  6. research ,
  7. machine learning ,
Read More
thumbnail
Tuesday, March 18, 2025

Culturally Yours: A new tool for understanding cultural references in text

Researchers from MBZUAI have developed a tool that uses demographic information to help bridge linguistic and cultural.....

  1. COLING 2025 ,
  2. linguistics ,
  3. languages ,
  4. culture ,
  5. llms ,
  6. large language models ,
  7. research ,
Read More