Hanan Aldarmaki

Assistant Professor of Natural Language Processing

Research interests

Hanan Aldarmaki works on natural language and speech processing for low-resource languages, with a special focus on the Arabic language. The methods she explores include unsupervised learning, transfer learning, and distant supervision to adapt text and speech models to languages and dialects for which labeled data are scarce or non-existent. This includes studying the regularities in text and speech patterns to discover and map terms across languages or modalities, such as unsupervised dictionary induction, cross-lingual embeddings of speech and text, and unsupervised speech-to-text mapping.

Email

Prior to joining MBZUAI, Al Darmaki was an assistant professor in the department of computer science and software engineering at UAE University (UAEU). While completing her Ph.D., she worked as a teaching assistant and lecturer at George Washington University as well as on research projects at Apple Inc. and Amazon Web Services as an intern.

Before starting her Ph.D., she worked as a statistical analyst at the Statistics Center-Abu Dhabi (SCAD), and as a network engineer at Dubai Electricity and Water Authority.

  • Ph.D. in computer science from The George Washington University, USA
  • Master of Philosophy in computer speech, text, and internet technology (CSTIT) from University of Cambridge, UK
  • Bachelor of Science in computer engineering from American University of Sharjah, UAE

Aldarmaki’s current research activities include natural language processing and speech applications for low-resource languages. She also works on developing open-source speech and language models and datasets for the Arabic language and dialects. 

  • “ArTST: Arabic Text and Speech Transformer”. Processings of The First Arabic Natural Language Processing Conference (ArabicNLP 2023).
  • “ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus”.  Proceedings of INTERSPEECH 2023. 
  • “Unsupervised Automatic Speech Recognition: A Review”. Speech Communication, 2022 – Elsevier.
  • “Efficient Sentence Embedding using Discrete Cosine Transform”. Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP).
  • “Scalable Cross-Lingual Transfer of Neural Sentence Embeddings”. Proceedings of the Joint Conference on Lexical and Computational Semantics (*SEM), 2019.
  • “Context-Aware Cross-Lingual Mapping”. Proceedings of the Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2019.
  • “Evaluation of Unsupervised Compositional Representations”. Proceedings of the 27th International Conference on Computational Linguistics (COLING), 2018.
  • “Unsupervised Word Mapping Using Structural Similarities in Monolingual Embeddings”. Transactions Of The Association For Computational Linguistics (TACL), 2018.

Contact faculty affairs

Interested in working with our renowned faculty?
Fill out the below form and we will get back to you.