Recent publications
Article
Mitra, K, Malapati, A & Lee, M 2026, 'Joint multilingual adaptive attention fusion based multi-teacher KD with contrastive learning for Indic LoRes cross-domain, multi-intent NLU', Knowledge-Based Systems, vol. 340, 115726. https://doi.org/10.1016/j.knosys.2026.115726
Mitra, K, Kolasani, AV, Shruthi, PS, Chelliah, K, Malapati, A & Lee, M 2026, 'MDMIC: An Augmented Indic Corpus and Joint Multitask Attention-Based Fusion Framework for Cross-Domain, Multi-Intent NLU in LoRes Languages', IEEE Access, vol. 14, pp. 28631-28653. https://doi.org/10.1109/ACCESS.2026.3664703
Conference contribution
Liu, J, Bahja, M, Kovatchev, V & Lee, M 2026, Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning. in Proceedings of the 30th Conference on Computational Natural Language Learning. Association for Computational Linguistics, ACL, 30th Conference on Computational Natural Language Learning, San Diego, California, United States, 3/07/26.
Gamboa, LC, Feng, Y & Lee, M 2026, Robust Bias Evaluation with FilBBQ: A Filipino Bias Benchmark for Question-Answering Language Models. in Proceedings of The Fifteenth Language Resources and Evaluation Conference. Association for Computational Linguistics, ACL, Fifteenth Language Resources and Evaluation Conference, Palma, Spain, 13/05/26.
Abbas, A, Lee, M, Shanavas, N, Kovatchev, V & Ali, M 2026, Struct2Unstruct: Creating Tender NER Datasets from Structured Procurement Records using Large Language Models. in Proceedings of the fourth international workshop on the role of resources in the age of large language models (RESOURCEFUL-2026). NEALT Proceedings Series, Fourth international workshop on the role of resources in the age of large language models, Palma, Spain, 11/05/26.
Gokhan, T, Ali, M & Lee, M 2026, Summarising Regulations: An Empirical Study of Long-Document Summarisation Methods under Extreme Compression. in Natural Language Processing and Information Systems. Lecture Notes in Computer Science, Springer, The 31st Annual International Conference on Natural Language & Information Systems, Trondheim, Norway, 17/06/26.
Al Amer, S, Lee, M & Smith, P 2025, Adopting Ensemble Learning for Cross-lingual Classification of Crisis-related Text On Social Media. in AK Ojha, C Liu, E Vylomova, F Pirinen, J Abbott, J Washington, N Oco, V Malykh, V Logacheva & X Zhao (eds), Proceedings of The Seventh Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2024). Association for Computational Linguistics, ACL, pp. 159-165, Seventh Workshop on Technologies for Machine Translation of Low-Resource Languages , Bangkok, Thailand, 15/08/24. https://doi.org/10.18653/v1/2024.loresmt-1.16
Wang, Y, Dsouza, R, Lee, R, Apperly, I, Devine, RT, van der Kleij, S & Lee, M 2025, Automatic Scoring of an Open-Response Measure of Advanced Mind-Reading Using Large Language Models. in Proceedings of the 10th Workshop on Computational Linguistics and Clinical Psychology (CLPsych 2025)). Association for Computational Linguistics, ACL, pp. 79–89, The Workshop on Computational Linguistics and Clinical Psychology, Albuquerque, New Mexico, United States, 3/05/25. https://doi.org/10.18653/v1/2025.clpsych-1.7
Li, W, Li, L, Lee, M, Sun, S, Zhang, L, Xue, W & Guo, Y 2025, Bayeskd: Bayesian knowledge distillation for compact llms in constrained fine-tuning scenarios. in W Che, J Nabende, E Shutova & MT Pilehvar (eds), Findings of the Association for Computational Linguistics: ACL 2025. Association for Computational Linguistics, ACL, pp. 138-152, 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, Austria, 27/07/25. <https://aclanthology.org/2025.findings-acl.7/>
Gamboa, LC, Feng, Y & Lee, M 2025, Bias Attribution in Filipino Language Model: Extending a Bias Interpretability Metric for Application on Agglutinative Languages. in Proceedings of the 6th Workshop on Gender Bias in Natural Language Processing (GeBNLP). Association for Computational Linguistics, ACL, The 6th Workshop on Gender Bias in Natural Language Processing at ACL 2025., Vienna, Austria, 1/08/25.
Al Amer, S, Lee, M & Smith, P 2025, Comparative Evaluation of Machine Translation Models Using Human-Translated Social Media Posts as References: Human-Translated Datasets. in Proceedings of the Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2025). Association for Computational Linguistics, ACL, The Eighth Workshop on Technologies for Machine Translation of Low-Resource Languages, Albuquerque, New Mexico, United States, 3/05/25.
Preprint
Alrajeh, D, Nowack, V, Benjamin, P, Thomas, K, Hobson, W, Muñoz, CG, Hamilton-Giachritsis, C, Kloess, JA, Woodhams, J, Butler, D, Law, M, Morton, R, Costello, B, Burrell, A, Grant, T, Shah, P, de Leon, FL & Lee, M 2026 'Data-Dependent Goal Modeling for ML-Enabled Law Enforcement Systems' arXiv. https://doi.org/10.48550/arXiv.2601.06237
Gamboa, LCL, Feng, Y & Lee, M 2026 'Robust Bias Evaluation with FilBBQ: A Filipino Bias Benchmark for Question-Answering Language Models' arXiv. https://doi.org/10.48550/arXiv.2602.14466
Gamboa, LCL, Feng, Y & Lee, M 2025 'Bias Attribution in Filipino Language Models: Extending a Bias Interpretability Metric for Application on Agglutinative Languages' arXiv, pp. 1-12. https://doi.org/10.48550/arXiv.2506.07249
Liu, J, Bahja, M, Kovatchev, V & Lee, M 2025 'Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning' arXiv. https://doi.org/10.48550/arXiv.2512.05747
View all publications in research portal