Simplifying Paragraph-level Question Generation via Transformer Language Models LE Lopez, DK Cruz, JCB Cruz, C Cheng Pacific Rim International Conference on Artificial Intelligence, 323-334, 2021 | 96* | 2021 |
Localization of Fake News Detection via Multitask Transfer Learning JCB Cruz, JA Tan, C Cheng Proceedings of the 12th International Conference on Language Resources and …, 2020 | 50 | 2020 |
Establishing baselines for text classification in low-resource languages JCB Cruz, C Cheng arXiv preprint arXiv:2005.02068, 2020 | 48 | 2020 |
Multilingual large language models are not (yet) code-switchers R Zhang, S Cahyawijaya, JCB Cruz, GI Winata, AF Aji arXiv preprint arXiv:2305.14235, 2023 | 38 | 2023 |
Evaluating language model finetuning techniques for low-resource languages JCB Cruz, C Cheng arXiv preprint arXiv:1907.00409, 2019 | 37 | 2019 |
Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages ZX Yong, R Zhang, JZ Forde, S Wang, A Subramonian, H Lovenia, ... Sixth Workshop on Computational Approaches to Linguistic Code-Switching, 2023 | 34* | 2023 |
Improving Large-scale Language Models and Resources for Filipino JCB Cruz, C Cheng Proceedings of the 13th Conference on Language Resources and Evaluation …, 2022 | 25 | 2022 |
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark D Romero, C Lyu, HA Wibowo, T Lynn, I Hamed, AN Kishore, A Mandal, ... arXiv preprint arXiv:2406.05967, 2024 | 17 | 2024 |
Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets JCB Cruz, JK Resabal, J Lin, DJ Velasco, C Cheng Pacific Rim International Conference on Artificial Intelligence, 86-99, 2021 | 16* | 2021 |
Building Guitar Strum Models for an Interactive Air Guitar Prototype JE Tamani, JCB Cruz, JR Cruzada, J Valenzuela, KG Chan, JA Deja Proceedings of the 4th International Conference on Human-Computer …, 2018 | 9 | 2018 |
Towards automatic construction of filipino wordnet: Word sense induction and synset induction using sentence embeddings DJ Velasco, A Alba, TG Pelagio, BA Ramirez, JCB Cruz, U Chua, ... Proceedings of the First Workshop in South East Asian Language Processing, 1-12, 2023 | 5* | 2023 |
Samsung R&D Institute Philippines at WMT 2023 JCB Cruz arXiv preprint arXiv:2310.16322, 2023 | 5 | 2023 |
Data Processing Matters: SRPH-Konvergen AI’s Machine Translation System for WMT’21 L Sutawika, JCB Cruz Proceedings of the Sixth Conference on Machine Translation, 436-443, 2021 | 3 | 2021 |
WORLDCUISINES: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines GI Winata, F Hudi, PA Irawan, D Anugraha, RA Putri, Y Wang, A Nohejl, ... arXiv preprint arXiv:2410.12705, 2024 | 2 | 2024 |
Using Synthetic Data to Train a Conversational Response Generation Model in Low Resource Settings DA Co, S Ng, GL Tan, AP Ty, JB Cruz, C Cheng 2022 International Conference on Asian Language Processing (IALP), 306-311, 2022 | 2* | 2022 |
Samsung Research Philippines-Datasaur AI’s Submission for the WMT22 Large Scale Multilingual Translation Task JCB Cruz, L Sutawika Proceedings of the Seventh Conference on Machine Translation: Shared Task Papers, 2022 | 2 | 2022 |
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages H Lovenia, R Mahendra, SM Akbar, LJV Miranda, J Santoso, E Aco, ... arXiv preprint arXiv:2406.10118, 2024 | 1 | 2024 |
Samsung R&D Institute Philippines@ WMT 2024 Low-Resource Languages of Spain Shared Task DJ Velasco, MA Rufino, JCB Cruz Proceedings of the Ninth Conference on Machine Translation, Miami …, 2024 | 1 | 2024 |
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation JCB Cruz Proceedings of the First Workshop on Language Models for Low-Resource …, 2025 | | 2025 |
Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense S Cahyawijaya, R Zhang, H Lovenia, JCB Cruz, H Nomoto, AF Aji arXiv preprint arXiv:2410.21573, 2024 | | 2024 |