Fajri Koto

Cited by

	All	Since 2019
Citations	2272	2214
h-index	19	17
i10-index	26	26

880

440

220

660

20162017201820192020202120222023202417 13 20 29 44 109 332 822 877

Co-authors

Timothy BaldwinMBZUAI and The University of MelbourneVerified email at unimelb.edu.au
Jey Han LauComputing and Information Systems, The University of MelbourneVerified email at unimelb.edu.au
Haonan LiLibrAI & MBZUAIVerified email at mbzuai.ac.ae
Mirna AdrianiFaculty of Computer Science, Universitas IndonesiaVerified email at cs.ui.ac.id
Genta Indra WinataCapital One AI FoundationsVerified email at capitalone.com
Samuel CahyawijayaHKUSTVerified email at connect.ust.hk
Rahmad MahendraUniversitas Indonesia and RMIT UniversityVerified email at cs.ui.ac.id
Sebastian RuderResearch Scientist, CohereVerified email at cohere.com
Pascale FungDept. of Electronic & Computer Engineering, the Hong Kong University of Science & TechnologyVerified email at ece.ust.hk
Afshin RahimiApplied Scientist @ AmazonVerified email at uq.edu.au
Preslav NakovMBZUAIVerified email at mbzuai.ac.ae
Graham NeubigCarnegie Mellon UniversityVerified email at cs.cmu.edu
Satoshi NakamuraNara Institute of Science and TechnologyVerified email at is.naist.jp
Tomoki TodaNagoya UniversityVerified email at icts.nagoya-u.ac.jp
Sakriani SaktiProfessor, Nara Institute of Science and TechnologyVerified email at is.naist.jp
Rico SennrichUniversity of ZurichVerified email at cl.uzh.ch
Ayu PurwariantiAssociate Professor, Informatics, Institut Teknologi Bandung, Indonesia & Co-Founder at Prosa.aiVerified email at stei.itb.ac.id
Eric XingPresident at Mohamed bin Zayed University of AI, Professor of Computer Science, Carnegie Mellon UVerified email at cs.cmu.edu
Iryna GurevychFull Professor, TU Darmstadt; Adjunct Professor, MBZUAI, UAE; Affiliated Professor, INSAIT, BulgariaVerified email at cs.tu-darmstadt.de

Fajri Koto

MBZUAI

Verified email at mbzuai.ac.ae - Homepage

Computational Linguistics Natural Language Processing Language Generation Multilingual NLP


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
NusaCrowd: Open source initiative for Indonesian NLP resources S Cahyawijaya, H Lovenia, AF Aji, G Winata, B Wilie, F Koto, R Mahendra, ... Findings of the Association for Computational Linguistics: ACL 2023, 13745-13818, 2023	1048	2023
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP F Koto, A Rahimi, JH Lau, T Baldwin Proceedings of the 28th COLING 2020, 757-770, 2020	205	2020
Inset lexicon: Evaluation of a word list for Indonesian sentiment analysis in microblogs F Koto, GY Rahmaningtyas 2017 International Conference on Asian Language Processing (IALP), 391-394, 2017	140	2017
CMMLU: Measuring Massive Multitask Language Understanding in Chinese H Li, Y Zhang, F Koto, Y Yang, H Zhao, Y Gong, N Duan, T Baldwin Findings of ACL 2024, 2024	92	2024
A comparative study on twitter sentiment analysis: Which features are good? F Koto, M Adriani Proceedings of the 20th NLDB 2015, 453-457, 2015	74	2015
SMOTE-Out, SMOTE-Cosine, and Selected-SMOTE: An Enhancement Strategy to Handle Imbalance in Data Level F Koto The 6th ICACSIS, 2014	63	2014
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization F Koto, JH Lau, T Baldwin Proceedings of EMNLP 2021, 2021	58	2021
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia AF Aji, GI Winata, F Koto, S Cahyawijaya, A Romadhony, R Mahendra, ... Proceedings of ACL 2022, 2022	55	2022
Nusax: Multilingual parallel sentiment dataset for 10 indonesian local languages GI Winata, AF Aji, S Cahyawijaya, R Mahendra, F Koto, A Romadhony, ... Proceedings of the 17th EACL 2023, 2022	49	2022
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models N Sengupta, SK Sahu, B Jia, S Katipomu, H Li, F Koto, OM Afzal, ... Technical Report, 2023	43	2023
Discourse Probing of Pretrained Language Models F Koto, JH Lau, T Baldwin Proceedings of NAACL 2021, 2021	42	2021
Bactrian-x: Multilingual replicable instruction-following models with low-rank adaptation H Li, F Koto, M Wu, AF Aji, T Baldwin arXiv preprint arXiv:2305.15011, 2023	39	2023
Liputan6: A Large-scale Indonesian Dataset for Text Summarization F Koto, JH Lau, T Baldwin Proceedings of AACL 2020, 2020	37	2020
Apparatus and method for sharing personal electronic-data of health A Kurniawan, O ABDILLAH, Fajri US Patent App. 15/221,140, 2017	36*	2017
Top-down Discourse Parsing via Sequence Labelling F Koto, JH Lau, T Baldwin Proceedings of the 16th EACL 2021, 2021	31	2021
FFCI: A framework for interpretable automatic evaluation of summarization F Koto, T Baldwin, JH Lau Journal of Artificial Intelligence Research (JAIR) 73, 1553–1607, 2022	29	2022
Llm360: Towards fully transparent open-source llms Z Liu, A Qiao, W Neiswanger, H Wang, B Tan, T Tao, J Li, Y Wang, S Sun, ... arXiv preprint arXiv:2312.06550, 2023	28	2023
A Publicly Available Indonesian Corpora for Automatic Abstractive and Extractive Chat Summarization F Koto The 10th International Conference on Language Resources and Evaluation (LREC), 2016	23	2016
HBE: Hashtag-based emotion lexicons for twitter sentiment analysis F Koto, M Adriani Proceedings of the 7th Forum for Information Retrieval Evaluation, 31-34, 2015	23	2015
The Use of POS Sequence for Analyzing Sentence Pattern in Twitter Sentiment Analysis F Koto, M Adriani The 29th International Conference on Advanced Information Networking and …, 2015	17	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors