Code-Mixed Sentiment Analysis using Transformer for Twitter Social Media Data

Astuti, Laksmita Widya and Sari, Yunita and Suprapto, Suprapto (2023) Code-Mixed Sentiment Analysis using Transformer for Twitter Social Media Data. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 14 (10). pp. 498-504.

Full text not available from this repository. (Request a copy)

Abstract

underrepresentation of the Indonesian language in the field of Natural Language Processing (NLP) can be attributed to several key factors, including the absence of annotated datasets, limited language resources, and a lack of standardization in these resources. One notable linguistic phenomenon in Indonesia is code-mixing between Bahasa Indonesia and English, which is influenced by various sociolinguistic factors, including individual speaker characteristics, the linguistic environment, the societal status of languages, and everyday language usage. In an effort to address the challenges posed by code-mixed data, this research project has successfully created a code-mixed dataset for sentiment analysis. This dataset was constructed based on keywords derived from the sociolinguistic phenomenon observed among teenagers in South Jakarta. Utilizing this newly developed dataset, we conducted a series of experiments employing different pre-processing techniques and pre-trained models. The results of these experiments have demonstrated that the IndoBERTweet pre-trained model is highly effective in solving sentiment analysis tasks when applied to Indonesian-English code-mixed data. These experiments yielded an average precision of 76.07%, a recall of 75.52%, an F-1 score of 75.51%, and an accuracy of 76.56%.

Item Type: Article
Uncontrolled Keywords: Sentiment analysis; code-mixed; BERT; bahasa Indonesia
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Mathematics and Natural Sciences > Computer Science & Electronics Department
Depositing User: Sri JUNANDI
Date Deposited: 28 Nov 2024 08:49
Last Modified: 28 Nov 2024 08:49
URI: https://ir.lib.ugm.ac.id/id/eprint/11755

Actions (login required)

View Item
View Item