Big Data, Computer, and Technology in Language Studies: The Potentials of Sketch Engine in Indonesia's Research

Isti'anah, Arina and Febrina, Ria and Suhandano, Suhandano and Winarti, Daru and Sutrisno, Adi (2023) Big Data, Computer, and Technology in Language Studies: The Potentials of Sketch Engine in Indonesia's Research. In: 2023 International Seminar on Application for Technology of Information and Communication: Smart Technology Based on Industry 4.0: A New Way of Recovery from Global Pandemic and Global Economic Crisis, ISemantic 2023, 16 September 2023, Semarang, Indonesia.

Big_Data_Computer_and_Technology_in_Language_Studies_The_Potentials_of_Sketch_Engine_in_Indonesias_Research.pdf - Published Version
Restricted to Registered users only

Abstract

The emergence of the personal computer and the internet in the last quarter of the 20th century triggered the development of corpus linguistics, which enables researchers to investigate big data in synchronic and diachronic settings. Scholars have developed corpus tools that provide linguistic analysis features such as word lists, keywords, collocations, and concordances. In 2003, the lexicographer Adam Kilgarriff developed Sketch Engine, which provides more than 700 monolingual and multilingual text corpora and lets users build their own corpora, either by crawling data online or by uploading a corpus they have compiled themselves. In the Indonesian context, Sketch Engine has yet to be widely used, even though the tool offers promising opportunities for language research. This paper discusses 1) the linguistic features available in Sketch Engine, 2) recent language research articles that use Sketch Engine, and 3) possible opportunities and barriers for research in Indonesia. To meet these objectives, it provides a conceptual and systematic review of corpus linguistics and Sketch Engine. The analysis found that Indonesian researchers have not yet contributed to the development of Indonesian corpora in Sketch Engine. The possible barriers are 1) the subscription charge, 2) unfamiliarity with the tool in corpus linguistic publications in Indonesia, and 3) the limited understanding and application of corpus linguistics in Indonesian academia. Because Indonesia is a multicultural country, it offers vast opportunities for micro- and macro-linguistic studies, which can help improve research quality in the Indonesian context.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: corpora; corpus linguistics; internet; language; macro-linguistics; micro-linguistics
Subjects: P Language and Literature > P Philology. Linguistics
Divisions: Faculty of Cultural Sciences > Indonesian Literature Department
Depositing User: OKTAVIANA DWI P
Date Deposited: 06 Sep 2024 06:17
Last Modified: 06 Sep 2024 06:17
URI: https://ir.lib.ugm.ac.id/id/eprint/6699
