Isti'anah, Arina and Febrina, Ria and Suhandano, Suhandano and Winarti, Daru and Sutrisno, Adi (2023) Big Data, Computer, and Technology in Language Studies: The Potentials of Sketch Engine in Indonesia's Research. In: 2023 International Seminar on Application for Technology of Information and Communication: Smart Technology Based on Industry 4.0: A New Way of Recovery from Global Pandemic and Global Economic Crisis, ISemantic 2023, 16 September 2023, Semarang, Indonesia.
Abstract
The emergence of the personal computer and the internet in the last quarter of the 20th century triggered the development of corpus linguistic studies, which enable researchers to investigate big data in synchronic and diachronic settings. Scholars have developed corpus tools that provide linguistic analysis features such as word lists, keywords, collocations, and concordances. In 2003, the lexicographer Adam Kilgarriff developed Sketch Engine, which provides more than 700 text corpora in monolingual and multilingual forms and enables users to create their own corpus by crawling data online or uploading a corpus they have compiled themselves. In the Indonesian context, Sketch Engine has yet to be used widely, even though the tool offers promising opportunities for language research. This paper discusses 1) the linguistic features available in Sketch Engine, 2) recent language research articles utilizing Sketch Engine, and 3) possible opportunities and barriers for Indonesia's research. A conceptual and systematic review of corpus linguistics and Sketch Engine is provided to reach these objectives. The analysis found that Indonesian researchers have not yet contributed to Indonesian corpus development in Sketch Engine. Possible barriers are 1) the subscription charge, 2) unfamiliarity with the tool in corpus linguistics publications in Indonesia, and 3) the lack of understanding and application of corpus linguistics in Indonesian academia. Because Indonesia is a multicultural country, the vast opportunities in micro- and macro-linguistic studies can help improve research quality in the Indonesian context.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | corpora; corpus linguistics; internet; language; macro-linguistics; micro-linguistics |
Subjects: | P Language and Literature > P Philology. Linguistics |
Divisions: | Faculty of Cultural Sciences > Indonesian Literature Department |
Depositing User: | OKTAVIANA DWI P |
Date Deposited: | 06 Sep 2024 06:17 |
Last Modified: | 06 Sep 2024 06:17 |
URI: | https://ir.lib.ugm.ac.id/id/eprint/6699 |