Named entity recognition and dependency parsing for better concept extraction in summary obfuscation detection

Taufiq, Umar and Pulungan, Reza and Suyanto, Yohanes (2023) Named entity recognition and dependency parsing for better concept extraction in summary obfuscation detection. Expert Systems With Applications, 217. ISSN 9574174

[thumbnail of 1366.Named.pdf] Text
1366.Named.pdf
Restricted to Registered users only

Download (1MB) | Request a copy

Abstract

Summary obfuscation is a type of idea plagiarism where a summary of a text document is inserted into another text document so that it is more difficult to detect with ordinary plagiarism detection methods. Various methods have been developed to overcome this problem, one of which is based on genetic algorithms. This paper proposes a new approach for summary obfuscation detection based on named entity recognition and dependency parsing, which is straightforward but accurate and easy to analyze compared to genetic algorithm-based methods. The proposed method successfully detects summary obfuscation at the document level more accurately than existing genetic algorithm-based methods. Our method produced accuracy at sentence level up to more than 84% for specific benchmark and threshold cases. In addition, we have also tested our proposed method on other types of plagiarism, and the resulting accuracy is excellent

Item Type: Article
Additional Information: Library Dosen
Uncontrolled Keywords: Cosine similarity; Dependency parsing; Idea plagiarism; Information anchors; Named entity recognition; Summary obfuscation
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Mathematics and Natural Sciences > Computer Science & Electronics Department
Depositing User: Masrumi Fathurrohmah
Date Deposited: 31 Jul 2024 07:39
Last Modified: 31 Jul 2024 07:39
URI: https://ir.lib.ugm.ac.id/id/eprint/2536

Actions (login required)

View Item
View Item