A Machine Learning Approach on Outlier Removal for Decision Tree Regression Method

Sihabuddin, Agus and Rokhman, Nur and Wahyudi, Erwin Eko (2024) A Machine Learning Approach on Outlier Removal for Decision Tree Regression Method. Ingenierie des Systemes d'Information, 29 (4). 1397 -1403. ISSN 16331311

[thumbnail of 95 A Machine Learning ....pdf] Text
95 A Machine Learning ....pdf - Published Version
Restricted to Registered users only

Download (1MB) | Request a copy

Abstract

Outliers can occur in application areas, adversely affecting the prediction method's performance. Outliers can be removed by using robust statistical algorithms. However, statistical methods have limitations in capturing the outlier for high-dimensional data. Approaches using Machine Learning (ML) are offered as they develop rapidly due to their excellent interpretability and strong generalization capabilities. So, ML is popular in detecting or eliminating outliers to increase the accuracy of forecasting methods, such as Isolation Forest (IF), an unsupervised outlier detection strategy using a collective approach to calculate the isolation score for every data point. This research objective is to improve the prediction accuracy of the Decision Tree Regression (DTR) method by proposing an IF as an ML-based outlier removal method. The proposed method was tested by two Air Quality Index (AQI) dataset that contained outliers with Mean Absolute Error (MAE), R-Square, and Root Mean Square Error (RMSE) as the accuracy measurements. The results showed that the proposed method outperforms previous studies.

Item Type: Article
Uncontrolled Keywords: decision tree regression; Isolation Forest; machine learning; outlier removal; supervised learning
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Mathematics and Natural Sciences > Computer Science & Electronics Department
Depositing User: Wiyarsih Wiyarsih
Date Deposited: 16 Apr 2025 07:36
Last Modified: 16 Apr 2025 07:36
URI: https://ir.lib.ugm.ac.id/id/eprint/16114

Actions (login required)

View Item
View Item