Search for collections on University of Merdeka Malang Repository

Systematic Literature Review of Data Distribution in Preprocessing Stage with Focus on Outliers

Muslikh, Ahmad Rofiqul ORCID: https://orcid.org/0009-0000-2457-6803, Andono, Pulung Nurtantio, Marjuni, Aris and Santoso, Heru Agus (2023) Systematic Literature Review of Data Distribution in Preprocessing Stage with Focus on Outliers. In: 2023 International Seminar on Application for Technology of Information and Communication (iSemantic), 16-17 September 2023, Semarang, Indonesia.

[thumbnail of Systematic Literature Review of Data Distribution in Preprocessing Stage with Focus on Outliers.pdf]
Preview
Text
Systematic Literature Review of Data Distribution in Preprocessing Stage with Focus on Outliers.pdf

Download (303kB) | Preview
[thumbnail of Similarity Report Systematic Literature Review of Data Distribution in Preprocessing Stage with Focus on Outliers.pdf]
Preview
Text
Similarity Report Systematic Literature Review of Data Distribution in Preprocessing Stage with Focus on Outliers.pdf

Download (341kB) | Preview

Abstract

Data Preprocessing refers to the steps and techniques applied to raw data before it is ready to be analyzed or modeled as a substantive part of the data flow and aims to transform, clean and organize data in a revised way for the quality, relevance and efficiency of subsequent data analysis tasks. Handling outliers in the N2O Emissions Dataset, Fertilizer Prediction and Crop Yield Prediction Dataset is an important step in the data analysis process. The approach taken will depend on the specific context and purpose of the analysis, and it is important to carefully consider the impact of outliers on the results. Using the methods discussed researchers and analysts can effectively identify and treat outliers in the N2O Emissions Dataset, Fertilizer Prediction, Crop Yield Prediction Dataset, and produce more accurate and reliable results. Implemented a systematic literature that involved searching for articles published from 2015 to 2023 for review. The quality of the existing studies used the assessment criteria of 50 relevant studies identified as having been conducted following systematic literature guidelines.

Item Type: Conference or Workshop Item (Paper)
Additional Information: Ahmad Rofikul Muslikh NIDN: 0724038903
Uncontrolled Keywords: SLR, Data Preprocessing, Data Distribution, Outliers
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Fakultas Teknologi Informasi > S1 Sistem Informasi
Depositing User: Rita Juliani
Date Deposited: 13 Mar 2024 06:28
Last Modified: 13 Mar 2024 06:28
URI: https://eprints.unmer.ac.id/id/eprint/4101

Actions (login required)

View Item View Item